Gpfs Adm and PRG Ref
Gpfs Adm and PRG Ref
SA23-2221-04
General Parallel File System
SA23-2221-04
Note
Before using this information and the product it supports, read the information in “Notices” on page 473.
| This edition applies to version 3 release 4 of IBM General Parallel File System Multiplatform (program number
| 5724-N94), IBM General Parallel File System for POWER® (program number 5765-G66), and to all subsequent
| releases and modifications until otherwise indicated in new editions. Significant changes or additions to the text
| and illustrations are indicated by a vertical line (|) to the left of the change.
IBM welcomes your comments; see the topic “How to send your comments” on page x. When you send
information to IBM, you grant IBM a nonexclusive right to use or distribute the information in any way it believes
appropriate without incurring any obligation to you.
© Copyright IBM Corporation 1998, 2010.
US Government Users Restricted Rights – Use, duplication or disclosure restricted by GSA ADP Schedule Contract
with IBM Corp.
Contents
Tables . . . . . . . . . . . . . . . vii Backing up a file system using the mmbackup
command . . . . . . . . . . . . . . 28
About this information . . . . . . . . ix Backing up a file system using the GPFS policy
engine . . . . . . . . . . . . . . . 29
Who should read this information . . . . . . . ix
Backing up file system configuration information 30
Conventions used in this information . . . . . . ix
Setting up Tivoli Storage Manager for use by
Prerequisite and related information . . . . . . x
GPFS . . . . . . . . . . . . . . . 30
How to send your comments . . . . . . . . . x
Defining include-exclude options . . . . . . 31
Using APIs to develop backup applications. . . 31
Summary of changes . . . . . . . . . xi
Chapter 4. Managing disks . . . . . . 33
Chapter 1. Performing GPFS Displaying disks in a GPFS cluster . . . . . . 33
administration tasks . . . . . . . . . 1 Adding disks to a file system . . . . . . . . 34
Requirements for administering a GPFS file system . 1 Deleting disks from a file system . . . . . . . 35
adminMode configuration parameter . . . . . 2 Replacing disks in a GPFS file system . . . . . 36
Common GPFS command principles . . . . . . 3 Additional considerations for managing disks . . . 38
Specifying nodes as input to GPFS commands . . 3 Displaying GPFS disk states . . . . . . . . . 38
Disk availability . . . . . . . . . . . . 38
Chapter 2. Managing your GPFS cluster 5 Disk status . . . . . . . . . . . . . 38
Creating your GPFS cluster . . . . . . . . . 5 Changing GPFS disk states and parameters. . . . 39
Displaying GPFS cluster configuration information . 5 Changing your NSD configuration. . . . . . . 40
Adding nodes to a GPFS cluster . . . . . . . . 6 Changing NSD server usage and failback . . . . 41
Deleting nodes from a GPFS cluster . . . . . . 7 Enabling and disabling Persistent Reserve . . . . 42
Changing the GPFS cluster configuration data . . . 8
Node quorum considerations . . . . . . . . 13 Chapter 5. Managing GPFS quotas. . . 43
Node quorum with tiebreaker considerations . . . 13 Enabling and disabling GPFS quota management. . 43
Displaying and changing the file system manager Default quotas . . . . . . . . . . . . . 44
node. . . . . . . . . . . . . . . . . 13 Explicitly establishing and changing quotas . . . 45
Determining how long mmrestripefs takes to Checking quotas. . . . . . . . . . . . . 45
complete . . . . . . . . . . . . . . . 14 Listing quotas . . . . . . . . . . . . . 46
Starting and stopping GPFS . . . . . . . . . 14 Activating quota limit checking. . . . . . . . 47
Deactivating quota limit checking . . . . . . . 47
Chapter 3. Managing file systems . . . 17 Creating file system quota reports . . . . . . . 48
Mounting a file system . . . . . . . . . . 17 Restoring quota files . . . . . . . . . . . 49
Mounting a file system on multiple nodes . . . 17
GPFS-specific mount options . . . . . . . 18 Chapter 6. Managing GPFS access
Unmounting a file system . . . . . . . . . 19 control lists and NFS export . . . . . 51
Unmounting a file system on multiple nodes . . 19 Traditional GPFS ACL administration. . . . . . 51
Deleting a file system . . . . . . . . . . . 19 Setting traditional GPFS access control lists . . . 52
Determining which nodes have a file system Displaying traditional GPFS access control lists 53
mounted . . . . . . . . . . . . . . . 19 Applying an existing traditional GPFS access
Checking and repairing a file system . . . . . . 20 control list . . . . . . . . . . . . . . 53
Listing file system attributes . . . . . . . . . 21 Changing traditional GPFS access control lists . . 54
Modifying file system attributes . . . . . . . 22 Deleting traditional GPFS access control lists . . 54
Querying and changing file replication attributes . . 23 NFS V4 ACL administration . . . . . . . . . 54
Querying file replication . . . . . . . . . 23 NFS V4 ACL Syntax . . . . . . . . . . 55
Changing file replication attributes . . . . . 23 NFS V4 ACL translation . . . . . . . . . 57
Using Direct I/O on a file in a GPFS file system . . 24 Setting NFS V4 access control lists . . . . . . 57
Restriping a GPFS file system . . . . . . . . 24 Displaying NFS V4 access control lists . . . . 58
Querying file system space . . . . . . . . . 25 Applying an existing NFS V4 access control list 58
Querying and reducing file system fragmentation . 26 Changing NFS V4 access control lists . . . . . 58
Querying file system fragmentation . . . . . 26 Deleting NFS V4 access control lists . . . . . 58
Reducing file system fragmentation . . . . . 27 Considerations when using GPFS with NFS V4
Backing up a file system . . . . . . . . . . 28 ACLs . . . . . . . . . . . . . . . 59
NFS and GPFS . . . . . . . . . . . . . 59
Contents v
vi GPFS: Administration and Programming Reference
Tables
1. Conventions . . . . . . . . . . . . ix 5. Wildcard characters . . . . . . . . . . 31
| 2. New, changed, and deleted messages xiii 6. Removal of a file with ACL entries DELETE
3. Configuration attributes on the mmchconfig and DELETE_CHILD . . . . . . . . . 56
command . . . . . . . . . . . . . 9 7. GPFS commands . . . . . . . . . . . 65
4. Options in the dsm.sys file for controlling 8. GPFS programming interfaces . . . . . . 321
backup processing . . . . . . . . . . 31 9. GPFS user exits . . . . . . . . . . . 463
This edition applies to GPFS version 3.4 for AIX®, Linux®, and Windows®.
To find out which version of GPFS is running on a particular AIX node, enter:
lslpp -l gpfs\*
To find out which version of GPFS is running on a particular Linux node, enter:
rpm -qa | grep gpfs
| To find out which version of GPFS is running on a particular Windows node, open the Programs and
| Features control panel. The IBM® General Parallel File System installed program name includes the
| version number.
Throughout this information you will see various command and component names beginning with the
prefix mm. This is not an error. GPFS shares many components with the related products IBM
Multi-Media Server and IBM Video Charger.
GPFS for Windows note: UNIX® file name conventions are used throughout this information. For
example, the GPFS cluster configuration data is stored in the /var/mmfs/gen/mmsdrfs file. On Windows,
the UNIX name space starts under the %SystemRoot%\SUA directory, and UNIX-style file names need
to be converted accordingly. For example, the cluster configuration file mentioned above is
C:\Windows\SUA\var\mmfs\gen\mmsdrfs.
Table 1. Conventions
Convention Usage
bold bold words or characters represent system elements that you must use literally, such as
commands, flags, path names, directories, file names, values, and selected menu options.
bold underlined Bold underlined keywords are defaults. These take effect if you do not specify a different
keyword.
constant width Examples and information that the system displays appear in constant-width typeface.
For the latest support information, see the GPFS Frequently Asked Questions at (http://
publib.boulder.ibm.com/infocenter/clresctr/vxrx/index.jsp?topic=/com.ibm.cluster.gpfs.doc/gpfs_faqs/
gpfsclustersfaq.html).
Include the publication title and order number, and, if applicable, the specific location of the information
about which you have comments (for example, a page number or a table number).
To contact the GPFS development organization, send your comments to the following e-mail address:
| Important: Nodes running GPFS 3.4 can coexist and interoperate with nodes running GPFS 3.3. GPFS
3.4 is not compatible with GPFS 3.2 or earlier versions. After you install GPFS 3.4 on some of the nodes
in the cluster, nodes that are still running GPFS 3.2 or earlier will not be able to join the cluster. Similarly,
in a multicluster environment in which clusters are remotely mounting file systems from other clusters, if
| one of the clusters is migrated to GPFS 3.4, the remaining clusters are expected to be at GPFS 3.3 or later.
| Changes to the GPFS licensed program for version 3, release 4 include the following:
| v Enhanced Windows cluster support:
| – Windows Server 2008 R2 x64
| – Directly attached disks support higher bandwidth via local I/O
| The use of fully SAN-connected Windows clusters may enable much higher bandwidth to Windows
| systems using GPFS, and SAN connectivity may provide greater flexibility in configuring GPFS
| clusters.
| – Homogeneous Windows clusters
| GPFS clusters can now be formed using only Windows nodes; Linux or AIX are no longer required
| as NSD servers. The Windows nodes can perform most of the required management and
| administrative operations. The exceptions include:
| - Certain GPFS commands to apply policy, administer quotas and ACLs.
| - The ability to mount DMAPI-enabled file systems.
| - Support for Tivoli® Storage Manager (TSM) Backup Archive client or the native Windows Backup
| utility.
| v Performance and scaling improvements:
| Extended attributes for a file can now be stored in the file's inode. This can improve the performance
| of applications that use extended attributes, and reduce the amount of disk space required to store
| them. This change enables the high-performance GPFS policy engine to refer to extended attributes
| with the same scale-out performance as the regular file attributes, providing richer semantics to content
| managers and middleware.
| v Migration improvements include the new mmmigratefs command, which you can use for migrating
| file systems.
| v Diagnostic improvements include the new mmdiag command, which better enables IBM Service to
| determine the state of your GPFS cluster.
| v Support for more than 2,147,483,648 files in a file system. For the current maximum tested limit, see the
| GPFS FAQ at:
| http://publib.boulder.ibm.com/infocenter/clresctr/vxrx/topic/com.ibm.cluster.gpfs.doc/gpfs_faqs/
| gpfs_faqs.html
| v Withdrawal of support for the following:
| – data shipping mode
| – 32-bit AIX kernels
| – the High Performance Switch
| – IBM Virtual Shared Disks
| New file systems must be created using network shared disks (NSDs) only.
| The "Readers' Comments - We'd Like to Hear from You" section at the back of this publication has been
| replaced with the topic “How to send your comments” on page x.
For information on getting started with GPFS, see the General Parallel File System: Concepts, Planning, and
Installation Guide. This includes:
1. Installing GPFS
2. GPFS cluster creation considerations
3. Configuring and tuning your system for GPFS
4. Starting GPFS
5. Network Shared Disk creation considerations
6. File system creation considerations
This guide covers the administration and maintenance of GPFS and your file systems, and includes the
following topics:
1. “Requirements for administering a GPFS file system” and “Common GPFS command principles” on
page 3
2. Chapter 2, “Managing your GPFS cluster,” on page 5
3. Chapter 3, “Managing file systems,” on page 17
4. Chapter 4, “Managing disks,” on page 33
5. Chapter 5, “Managing GPFS quotas,” on page 43
6. Chapter 6, “Managing GPFS access control lists and NFS export,” on page 51
7. Chapter 7, “Communicating file access patterns to GPFS,” on page 63
8. Chapter 8, “GPFS commands,” on page 65
9. Chapter 9, “GPFS programming interfaces,” on page 321
10. Chapter 10, “GPFS user exits,” on page 463
11. Chapter 11, “Considerations for GPFS applications,” on page 467
12. Chapter 12, “File system format changes between versions of GPFS,” on page 469
For more advanced GPFS administration topics, see General Parallel File System: Advanced Administration
Guide.
| On Windows, root authority normally means users in the Administrators group. However, for clusters
| with both Windows and UNIX nodes, only the special Active Directory domain user root qualifies as
| having root authority for the purposes of administering GPFS. See the Installing GPFS prerequisites topic in
| the GPFS: Concepts, Planning, and Installation Guide for additional details.
The GPFS commands are designed to maintain the appropriate environment across all nodes in the
cluster. To achieve this, the GPFS commands utilize the remote shell and remote file copy commands that
you specify on either the mmcrcluster or the mmchcluster command.
In principle, you can issue GPFS administration commands from any node in the cluster. The nodes that
you plan to use for administering GPFS must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages.
Similarly, the nodes on which the GPFS commands are issued must be able to copy files to and from any
other node in the cluster without the use of a password and without producing any extraneous messages.
The way the passwordless access is achieved depends on the particular remote execution program and
authentication mechanism being used. For example, for rsh and rcp, you might need a properly
configured .rhosts file in the root user's home directory on each node in the GPFS cluster. If the remote
program is ssh, you may use private identity files that do not have a password. Or, if the identity file is
password protected, you can use the ssh-agent utility to establish an authorized session before issuing
mm commands.
GPFS does not need to know which nodes are being used for administration purposes. It is the
administrator's responsibility to issue mm commands only from nodes that are properly configured and
can access the rest of the nodes in the cluster.
Note: The rsh and rcp commands provided by the Windows SUA environment do not support GPFS. If
your cluster will include Windows nodes, you have to designate ssh and scp as the remote
communication program.
Note List:
1. If the GPFS cluster is configured to support Clustered NFS (CNFS), all CNFS member nodes must
belong to the subset of nodes that are able to execute remote commands without the need of a
password.
| Clusters created with the GPFS 3.3 or later level of the code have adminMode set to central by default.
| Clusters migrated from GPFS 3.2 or earlier versions will continue to operate as before and will have
| adminMode set to allToAll.
You can change the mode of operations at any time with the help of the mmchconfig command. For
example, to switch the mode of administration from allToAll to central, issue:
mmchconfig adminMode=central
Use the mmlsconfig command to display the mode of administration currently in effect for the cluster.
For commands operating on a file system, the stripe group manager node is always implicitly included in
the node list. Not every GPFS command supports all of the above node specification options. To learn
what kinds of node specifications are supported by a particular GPFS command, see the relevant
command description in Chapter 8, “GPFS commands,” on page 65.
See the General Parallel File System: Concepts, Planning, and Installation Guide for details on how GPFS
clusters are created and used.
See the “mmlscluster Command” on page 228 for complete usage information.
You must follow these rules when adding nodes to a GPFS cluster:
v You may issue the command only from a node that already belongs to the GPFS cluster.
v A node may belong to only one GPFS cluster at a time.
v The nodes must be available for the command to be successful. If any of the nodes listed are not
available when the command is issued, a message listing those nodes is displayed. You must correct
the problem on each node and reissue the command to add those nodes.
v After the nodes are added to the cluster, you must use the mmchlicense command to designate
appropriate GPFS licenses to the new nodes.
See the “mmaddnode Command” on page 77 and the “mmlscluster Command” on page 228 for complete
usage information.
where nodes_to_delete contains the nodes k164n01 and k164n02. The system displays information
similar to:
Verifying GPFS is stopped on all affected nodes ...
mmdelnode: Command successfully completed
mmdelnode: 6027-1371 Propagating the cluster configuration data to
all affected nodes. This is an asynchronous process.
See the “mmdelnode Command” on page 182 and the “mmlscluster Command” on page 228 for
complete usage information.
Exercise caution when shutting down GPFS on quorum nodes or deleting quorum nodes from the GPFS
cluster. If the number of remaining quorum nodes falls below the requirement for a quorum, you will be
unable to perform file system operations. See the General Parallel File System: Concepts, Planning, and
Installation Guide and search on quorum.
For example, to change the primary server for the GPFS cluster data, enter:
mmchcluster -p k164n06
Attention: The mmchcluster command, when issued with either the -p or -s option, is designed to
operate in an environment where the current primary and secondary GPFS cluster configuration servers
are not available. As a result, the command can run without obtaining its regular serialization locks. To
assure smooth transition to a new cluster configuration server, no other GPFS commands (mm...
commands) should be running when the command is issued nor should any other command be issued
until the mmchcluster command has successfully completed.
See the “mmchcluster Command” on page 99 and the “mmlscluster Command” on page 228 for complete
usage information.
You may be able to tune your cluster for better performance by re-configuring one or more attributes.
Before you change any attributes, consider how the changes will affect the operation of GPFS. For a
detailed discussion see the General Parallel File System: Concepts, Planning, and Installation Guide and the
mmcrcluster command.
Table 3 details the GPFS cluster configuration attributes which can be changed by issuing the
mmchconfig command. Variations under which these changes take effect are noted:
1. Take effect immediately and are permanent (-i).
2. Take effect immediately but do not persist when GPFS is restarted (-I).
3. Require that the GPFS daemon be stopped on all nodes for the change to take effect.
4. May be applied to only a subset of the nodes in the cluster.
Table 3. Configuration attributes on the mmchconfig command
Attribute name and Description -i option -I option GPFS must List of Change takes
allowed allowed be stopped NodeNames effect
on all nodes allowed
adminMode yes no no no immediately
Specify the nodes you want to target for change and the attributes with their new values on the
mmchconfig command. For example, to change the pagepool value for each node in the GPFS cluster
immediately, enter:
mmchconfig pagepool=100M -i
See the “mmchconfig Command” on page 102 for complete usage information.
For a discussion on node quorum, see General Parallel File System: Concepts, Planning, and Installation Guide
and search on node quorum.
When using node quorum with tiebreaker, define between one, two, or three disks to be used as
tiebreaker disks when any quorum node is down. Issue this command:
mmchconfig tiebreakerDisks="nsdName;nsdName;nsdName"
If you are using node quorum with tiebreaker and want to change to using node quorum, issue this
command:
mmchconfig tiebreakerDisks=DEFAULT
Note:
For a more detailed discussion on the role of the file system manager node, see General Parallel File
System: Concepts, Planning, and Installation Guide and search on Special management functions.
The node that is the file system manager can also be used for applications. In some cases involving very
large clusters or applications that place a high stress on metadata operations, it may be useful to specify
which nodes are used as file system managers. Applications that place a high stress on metadata
operations are usually those that involve large numbers of very small files, or that do very fine-grain
parallel write-sharing among multiple nodes.
You can display the file system manager node by issuing the mmlsmgr command. You can display the
information for an individual file system, a list of file systems, or for all of the file systems in the cluster.
For example, to display the file system manager for the file system fs1, enter:
mmlsmgr fs1
The output shows the device name of the file system and the file system manager's node number and
name:
See the “mmlsmgr Command” on page 244 for complete usage information.
You can change the file system manager node for an individual file system by issuing the mmchmgr
command. For example, to change the file system manager node for the file system fs1 to k145n32, enter:
mmchmgr fs1 k145n32
The output shows the file system manager's node number and name, in parentheses, as recorded in the
GPFS cluster data:
GPFS: 6027-628 Sending migrate request to current manager node 19.134.68.69 (k145n30).
GPFS: 6027-629 Node 19.134.68.69 (k145n30) resigned as manager for fs1.
GPFS: 6027-630 Node 19.134.68.70 (k145n32) appointed as manager for fs1.
See the “mmchmgr Command” on page 128 for complete usage information.
To determine how long the mmrestripefs command will take to complete, consider these points:
1. How much data is to be moved by issuing the df -k command.
2. How many GPFS client nodes there are to do the work.
| 3. How much Network Shared Disk (NSD) server bandwidth is available for I/O.
4. If you have added new disks to a file system, after the disks have been added, determine how much
free space is on each of the new disks by issuing the mmdf Device command.
The restriping of a file system is done by having multiple threads on each node in the cluster work on a
subset of files. If the files are large, multiple nodes can participate in restriping it in parallel.
Consequently, the more GPFS client nodes there are performing work for the restripe, the faster the
mmrestripefs command will complete. The nodes that should participate in the restripe are specified on
the command using the -N parameter. Based on raw I/O rates, you should be able to estimate the length
of time for the restripe. However, to account for the overhead of scanning all metadata, that value should
be doubled.
Assuming that you have enough nodes to saturate the disk servers, and have to move all of the data, the
time to read and write every block of data is roughly:
2 * fileSystemSize / averageDiskserverDataRate
As an upper bound, due to overhead of scanning all of the metadata, this time should be doubled. If
other jobs are loading the NSD servers heavily, this time may increase even more.
Note: There is no particular reason to stop all other jobs while the mmrestripefs command is running.
The CPU load of the command is minimal on each node and only the files that are being restriped at any
moment are locked to maintain data integrity.
For existing GPFS clusters, before starting GPFS, ensure that you have:
1. Verified the installation of all prerequisite software.
Start the daemons on all of the nodes in the cluster by issuing the mmstartup -a command:
mmstartup -a
Check the messages recorded in /var/adm/ras/mmfs.log.latest on one node for verification. Look for
messages similar to this:
GPFS: 6027-300 mmfsd ready
This indicates that quorum has been formed and this node has successfully joined the cluster, and is now
ready to mount file systems.
If GPFS does not start, see the GPFS: Problem Determination Guide and search on GPFS daemon will not
come up.
See the “mmstartup Command” on page 307 for complete usage information.
If it becomes necessary to stop GPFS, you can do so from the command line by issuing the mmshutdown
command:
mmshutdown -a
See the “mmshutdown Command” on page 302 for complete usage information.
Managing filesets, storage pools and policies is also a file system management task. See the GPFS:
Advanced Administration Guide for more information.
If you allowed the default value for the automatic mount option (-A yes) when you created the file
system, then you do not need to use this procedure after restarting GPFS on the nodes.
where device is the name of the file system. For example, to mount the file system fs1, enter:
mmmount fs1
To mount a file system only on a specific set of nodes, use the -N flag of the mmmount command.
All of the mount options can be specified using the -o parameter. Multiple options should be separated
only by a comma. If an option is specified multiple times, the last instance is the one that takes effect.
Certain options can also be set with specifically designated command flags. Unless otherwise stated,
mount options can be specified as:
The option={1 | 0 | yes | no} syntax should be used for options that can be intercepted by the mount
command and not passed through to GPFS. An example is the atime option in the Linux environment.
If the file system will not unmount, see the General Parallel File System: Problem Determination Guide and
search for file system will not unmount.
where device is the name of the file system. For example, to unmount the file system fs1, enter:
mmumount fs1
To unmount a file system only on a specific set of nodes, use the -N flag of the mmumount command.
Specify the file system to be deleted on the mmdelfs command. For example, to delete the file system
fs1, enter:
mmdelfs fs1
Note that the mmlsmount -L command reports file systems that are in use at the time the command is
issued. A file system is considered to be in use if it is explicitly mounted with the mount or mmmount
This is an example of a mmlsmount -L command for a mounted file system named fs1:
File system fs1 (mnsd.cluster:fs1) is mounted on 5 nodes:
9.114.132.101 c5n101 mnsd.cluster
9.114.132.100 c5n100 mnsd.cluster
9.114.132.106 c5n106 mnsd.cluster
9.114.132.97 c5n97 cluster1.cluster
9.114.132.92 c5n92 cluster1.cluster
The online mode operates on a mounted file system and is chosen by issuing the -o option. Conversely,
the offline mode operates on an unmounted file system. In general it is unnecessary to run mmfsck in
offline mode unless under the direction of the IBM Support Center.
The online mode checks and recovers unallocated blocks on a mounted file system. If a GPFS file
operation fails due to an out of space condition, the cause may be disk blocks that have become
unavailable after repeated node failures. The corrective action taken is to mark the block free in the
allocation map. Any other inconsistencies found are only reported, not repaired.
Note:
1. If you are running the online mmfsck command to free allocated blocks that do not belong to any
files, plan to make file system repairs when system demand is low. This is I/O intensive activity and
it can affect system performance.
2. If you are repairing a file system due to node failure and the file system has quotas enabled, it is
suggested that you run the mmcheckquota command to recreate the quota files.
To repair any other inconsistencies, you must run the offline mode of the mmfsck command on an
unmounted file system. The offline mode checks for these file inconsistencies that might cause problems:
v Blocks marked allocated that do not belong to any file. The corrective action is to mark the block free
in the allocation map.
v Files and directories for which an inode is allocated and no directory entry exists, known as orphaned
files. The corrective action is to create directory entries for these files in a lost+found subdirectory in
the root directory of the fileset to which the file or directory belongs. A fileset is a subtree of a file
system namespace that in many respects behaves like an independent file system. The index number of
the inode is assigned as the name. If you do not allow the mmfsck command to reattach an orphaned
file, it asks for permission to delete the file.
v Directory entries pointing to an inode that is not allocated. The corrective action is to remove the
directory entry.
v Incorrectly formed directory entries. A directory file contains the inode number and the generation
number of the file to which it refers. When the generation number in the directory does not match the
generation number stored in the file's inode, the corrective action is to remove the directory entry.
v Incorrect link counts on files and directories. The corrective action is to update them with accurate
counts.
v Policy files that are not valid. The corrective action is to delete the file.
The mmfsck command performs other functions not listed here, as deemed necessary by GPFS.
You cannot run the mmfsck command on a file system that has disks in a down state. You must first run
the mmchdisk command to change the state of the disks to unrecovered or up. To display the status of
the disks in the file system, issue the mmlsdisk command.
For example, to check the file system fs1 without making any changes to the file system, enter:
mmfsck fs1
1212416 inodes
87560 allocated
0 repairable
0 repaired
0 damaged
0 deallocated
0 orphaned
0 attached
7211746 subblocks
227650 allocated
0 unreferenced
0 deletable
0 deallocated
144504 addresses
0 suspended
See the “mmchdisk Command” on page 112, “mmcheckquota Command” on page 116, “mmfsck
Command” on page 204, and “mmlsdisk Command” on page 232 for complete usage information.
If you specify no options with the mmlsfs command, all file system attributes are listed.
| For example, to list all of the attributes for the file system gpfs2, enter:
Note that some of the attributes displayed by the mmlsfs command represent default mount options.
Since the scope of mount options is an individual node, it is possible to have different values on different
nodes. For exact mtime (-E option) and suppressed atime (-S option), the information displayed by the
mmlsfs command represents the current setting on the file system manager node. If these options are
changed with the mmchfs command, the change may not be reflected until the file system is remounted.
See the “mmlsfs Command” on page 238 for complete usage information. See the General Parallel File
System: Concepts, Planning, and Installation Guide and search on GPFS architecture and file system creation
considerations for a detailed discussion of file system attributes.
Note: All files created after issuing the mmchfs command take on the new attributes. Existing files are
not affected. Use the mmchattr or mmrestripefs -R command to change the replication factor of existing
files. See “Querying and changing file replication attributes” on page 23.
For example, to change the default data replication factor to 2 for the file system fs1, enter:
mmchfs fs1 -r 2
See the “mmchfs Command” on page 121 and the “mmlsfs Command” on page 238 for complete usage
information. See the General Parallel File System: Concepts, Planning, and Installation Guide and search on
GPFS architecture and file system creation considerations for a detailed discussion of file system attributes.
For complete usage information, see the “mmlsattr Command” on page 224 and the “mmchattr
Command” on page 96.
See the “mmlsattr Command” on page 224 for complete usage information. See the General Parallel File
System: Concepts, Planning, and Installation Guide and search on GPFS architecture and file system creation
considerations for a detailed discussion of file system attributes.
You can only increase data and metadata replication as high as the maximum data and maximum
metadata replication factors for that file system. You cannot change the maximum data and maximum
metadata replication factors once the file system has been created.
Specify the file name, attribute, and new value with the mmchattr command. For example, to change the
metadata replication factor to 2 and the data replication factor to 2 for the file named project7.resource in
the file system fs1, enter:
mmchattr -m 2 -r 2 /fs1/project7.resource
This caching policy bypasses file cache and transfers data directly from disk into the user space buffer, as
opposed to using the normal cache policy of placing pages in kernel memory. Applications with poor
cache hit rates or very large I/Os may benefit from the use of Direct I/O.
Direct I/O may also be specified by supplying the O_DIRECT file access mode on the open() of the file.
Restriping offers the opportunity to specify useful options in addition to rebalancing (-b option).
Re-replicating (-r or -R option) provides for proper replication of all data and metadata. If you use
replication, this option is useful to protect against additional failures after losing a disk. For example, if
you use a replication factor of 2 and one of your disks fails, only a single copy of the data would remain.
If another disk then failed before the first failed disk was replaced, some data might be lost. If you expect
delays in replacing the failed disk, you could protect against data loss by suspending the failed disk
using the mmchdisk command and re-replicating. This would assure that all data existed in two copies
on operational disks.
If files are assigned to one storage pool, but with data in a different pool, the placement (-p) option will
migrate their data to the correct pool. Such files are referred to as ill-placed. Utilities, such as the
mmchattr command or policy engine, may change a file's storage pool assignment, but not move the
data. The mmrestripefs command may then be invoked to migrate all of the data at once, rather than
migrating each file individually. Note that the rebalance (-b) option also performs data placement on all
files, whereas the placement (-p) option rebalances only the files that it moves.
If you do not replicate all of your files, the migrate (-m) option is useful to protect against data loss when
you have an advance warning that a disk may be about to fail, for example, when the error logs show an
excessive number of I/O errors on a disk. Suspending the disk and issuing the mmrestripefs command
with the -m option is the quickest way to migrate only the data that would be lost if the disk failed.
If you do not use replication, the -m and -r options are equivalent; their behavior differs only on
replicated files. After a successful re-replicate (-r option) all suspended disks are empty. A migrate
operation, using the -m option, leaves data on a suspended disk as long as at least one other replica of
the data remains on a disk that is not suspended. Restriping a file system includes re-replicating it; the -b
option performs all the operations of the -m and -r options.
Consider the necessity of restriping and the current demands on the system. New data which is added to
the file system is correctly striped. Restriping a large file system requires extensive data copying and may
affect system performance. Plan to perform this task when system demand is low.
If you are sure you want to proceed with the restripe operation:
1. Use the mmchdisk command to suspend any disks to which you do not want the file system
restriped. You may want to exclude disks from file system restriping because they are failing. See
“Changing GPFS disk states and parameters” on page 39.
24 GPFS: Administration and Programming Reference
2. Use the mmlsdisk command to assure that all disk devices to which you do want the file system
restriped are in the up/normal state. See “Displaying GPFS disk states” on page 38.
Specify the target file system with the mmrestripefs command. For example, to rebalance (-b option) file
system fs2 after adding an additional RAID device, enter:
mmrestripefs fs2 -b
Note: Rebalancing of files is an I/O intensive and time consuming operation, and is important only for
file systems with large files that are mostly invariant. In many cases, normal file update and creation will
rebalance your file system over time, without the cost of the rebalancing.
See the “mmrestripefs Command” on page 295 for complete usage information.
Note: The mmdf command may require considerable metadata I/O, and should be run when the system
load is light.
Specify the file system you want to query with the mmdf command. For example, to query available
space on all disks in the file system fs1, enter:
mmdf fs1
Disks in storage pool: fs1sp1 (Maximum disk size allowed is 122 GB)
hd30n01 8897968 8 no yes 8895488 (100%) 424 ( 0%)
Inode Information
-----------------
Number of used inodes: 9799
Number of free inodes: 4990393
Number of allocated inodes: 5000192
Maximum number of inodes: 5000192
See the “mmdf Command” on page 189 for complete usage information.
In order to write to a file system, free full blocks of disk space are required. Due to fragmentation, it is
entirely possible to have the situation where the file system is not full, but an insufficient number of free
full blocks are available to write to the file system. Replication can also cause the copy of the fragment to
be distributed among disks in different failure groups. The mmdefragfs command can be used to query
the current fragmented state of the file system and reduce the fragmentation of the file system.
In order to reduce the fragmentation of a file system, the mmdefragfs command migrates fragments to
free space in another fragmented disk block of sufficient space, thus creating a free full block. There is no
requirement to have a free full block in order to run the mmdefragfs command. The execution time of
the mmdefragfs command depends on the size and allocation pattern of the file system. For a file system
with a large number of disks, the mmdefragfs command will run through several iterations of its
algorithm, each iteration compressing a different set of disks. Execution time is also dependent on how
fragmented the file system is. The less fragmented a file system, the shorter time for the mmdefragfs
command to execute.
The fragmentation of a file system can be reduced on all disks which are not suspended or stopped. If a
disk is suspended or stopped, the state of the disk, not the utilization information, will be displayed as
output for the mmdefragfs command.
The mmdefragfs command can be run on both a mounted or an unmounted file system, but achieves
best results on an unmounted file system. Running the command on a mounted file system can cause
conflicting allocation information and consequent retries to find a new free subblock of the correct size to
store the fragment in.
See the “mmdefragfs Command” on page 167 for complete usage information.
See the “mmdefragfs Command” on page 167 for complete usage information.
You can use the mmbackup command to back up the files of a GPFS file system to a backup Tivoli
Storage Manager (TSM) server.
Alternatively, you can utilize the GPFS policy engine (mmapplypolicy command) to generate lists of files
to be backed up and provide them as input to some other external storage manager.
The file system configuration information can be backed up using the mmbackupconfig command.
Note: Windows nodes do not support the mmbackup and mmapplypolicy commands.
The mmbackup command supports backing up of GPFS file system data to multiple Tivoli Storage
Manager servers. The ability to partition file backups across multiple TSM servers is particularly useful
for installations that have a large number of files. For information on setting up multiple TSM servers,
see “Tivoli Storage Manager requirements” on page 29.
Unless otherwise specified, the mmbackup command backs up the current active version of the GPFS file
system. If you want to create a backup of the file system at a specific time, you can use the
mmcrsnapshot command to create a snapshot of the file system and then specify that snapshot on the
mmbackup -S option.
If an unlinked fileset is detected, the mmbackup processing will issue an error message and exit. You can
force the backup operation to proceed by specifying the mmbackup -f option. In this case, files that
belong to unlinked filesets will not be backed up, but will be removed from the expire list.
| GPFS 3.3 and later versions of the mmbackup command will preserve this type of processing for
incremental backups until a new full backup is performed. Once a full backup is performed, mmbackup
will store the files under their GPFS root directory path name; all files under /Device/.snapshots/
.mmbuSnapshot will be marked for expiration.
Certain features, such as backing up a directory or backing up from an arbitrary snapshot, cannot be
| used until a full backup is performed with the GPFS 3.3 or later version of the mmbackup command.
For more information on GPFS policies and rules refer to Information Lifecycle Management for GPFS in the
GPFS: Advanced Administration Guide.
The output file generated by the mmbackupconfig command is used as input to the mmrestoreconfig
command.
Note: The mmbackupconfig command only backs up the file system configuration information. It does
not back up any user data or individual file attributes.
The mmbackup command requires a working Tivoli Storage Manager (TSM) environment:
1. The TSM client code must be installed on the node that will run the mmbackup command. More than
one TSM client node can be specified with the mmbackup command to enhance performance. You
must ensure that TSM clients and the include-exclude list are set up correctly before executing the
mmbackup command. Both the dsm.sys and dsm.opt configuration files are required and are present
in the TSM configuration directory. Only root users and authorized users with write permission to
these files can change them. The lines changed in the dsm.opt and dsm.sys files to enable mmbackup
to work with TSM must be changed in these files on all of the TSM client nodes used by mmbackup.
For example, the dsm.opt file may be:
SErvername k164n06.kgn.ibm.com
After Tivoli Storage Manager evaluates all exclude.fs and exclude.dir statements, the following options
are evaluated against the remaining list of objects available for processing.
Table 4. Options in the dsm.sys file for controlling backup processing
Options Descriptions
exclude These options are equivalent. Use these options to exclude a file or group of files from backup
exclude.backup services and space management services (if the HSM client is installed). The exclude.backup
exclude.file option only excludes files from normal backup, but not from HSM.
exclude.file.backup
include Use these options to include files or assign management classes for backup processing.
include.backup
include.file
To specify groups of files that you want to include or exclude, use the wildcard characters listed in
Table 5. This table applies to include and exclude statements only.
Note: A very large include-exclude list can decrease backup performance. Use wildcards and eliminate
unnecessary include statements to keep the list as short as possible.
Table 5. Wildcard characters
Character Function
* The match-all character. For example:
v The pattern ab*, matches ab, abb, abxxx, but does not match a, b, aa, bb.
v The pattern ab*rs, matches abrs, abtrs, abrsrs, but does not match ars, or aabrs, abrss.
v The pattern ab*ef*rs, matches abefrs, abefghrs, but does not match abefr, abers.
v The pattern abcd.*, matches abcd.c, abcd.txt, but does not match abcd, abcdc, or abcdtxt.
The gpfs_ireadx() subroutine is more efficient than read() or gpfs_iread() for sparse files and for
incremental backups. The gpfs_ireaddir() or gpfs_ireaddir64() subroutine is more efficient than readdir(),
because it returns file type information. There are also subroutines for reading symbolic links,
gpfs_ireadlink() or gpfs_ireadlink64() and for accessing file attributes, gpfs_igetattrs().
IBM has supplied a backup application program, tsbackup.C, to serve as an example. This example is
located in /usr/lpp/mmfs/samples/util and consists of these files:
v tsbackup.C - A utility for backing up a GPFS file system to a TSM server using TSM clients.
v tsbackup.h - A header file containing necessary declarations.
Note: A LUN provided by a storage subsystem is a disk for the purposes of this documentation, even if
the LUN is made up of multiple physical disks.
The default is to display information for all disks defined to the cluster (-a). Otherwise, you may choose
to display the information for a particular file system (-f) or for all disks which do not belong to any file
system (-F).
To display the default information for all of the NSDs belonging to the cluster, enter:
mmlsnsd
To find out the local device names for the disks, use the mmlsnsd command with the -m option. For
example, issuing mmlsnsd -m produces output similar to this:
Storage in a file system is divided in storage pools. The maximum size of any one disk that can be added
to an existing storage pool is set approximately to the sum of the disk sizes when the storage pool is
created. The actual value is shown in the mmdf command output.
Once a storage pool is created, the maximum size cannot be altered. However, you can create a new pool
with larger disks, and then move data from the old pool to the new one.
When establishing a storage pool and when adding disks later to an existing storage pool, you should try
to keep the sizes of the disks fairly uniform. GPFS allocates blocks round robin, and as the utilization
level rises on all disks, the small ones will fill up first and all files created after that will be spread across
fewer disks, which reduces the amount of prefetch that can be done for those files.
For example, to add the disk gpfs2nsd to the file system fs2, have it join failure group 1 in the storage
pool system, and rebalance the existing files to take advantage of the added space, enter:
mmadddisk fs2 gpfs2nsd::::1 -r
Note: Rebalancing of files is an I/O intensive and time consuming operation, and is important only for
file systems with large files that are mostly invariant. In many cases, normal file update and creation will
| rebalance your file system over time, without the cost of the rebalancing.
Note: See “Querying file system space” on page 25 for more information about diagnosing space
problems.
Consider how fragmentation may increase your storage requirements, especially when the file system
contains a large number of small files. A margin of 150 percent of the size of the disks being deleted
should be sufficient to allow for fragmentation when small files predominate. For example, in order to
| delete a 400 GB disk from your file system, which contains user home directories with small files, you
| should first determine that the other disks in the file system contain a total of 600 GB of free space.
If you do not replicate your file system data, you should rebalance the file system using the mmrestripefs
-b command. If you replicate your file system data, run the mmrestripefs -r command after the disk has
been deleted. This ensures that all data will still exist with correct replication after the disk is deleted. The
mmdeldisk command only migrates data that would otherwise be lost, not data that will be left in a
single copy.
Note: Rebalancing of files is an I/O intensive and time consuming operation, and is important only for
file systems with large files that are mostly invariant. In many cases, normal file update and creation will
rebalance your file system over time, without the cost of the rebalancing.
Do not delete stopped disks, if at all possible. Start any stopped disk before attempting to delete it from
the file system. If the disk cannot be started you will have to consider it permanently damaged. You will
need to delete the disk using the appropriate options. If metadata was stored on the disk, you will need
to execute the offline version of the mmfsck command. See the General Parallel File System: Problem
Determination Guide and search for NSD failures for further information on handling this.
When deleting disks from a file system, the disks may or may not be available. If the disks being deleted
are still available, GPFS moves all of the data from those disks to the disks remaining in the file system.
However, if the disks being deleted are damaged, either partially or permanently, it is not possible to
Specify the file system and the names of one or more disks to delete with the mmdeldisk command. For
| example, to delete the disk hd2n97 from the file system fs2 enter:
mmdeldisk fs2 hd2n97
Refer to “mmdeldisk Command” on page 173 for syntax and usage information.
When replacing disks in a GPFS file system, first decide if you will:
1. Create new disks using the mmcrnsd command.
You should also decide whether to use the rewritten disk descriptor file produced by the mmcrnsd
command, or create a new list of disk descriptors. When using the rewritten file, the Disk Usage and
Failure Group specifications will remain the same as specified on the mmcrnsd command.
2. Select NSDs no longer in use by another GPFS file system. Issue the mmlsnsd -F command to display
the available disks.
To replace a disk in the file system, use the mmrpldisk command. For example, to replace the NSD
hd3n97 in file system fs2 with the existing NSD hd2n97, which is no longer in use by another file system,
enter:
| Note: If you attempt to replace a stopped disk and the file system is not replicated, the attempt will fail.
However, you can replace a stopped disk if the file system is replicated. You can do so in one of the
following ways:
v Deletion, addition, and rebalancing method:
1. Use the mmdeldisk command to delete the stopped disk from the file system.
2. Use the mmadddisk command to add a replacement disk.
3. Use the mmrestripefs -b command to rebalance the file system.
While this method requires rebalancing, it returns the system to a protected state faster (because it can
use all of the remaining disks to create new replicas), thereby reducing the possibility of losing data.
—Or—
v Direct replacement method:
Use the mmrpldisk command to directly replace the stopped disk.
The mmrpldisk command only runs at single disk speed because all data being moved must be
written to the replacement disk. The data is vulnerable while the command is running, and should a
second failure occur before the command completes, it is likely that some data will be lost.
If you need to delete, replace, or suspend a disk and you need to write new data while the disk is offline,
you can disable strict replication before you perform the disk action. However, data written while
replication is disabled will not be properly replicated. Therefore, after you perform the disk action, you
must re-enable strict replication and run the mmrestripefs -r command. To determine if a file system has
strict replication enforced, issue the mmlsfs -K command.
The information includes parameters that were specified on the mmcrfs command, and the current
availability and status of the disks. For example, to display the current status of the disk hd8vsdn100 in
the file system fs1, enter:
mmlsdisk fs1 -d hd8vsdn100
Refer to “mmlsdisk Command” on page 232 for syntax and usage information.
Disk availability
A disk's availability determines whether GPFS is able to read and write to the disk. There are four
possible values for availability:
up The disk is available to GPFS for normal read and write operations.
down No read and write operations can be performed on the disk.
recovering
| An intermediate state for disks coming up, during which GPFS verifies and corrects data. write
| operations can be performed while a disk is in this state, but read operations cannot (because
| data on the disk being recovered might be stale until the mmchdisk start command completes).
unrecovered
| The disk was not successfully brought up.
Disk availability is automatically changed from up to down when GPFS detects repeated I/O errors. You
can also change the availability of a disk by issuing the mmchdisk command.
Disk status
Disk status controls data placement and migration. Status changes as a result of a pending delete
operation, or when the mmchdisk command is issued to allow file rebalancing or re-replicating prior to
disk replacement or deletion.
Disk status has five possible values, but three are transitional:
GPFS migrates data off disks with a status of being emptied, replacing, or suspended onto disks with a
status of ready or replacement. During disk deletion or replacement, data is automatically migrated as
part of the operation. Issue the mmrestripefs command to initiate data migration from a suspended disk.
See “Deleting disks from a file system” on page 35, “Replacing disks in a GPFS file system” on page 36,
and “Restriping a GPFS file system” on page 24.
Refer to “Displaying GPFS disk states” on page 38 for a detailed description of disk states. You can
change both the availability and status of a disk using the mmchdisk command:
v Change disk availability using the mmchdisk command and the stop and start options
v Change disk status using the mmchdisk command and the suspend and resume options.
Issue the mmchdisk command with one of the following four options to change disk state:
suspend
Instructs GPFS to stop allocating space on the specified disk. Place a disk in this state prior to
disk deletion or replacement. This is a user-initiated state that GPFS will never use without an
explicit command to change disk state.
Note: A disk remains suspended until it is explicitly resumed. Restarting GPFS or rebooting
nodes does not restore normal access to a suspended disk.
resume
Informs GPFS that a disk previously suspended is now available for allocating new space.
Resume a disk only when you've suspended it and decided not to delete or replace it. If the disk
is currently in a stopped state, it remains stopped until you specify the start option. Otherwise,
normal read and write access to the disk resumes.
stop Instructs GPFS to stop any attempts to access the specified disk. Use this option to inform GPFS
that a disk has failed or is currently inaccessible because of maintenance. A disk's availability
remains down until it is explicitly started with the start option.
start Informs GPFS that a disk previously stopped is now accessible. GPFS does this by first changing
the disk availability from down to recovering. The file system metadata is then scanned and any
missing updates (replicated data that was changed while the disk was down) are repaired. If this
operation is successful, the availability is then changed to up.
For example, to suspend the hd8vsdn100 disk in the file system fs1, enter:
mmchdisk fs1 suspend -d hd8vsdn100
You can also use the mmchdisk command with the change option to change the Disk Usage and Failure
Group parameters for one or more disks in a GPFS file system. This can be useful in situations where, for
example, a file system that contains only RAID disks is being upgraded to add conventional disks that
are better suited to storing metadata. After adding the disks using the mmadddisk command, the
metadata currently stored on the RAID disks would have to be moved to the new disks to achieve the
desired performance improvement. To accomplish this, first the mmchdisk change command would be
issued to change the Disk Usage parameter for the RAID disks to dataOnly. Then the mmrestripefs
command would be used to restripe the metadata off the RAID device and onto the conventional disks.
For example, to specify that metadata should no longer be stored on disk hd8vsdn100, enter:
mmchdisk fs1 change -d "hd8vsdn100:::dataOnly"
See the “mmchdisk Command” on page 112 and the “mmlsdisk Command” on page 232 for complete
usage information.
For example, to assign node k145n07 as an NSD server for disk gpfs47nsd:
1. Make sure that k145n07 is not already assigned to the server list by issuing the mmlsnsd command.
mmlsnsd -d "gpfs47nsd"
The useNSDserver file system mount option can be used to set the order of access used in disk
discovery, and limit or eliminate switching from local access to NSD server access, or the other way
around. This option is specified using the -o flag of the mmmount, mount, mmchfs, and mmremotefs
commands, and has one of these values:
always
Always access the disk using the NSD server.
asfound
Access the disk as found (the first time the disk was accessed). No change of disk access from
local to NSD server, or the other way around, is performed by GPFS.
For example, to always use the NSD server when mounting file system fs1, issue this command:
mmmount fs1 -o useNSDserver=always
To change the disk discovery of a file system that is already mounted: cleanly unmount it, wait for the
unmount to complete, and then mount the file system using the desired -o useNSDserver option.
For fast recovery times with Persistent Reserve, you should also set the failureDetectionTime configuration
parameter. For fast recovery, a recommended value would be 10. You can set this by issuing the
command:
mmchconfig failureDetectionTime=10
To determine if the disks on the servers and the disks of a specific node have PR enabled, issue the
following command from the node:
mmlsnsd -X
If the GPFS daemon has been started on all the nodes in the cluster and the file system has been
mounted on all nodes that have direct access to the disks, then pr=yes should be on all hdisks. If you do
not see this, there is a problem. Refer to the General Parallel File System: Problem Determination Guide for
additional information on Persistent Reserve errors.
For GPFS fileset quotas, see the section Filesets in General Parallel File System: Advanced Administration
Guide.
Note: Windows nodes may be included in clusters that use GPFS quotas; however, Windows nodes do
not support the quota commands.
Once GPFS quota management has been enabled, you may establish quota values by:
v Setting default quotas for all new users, groups of users, or filesets.
v Explicitly establishing or changing quotas for users, groups of users, or filesets.
v Using the gpfs_quotactl() subroutine.
See the “mmcheckquota Command” on page 116, the “mmchfs Command” on page 121, the “mmcrfs
Command” on page 144, and the “mmedquota Command” on page 199 for complete usage information.
For additional information on quotas, see the General Parallel File System: Concepts, Planning, and
Installation Guide.
Default quotas
Applying default quotas provides for minimum quota limits for all new users, groups of users, or filesets
for a file system. If default quota values for a file system are not enabled, a new user, group of users or
fileset for that file system has a quota limit of zero, which establishes no limit to the amount of space that
they can use.
To apply different quota values for a particular user, group, or fileset, the system administrator must
explicitly configure those values using the mmedquota command. If after explicit quotas for a user,
group, or filesets have been established, it is necessary to reapply the default limits for that user, group,
or fileset, you must issue the mmedquota -d command.
For example, to create default quotas for users of the file system fs0, enter:
mmdefedquota -u fs0
See the “mmdefedquota Command” on page 159, “mmdefquotaoff Command” on page 162,
“mmdefquotaon Command” on page 164, and “mmedquota Command” on page 199 commands for
complete usage information.
When setting quota limits for a file system, replication within the file system should be considered. GPFS
quota management takes replication into account when reporting on and determining if quota limits have
been exceeded for both block and file usage. In a file system that has either type of replication set to a
value of two, the values reported on by both the mmlsquota command and the mmrepquota command
are double the value reported by the ls command.
The mmedquota command opens a session using your default editor, and prompts you for soft and hard
limits for inodes and blocks. For example, to set user quotas for user jesmith, enter:
mmedquota -u jesmith
Note: A zero quota limit indicates no quota limits have been established.
The current (in use) inode and block usage is for display only; it cannot be changed. When establishing a
new quota, zeros appear as limits. Replace the zeros, or old values if you are changing existing limits
with values based on the user's needs and the resources available. When you close the editor, GPFS
checks the values and applies them. If a value which is not valid is specified, GPFS generates an error
message. If this occurs, reenter the mmedquota command.
You may find it helpful to maintain a quota prototype, a set of limits that you can apply by name to any
user, group, or fileset without entering the individual values manually. This makes it easy to set the same
limits for all. The mmedquota command includes the -p option for naming a prototypical user, group, or
fileset on which limits are to be based.
For example, to set group quotas for all users in a group named blueteam to the prototypical values
established for prototeam, issue:
mmedquota -g -p prototeam blueteam
You may also reestablish default quotas for a specified user, group of users, or fileset when using the -d
option on the mmedquota command.
See the “mmedquota Command” on page 199 for complete usage information.
Checking quotas
The mmcheckquota command counts inode and space usage for a file system and writes the collected
data into quota files.
When issuing the mmcheckquota command on a mounted file system, negative in doubt values may be
reported if the quota server processes a combination of up-to-date and back-level information. This is a
transient situation and may be ignored.
During the normal operation of file systems with quotas enabled (not running mmcheckquota online),
the usage data reflects the actual usage of the blocks and inodes in the sense that if you delete files you
should see the usage amount decrease. The in doubt value does not reflect how much the user has used
already, it is just the amount of quotas that the quota server has assigned to its clients. The quota server
does not know whether the assigned amount has been used or not. The only situation where the in doubt
value is important to the user is when the sum of the usage and the in doubt value is greater than the
user's quota hard limit. In this case, the user is not allowed to allocate more blocks or inodes unless he
brings the usage down.
For example, to check quotas for the file system fs1 and report differences between calculated and
recorded disk quotas, enter:
mmcheckquota -v fs1
The information displayed shows that the quota information for USR7 was corrected. Due to a system
failure, this information was lost at the server, which recorded 0 subblocks and 0 files. The current usage
data counted is 96 subblocks and 3 files. This is used to update the quota:
fs1: quota check found the following differences:
USR7: 96 subblocks counted (was 0); 3 inodes counted (was 0)
See the “mmcheckquota Command” on page 116 for complete usage information.
Listing quotas
The mmlsquota command displays the file system quota limits, default quota limits, and current usage
information.
GPFS quota management takes replication into account when reporting on and determining if quota
limits have been exceeded for both block and file usage. In a file system that has either type of replication
set to a value of two, the values reported on by both the mmlsquota command and the mmrepquota
command and the values used to determine if quota limits have been exceeded will be double the value
reported by the ls command. When issuing the mmlsquota command on a mounted file system, negative
in doubt values may be reported if the quota server processes a combination of up-to-date and back-level
information. This is a transient situation and may be ignored.
Specify the quota information for one user, group of users, or fileset with the mmlsquota command. If
neither -g , nor -u, nor -j is specified, quota data is displayed for the user who entered the command.
For example, to display default quota information for users of all the file systems in the cluster, enter:
mmlsquota -d -u
This output shows for file system fs1 a default quota limit of 10240K for users has been established. For
file systems fs2 and fs3 no default quotas for users has been established.
To collect and display updated quota information about a group named blueteam, specify the -g and -e
options:
mmlsquota -g blueteam -e
See the “mmlsquota Command” on page 253 for complete usage information.
You can have quotas activated automatically whenever the file system is mounted by specifying the
quota option (-Q yes) when creating (mmcrfs -Q yes) or changing (mmchfs -Q yes) a GPFS file system.
When creating a file system, the default is to not have quotas activated, so you must specify this option if
you want quotas activated.
The mmquotaon command is used to turn quota limit checking back on if it had been deactivated by
issuing the mmquotaoff command. Specify the file system name, and whether user, group, or fileset
quotas are to be activated. If you want all three - user, group, and fileset quotas activated, specify only
the file system name. After quotas have been turned back on, issue the mmcheckquota command to
count inode and space usage.
For example, to activate user quotas on the file system fs1, enter:
mmquotaon -u fs1
See the “mmquotaon Command” on page 274 and the “mmlsfs Command” on page 238 for complete
usage information.
If this occurs, use the mmcheckquota command after reactivating quotas to reconcile allocation data.
When quota enforcement is deactivated, disk space and file allocations are made without regard to limits.
For example, to deactivate only user quotas on the file system fs1, enter:
mmquotaoff -u fs1
See the “mmquotaoff Command” on page 272 and the “mmlsfs Command” on page 238 for complete
usage information.
GPFS quota management takes replication into account when reporting on and determining if quota
limits have been exceeded for both block and file usage. In a file system that has either type of replication
set to a value of two, the values reported on by both the mmlsquota command and the mmrepquota
command and the values used to determine if quota limits have been exceeded will be double the value
reported by the ls command.
Specify whether you want to list only user quota information (-u flag), group quota information (-g flag),
or fileset quota information (-j flag) in the mmrepquota command. The default is to summarize all three
quotas. If the -e flag is not specified, there is the potential to display negative usage values as the quota
server may process a combination of up-to-date and back-level information. When issuing the
mmrepquota command on a mounted file system, negative in doubt values may be reported if the quota
server processes a combination of up-to-date and back-level information. This is a transient situation and
may be ignored.
To list the group quotas (-g option) for all file systems in the cluster (-a option), and print a report with
header lines (-v option), enter:
mmrepquota -g -v -a
See the “mmrepquota Command” on page 282 for complete usage information.
The user, group, or fileset files can be restored from backup copies by issuing the mmcheckquota
command with the appropriate options.
1. To restore the user quota file for the file system fs1 from the backup file userQuotaInfo, enter:
mmcheckquota -u userQuotaInfo fs1
2. This will restore the user quota limits set for the file system, but the usage information will not be
current. To bring the usage information to current values, the command must be reissued:
mmcheckquota fs1
If no backup files are available and the quota files are to be restored using a new file, follow these steps:
1. The existing corrupted quota files need to be removed:
a. Unmount the file system.
b. Disable quota management:
mmchfs fs1 -Q no
c. Remount the file system.
d. Remove the user.quota, group.quota, and fileset.quota files.
2. Enable quota management:
a. Unmount the file system.
b. Issue the following command:
mmchfs fs1 -Q yes
c. Remount the file system.
3. Reestablish quota limits by issuing the mmedquota command or the mmdefedquota command.
4. Gather the current quota usage values by issuing the mmcheckquota command.
Management of GPFS access control lists (ACLs) and NFS export includes these topics:
v “Traditional GPFS ACL administration”
v “NFS V4 ACL administration” on page 54
v “NFS and GPFS” on page 59
If you are using NFS V4 ACLs, see “NFS V4 ACL administration” on page 54. Both ACL types may
coexist in a single GPFS file system.
Traditional GPFS ACLs are based on the POSIX model. Traditional GPFS access control lists (ACLs)
extend the base permissions, or standard file access modes, of read (r), write (w), and execute (x) beyond
the three categories of file owner, file group, and other users, to allow the definition of additional users
and user groups. In addition, GPFS introduces a fourth access mode, control (c), which can be used to
govern who can manage the ACL itself.
In this way, a traditional ACL can be created that looks like this:
#owner:jesmith
#group:team_A
user::rwxc
group::rwx-
other::--x-
mask::rwxc
user:alpha:r-xc
group:audit:r-x-
group:system:rwx-
In this ACL:
v The first two lines are comments showing the file's owner, jesmith, and group name, team_A
v The next three lines contain the base permissions for the file. These three entries are the minimum
necessary for a GPFS ACL:
1. The permissions set for the file owner (user), jesmith
2. The permissions set for the owner's group, team_A
3. The permissions set for other groups or users outside the owner's group and not belonging to any
named entry
v The next line, with an entry type of mask, contains the maximum permissions allowed for any entries
other than the owner (the user entry) and those covered by other in the ACL.
v The last three lines contain additional entries for specific users and groups. These permissions are
limited by those specified in the mask entry, but you may specify any number of additional entries up
to a memory page (approximately 4 K) in size.
Each GPFS file or directory has an access ACL that determines its access privileges. These ACLs control
who is allowed to read or write at the file or directory level, as well as who is allowed to change the
ACL itself.
In addition to an access ACL, a directory may also have a default ACL. If present, the default ACL is used
as a base for the access ACL of every object created in that directory. This allows a user to protect all files
in a directory without explicitly setting an ACL for each one.
When a new object is created, and the parent directory has a default ACL, the entries of the default ACL
are copied to the new object's access ACL. After that, the base permissions for user, mask (or group if
mask is not defined), and other, are changed to their intersection with the corresponding permissions
from the mode parameter in the function that creates the object.
If the new object is a directory, its default ACL is set to the default ACL of the parent directory. If the
parent directory does not have a default ACL, the initial access ACL of newly created objects consists
only of the three required entries (user, group, other). The values of these entries are based on the mode
parameter in the function that creates the object and the umask currently in effect for the process.
Once you are satisfied that the correct permissions are set in the ACL file, you can apply them to the
target file with the mmputacl command. For example, to set permissions contained in the file project2.acl
for the file project2.history, enter:
mmputacl -i project2.acl project2.history
Although you can issue the mmputacl command without using the -i option to specify an ACL input file,
and make ACL entries through standard input, you will probably find the -i option more useful for
avoiding errors when creating a new ACL.
See the “mmputacl Command” on page 269 and the “mmgetacl Command” on page 213 for complete
usage information.
The first two lines are comments displayed by the mmgetacl command, showing the owner and owning
group. All entries containing permissions that are not allowed (because they are not set in the mask
entry) display with a comment showing their effective permissions.
See the “mmgetacl Command” on page 213 for complete usage information.
For example, use the -o option to specify a file to which the ACL is written:
mmgetacl -o old.acl project2.history
See the “mmgetacl Command” on page 213 and the “mmputacl Command” on page 269 for complete
usage information.
The current ACL entries are displayed using the default editor, provided that the EDITOR environment
variable specifies a complete path name. When the file is saved, the system displays information similar
to:
mmeditacl: 6027-967 Should the modified ACL be applied? (yes) or (no)
See the “mmeditacl Command” on page 196 for complete usage information.
You cannot delete the base permissions. These remain in effect after this command is executed.
See the “mmdelacl Command” on page 170 and the “mmgetacl Command” on page 213 for complete
usage information.
Depending on the value (posix | nfs4 | all) of the -k parameter, one or both ACL types can be allowed
for a given file system. Since ACLs are assigned on a per-file basis, this means that within the same file
system one file may have an NFS V4 ACL, while another has a POSIX ACL. The type of ACL can be
changed by using the mmputacl or mmeditacl command to assign a new ACL or by the mmdelacl
command (causing the permissions to revert to the mode which is in effect a POSIX ACL). At any point
in time, only a single ACL can be associated with a file. Access evaluation is done as required by the ACL
type associated with the file.
NFS V4 ACLs are represented in a completely different format than traditional ACLs. For detailed
information on NFS V4 and its ACLs, refer to the paper, NFS Version 4 Protocol and other information
found at: NFSv4 (http://www.nfsv4.org/).
In the case of NFS V4 ACLs, there is no concept of a default ACL. Instead, there is a single ACL and the
individual ACL entries can be flagged as being inherited (either by files, directories, both, or neither).
Consequently, specifying the -d flag on the mmputacl command for an NFS V4 ACL is an error.
As in traditional ACLs, users and groups are identified by specifying the type and name. For example,
group:staff or user:bin. NFS V4 provides for a set of special names that are not associated with a specific
local UID or GID. These special names are identified with the keyword special followed by the NFS V4
The next two lines provide a list of the available access permissions that may be allowed or denied, based
on the ACL type specified on the first line. A permission is selected using an 'X'. Permissions that are not
specified by the entry should be left marked with '-' (minus sign).
special:owner@:rwxc:allow:DirInherit:InheritOnly
(X)READ/LIST (X)WRITE/CREATE (X)MKDIR (-)SYNCHRONIZE (X)READ_ACL (X)READ_ATTR (-)READ_NAMED
(X)DELETE (X)DELETE_CHILD (X)CHOWN (X)EXEC/SEARCH (X)WRITE_ACL (-)WRITE_ATTR (-)WRITE_NAMED
user:smithj:rwxc:allow
(X)READ/LIST (X)WRITE/CREATE (X)MKDIR (-)SYNCHRONIZE (X)READ_ACL (X)READ_ATTR (-)READ_NAMED
(X)DELETE (X)DELETE_CHILD (X)CHOWN (X)EXEC/SEARCH (X)WRITE_ACL (-)WRITE_ATTR (-)WRITE_NAMED
In this table, the columns refer to the ACL entry for a given file, and the rows refer to the ACL entry for
its parent directory. The various combinations of these attributes produce one of these results:
Permit
Indicates that GPFS permits removal of a file with the combination of file and parent directory
ACL entries specified.
Deny Indicates that GPFS denies (does not permit) removal of a file with the combination of file and
parent directory ACL entries specified.
Removal of a file includes renaming the file, moving the file from one directory to another even if the file
name remains the same, and deleting it.
Table 6. Removal of a file with ACL entries DELETE and DELETE_CHILD
ACL Allows ACL Denies DELETE not UNIX mode bits
DELETE DELETE specified only
ACL Allows DELETE_CHILD Permit Permit Permit Permit
ACL Denies DELETE_CHILD Permit Deny Deny Deny
The UNIX mode bits are used in cases where the ACL is not an NFS V4 ACL.
It can also be the case that NFS V4 ACLs have been set for some file system objects (directories and
individual files) prior to administrator action to revert back to a POSIX-only configuration. Since the NFS
V4 access evaluation will no longer be performed, it is desirable for the mmgetacl command to return an
ACL representative of the evaluation that will now occur (translating NFS V4 ACLs into traditional
POSIX style). The -k posix option returns the result of this translation.
Users may need to see ACLs in their true form as well as how they are translated for access evaluations.
There are four cases:
1. By default, the mmgetacl command returns the ACL in a format consistent with the file system
setting:
v If posix only, it is shown as a traditional ACL.
v If nfs4 only, it is shown as an NFS V4 ACL.
v If all formats are supported, the ACL is returned in its true form.
2. The command mmgetacl -k nfs4 always produces an NFS V4 ACL.
3. The command mmgetacl -k posix always produces a traditional ACL.
4. The command mmgetacl -k native always shows the ACL in its true form, regardless of the file
system setting.
In general, users should continue to use the mmgetacl and mmeditacl commands without the -k flag,
allowing the ACL to be presented in a form appropriate for the file system setting. Since the NFS V4
ACLs are more complicated and therefore harder to construct initially, users that want to assign an NFS
V4 ACL should use the command mmeditacl -k nfs4 to start with a translation of the current ACL, and
then make any necessary modifications to the NFS V4 ACL that is returned.
The lines that follow the first one are then processed according to the rules of the expected ACL type.
special:owner@:----:deny
(-)READ/LIST (-)WRITE/CREATE (-)MKDIR (-)SYNCHRONIZE (-)READ_ACL (-)READ_ATTR (X)READ_NAMED
(-)DELETE (X)DELETE_CHILD (X)CHOWN (-)EXEC/SEARCH (-)WRITE_ACL (-)WRITE_ATTR (X)WRITE_NAMED
user:guest:r-xc:allow
(X)READ/LIST (-)WRITE/CREATE (-)MKDIR (X)SYNCHRONIZE (X)READ_ACL (-)READ_ATTR (-)READ_NAMED
(X)DELETE (-)DELETE_CHILD (-)CHOWN (X)EXEC/SEARCH (X)WRITE_ACL (-)WRITE_ATTR (-)WRITE_NAMED
user:guest:----:deny
(-)READ/LIST (-)WRITE/CREATE (-)MKDIR (-)SYNCHRONIZE (-)READ_ACL (-)READ_ATTR (X)READ_NAMED
(-)DELETE (X)DELETE_CHILD (X)CHOWN (-)EXEC/SEARCH (-)WRITE_ACL (X)WRITE_ATTR (X)WRITE_NAMED
This ACL shows four ACL entries (an allow and deny entry for each of owner@ and guest).
In general, constructing NFS V4 ACLs is more complicated than traditional ACLs. Users new to NFS V4
ACLs may find it useful to start with a traditional ACL and allow either mmgetacl or mmeditacl to
provide the NFS V4 translation, using the -k nfs4 flag as a starting point when creating an ACL for a
new file.
When assigning an ACL to a file that already has an NFS V4 ACL, there are some NFS rules that must be
followed. Specifically, in the case of a directory, there will not be two separate (access and default) ACLs,
as there are with traditional ACLs. NFS V4 requires a single ACL entity and allows individual ACL
entries to be flagged if they are to be inherited. Consequently, mmputacl -d is not allowed if the existing
ACL was the NFS V4 type, since this attempts to change only the default ACL. Likewise mmputacl
(without the -d flag) is not allowed because it attempts to change only the access ACL, leaving the
Export considerations
Keep these points in mind when exporting a GPFS file system to NFS. The operating system being used
and the version of NFS might require special handling or consideration.
Linux export considerations: For Linux nodes only, issue the exportfs -ra command to initiate a reread
of the /etc/exports file.
Starting with Linux kernel version 2.6, an fsid value must be specified for each GPFS file system that is
exported on NFS. For example, the format of the entry in /etc/exports for the GPFS directory /gpfs/dir1
might look like this:
/gpfs/dir1 cluster1(rw,fsid=745)
The administrator must assign fsid values subject to the following conditions:
1. The values must be unique for each file system.
2. The values must not change after reboots. The file system should be unexported before any change is
made to an already assigned fsid.
3. Entries in the /etc/exports file are not necessarily file system roots. You can export multiple directories
within a file system. In the case of different directories of the same file system, the fsids must be
different. For example, in the GPFS file system /gpfs, if two directories are exported (dir1 and dir2),
the entries might look like this:
/gpfs/dir1 cluster1(rw,fsid=745)
/gpfs/dir2 cluster1(rw,fsid=746)
4. If a GPFS file system is exported from multiple nodes, the fsids should be the same on all nodes.
Large installations with hundreds of compute nodes and a few login nodes or NFS-exporting nodes
require tuning of the GPFS parameters maxFilesToCache and maxStatCache with the mmchconfig
command. The general suggestion is for the compute nodes to set maxFilesToCache to about 200. The
login or NFS nodes should set this parameter much higher, with maxFilesToCache set to 1000 and
maxStatCache set to 50000.
This tuning is required for the GPFS token manager (file locking), which can handle approximately
1,000,000 files in memory. The default value of maxFilesToCache is 1000 and the default value of
maxStatCache is 4 * maxFilesToCache, so that by default, each node holds 5000 tokens. The token
manager keeps track of a total number of tokens, which equals 5000 * number of nodes. This will exceed
the memory limit of the token manager on large configurations.
If you are running at SLES 9 SP 1, the kernel defines the sysctl variable fs.nfs.use_underlying_lock_ops,
which determines whether the NFS lockd is to consult the file system when granting advisory byte-range
locks. For distributed file systems like GPFS, this must be set to true (the default is false).
For additional considerations when NFS exporting your GPFS file system, refer to "File system creation
considerations" in GPFS: Concepts, Planning, and Installation Guide.
NFS export considerations for versions prior to NFS V4: For NFS exported file systems, the version of
NFS you are running with may have an impact on the number of inodes you need to cache, as set by
both the maxStatCache and maxFilesToCache parameters on the mmchconfig command. The
implementation of the ls command differs from NFS V2 to NFS V3. The performance of the ls command
in NFS V3 in part depends on the caching ability of the underlying file system. Setting the cache large
enough will prevent rereading inodes to complete an ls command, but will put more of a CPU load on
the token manager.
Also, the clocks of all nodes in your GPFS cluster must be synchronized. If this is not done, NFS access to
the data, as well as other GPFS file system operations, may be disrupted.
NFS V4 export considerations: For information on NFS V4, see the paper, NFS Version 4 Protocol and
other information found at: NFSv4 General Information and References for the NFSv4 protocol
(www.nfsv4.org).
To export a GPFS file system using NFS V4, there are two file system settings that must be in effect.
These attributes can be queried using the mmlsfs command, and set using the mmcrfs and mmchfs
commands.
1. The -D nfs4 flag is required. Conventional NFS access would not be blocked by concurrent file system
reads or writes (this is the POSIX semantic). NFS V4 however, not only allows for its requests to block
if conflicting activity is happening, it insists on it. Since this is an NFS V4 specific requirement, it
must be set before exporting a file system.
flag value description
---- -------------- -----------------------------------------------------
-D nfs4 File locking semantics in effect
2. The -k nfs4 or -k all flag is required. Initially, a file system has the -k posix setting, and only
traditional GPFS ACLs are allowed. To export a file system using NFS V4, NFS V4 ACLs must be
enabled. Since NFS V4 ACLs are vastly different and affect several characteristics of the file system
objects (directories and individual files), they must be explicitly enabled. This is done either
exclusively, by specifying -k nfs4, or by allowing all ACL types to be stored.
flag value description
---- -------------- -----------------------------------------------------
-k all ACL semantics in effect
You may also choose to restrict the use of the NFS server node through the normal GPFS path and not
use it as either a file system manager node or an NSD server.
NFS can be restarted after the unmount completes. On Linux, issue this command:
/etc/rc.d/init.d/nfs start
A GPFS soft-mount does not automatically unmount. Setting -fstype nfs3 causes the local server mounts
to always go through NFS. This allows you to have the same auto.map file on all nodes whether the
server is local or not, and the automatic unmount will occur. If you want local soft-mounts of GPFS file
systems while other nodes perform NFS mounts, you should have different auto.map files on the
different classes of nodes. This should improve performance on the GPFS nodes as they will not have to
go through NFS.
The participating nodes are designated as Cluster NFS (CNFS) member nodes and the entire setup is
frequently referred to as CNFS or CNFS cluster.
In this solution, all CNFS nodes export the same file systems to the NFS clients. When one of the CNFS
nodes fails, the NFS serving load moves from the failing node to another node in the CNFS cluster.
Failover is done using recovery groups to help choose the preferred node for takeover.
For more information about CNFS, see General Parallel File System: Advanced Administration Guide.
For example, GPFS can recognize sequential reads and therefore prefetch blocks in advance of when they
are required by the application. However, in many cases GPFS does not recognize the access pattern of
the application, or cannot optimize its data transfers. In these situations, performance may improve if the
application explicitly discloses aspects of its access pattern to GPFS using the gpfs_fcntl() subroutine.
These subroutines are exploited by MPI Version 2.0.
The gpfs_fcntl() subroutine allows application programs to pass two classes of file access information
giving GPFS an opportunity to improve throughput and latency of file system requests:
1. Hints
2. Directives
Hints allow an application to disclose its future accesses to GPFS. Hints are always optional. Adding or
removing hints from a program, even incorrectly specified hints, will never alter the meaning of a
program. Hints can only affect the performance of an application. The hints that may be passed to GPFS
are:
1. gpfsAccessRange_t
2. gpfsClearFileCache_t
3. gpfsFreeRange_t
4. gpfsMultipleAccessRange_t
Note: GPFS is free to silently ignore a hint if system resources do not permit the hint to be processed.
In contrast, directives are stronger than hints. They may affect program semantics and must be either
carried out by GPFS or return an error. The directives which may be passed to GPFS are:
1. gpfsCancelHints_t
2. gpfsDataShipMap_t
3. gpfsDataShipStart_t
4. gpfsDataShipStop_t
To communicate hints and directives to GPFS, an application program builds a data structure in memory,
then passes it to GPFS. This data structure consists of:
v A fixed length header, mapped by gpfsFcntlHeader_t.
v Followed by a variable number of, and any combination of, hints and directives.
Hints and directives may be mixed within a single gpfs_fcntl() subroutine, and are performed in the
order that they appear. A subsequent hint or directive may cancel out a preceding one.
The header and hints and directives that follow it are defined as C structures.
The gpfs_fcntl() subroutine takes the handle of the opened file as its first parameter and the address of
the data structure as its second parameter. For complete definitions of the gpfs_fcntl() subroutine, the
header, hints, directives, and other operations, see “gpfs_fcntl() Subroutine” on page 330.
Synopsis
Or,
Description
Use the mmaddcallback command to register a user-defined command that GPFS will execute when
certain events occur.
Parameters
CallbackIdentifier
Specifies a user-defined unique name that identifies the callback. It can be up to 255 characters
long. It cannot contain special characters (for example, a colon, semicolon, blank, tab, or comma)
and it cannot start with the letters gpfs or mm (which are reserved for GPFS internally defined
callbacks).
--command CommandPathname
| Specifies the full path name of the executable to run when the desired event occurs.
| The executable that will be called by the mmaddcallback facility should be installed on all nodes
| on which the callback can be triggered. Place the executable in a local file system (not in a GPFS
| file system) so that it is accessible even when the networks fail.
--event Event[,Event...]
Specifies a list of events that trigger the callback. The value defines when the callback will be
invoked. There are two kinds of events: global events and local events. A global event is an event
that will trigger a callback on all nodes in the cluster, such as a nodeLeave event, which informs
all nodes in the cluster that a node has failed. A local event is an event that will trigger a callback
only on the node on which the event occurred, such as mounting a file system on one of the
nodes. The following is a list of supported global and local events.
Global events include:
nodeJoin
Triggered when one or more nodes join the cluster.
nodeLeave
Triggered when one or more nodes leave the cluster.
quorumReached
Triggered when a quorum has been established in the GPFS cluster.
quorumLoss
Triggered when a quorum has been lost in the GPFS cluster.
quorumNodeJoin
Triggered when one or more quorum nodes join the cluster.
quorumNodeLeave
Triggered when one or more quorum nodes leave the cluster.
Options
-S Filename | --spec-file Filename
Specifies a file with multiple callback definitions, one per line. The first token on each line must
be the callback identifier.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
1. To register command /tmp/myScript to run after GPFS startup, issue this command:
mmaddcallback test1 --command=/tmp/myScript --event startup
The system displays information similar to:
mmaddcallback: Propagating the cluster configuration data to all
affected nodes. This is an asynchronous process.
2. To register a callback that NFS exports or to unexport a particular file system after it has been
mounted or before it has been unmounted, issue this command:
mmaddcallback NFSexport --command /usr/local/bin/NFSexport --event mount,preUnmount -N c26f3rp01
--parms "%eventName %fsName"
See also
Location
/usr/lpp/mmfs/bin
Synopsis
mmadddisk Device {"DiskDesc[;DiskDesc...]" | -F DescFile} [-a] [-r] [-v {yes | no}] [-N {Node[,Node...] |
NodeFile | NodeClass}]
Description
Use the mmadddisk command to add disks to a GPFS file system. This command optionally rebalances
an existing file system after adding disks when the -r flag is specified. The mmadddisk command does
not require the file system to be unmounted before issuing the command. The file system can be in use
while the command is run.
Parameters
Device The device name of the file system to which the disks are added. File system names need not be
fully-qualified. fs0 is as acceptable as /dev/fs0.
This must be the first parameter.
DiskDesc
A descriptor for each disk to be added. Each descriptor is delimited by a semicolon (;) and the
entire list must be enclosed in quotation marks (' or ").
The actual number of disks in your file system may be constrained by products other than GPFS
that you have installed. Refer to the individual product documentation.
A disk descriptor is defined as (second, third and sixth fields reserved):
DiskName:::DiskUsage:FailureGroup::StoragePool
DiskName
You must specify the name of the NSD previously created by the mmcrnsd command.
For a list of available disks, issue the mmlsnsd -F command.
DiskUsage
Specify a disk usage or accept the default:
dataAndMetadata
Indicates that the disk contains both data and metadata. This is the default for
disks in the system pool.
dataOnly
Indicates that the disk contains data and does not contain metadata.
Options
-a Specifies asynchronous processing. If this flag is specified, the mmadddisk command returns
after the file system descriptor is updated and the rebalancing scan is started; it does not wait for
rebalancing to finish. If no rebalancing is requested (the -r flag not specified), this option has no
effect.
-r Rebalance all existing files in the file system to make use of new disks.
Note: Rebalancing of files is an I/O intensive and time consuming operation, and is important
only for file systems with large files that are mostly invariant. In many cases, normal file update
and creation will rebalance your file system over time, without the cost of the rebalancing.
Important: Using -v no on a disk that already belongs to a file system will corrupt that file
system. This will not be noticed until the next time that file system is mounted.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
To add the disks whose descriptors are located in ./disk_pools to the file system fs1 and rebalance the
existing files after it is added, issue this command:
mmadddisk fs1 -F ./disk_pools -r
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
Use the mmaddnode command to add nodes to an existing GPFS cluster. On each new node, a mount
point directory and character mode device is created for each GPFS file system.
Parameters
-N NodeDesc[,NodeDesc...] | NodeFile
Specifies node descriptors, which provide information about nodes to be added to the cluster.
NodeFile
Specifies a file containing a list of node descriptors (see below), one per line, to be added
to the cluster.
NodeDesc[,NodeDesc...]
Specifies the list of nodes and node designations to be added to the GPFS cluster. Node
descriptors are defined as:
NodeName:NodeDesignations:AdminNodeName
where:
NodeName
Specifies the host name or IP address of the node for GPFS daemon-to-daemon
communication.
The host name or IP address must refer to the communication adapter over
which the GPFS daemons communicate (if multiple adapters are going to be
used, see the subnets option in the “mmchconfig Command” on page 102).
Aliased interfaces are not allowed. Use the original address or a name that is
resolved by the host command to that original address. You can specify a node
using any of these forms:
v Short host name (for example, h135n01)
v Long, fully-qualified, host name (for example, h135n01.ibm.com)
v IP address (for example, 7.111.12.102)
Regardless of which form you use, GPFS will resolve the input to a host name
and an IP address and will store these in its configuration files. It is expected that
those values will not change while the node belongs to the cluster.
Note: If you are designating a new node as a quorum node, and adminMode
central is in effect for the cluster, GPFS must be down on all nodes in the
cluster. Alternatively, you may choose to add the new nodes as nonquorum
and once GPFS has been successfully started on the new nodes, you can
change their designation to quorum using the mmchnode command.
AdminNodeName
Specifies an optional field that consists of a node interface name to be used by
the administration commands to communicate between nodes. If AdminNodeName
is not specified, the NodeName value is used.
You must provide a NodeDesc for each node to be added to the GPFS cluster.
Options
NONE
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
To add nodes k164n06 and k164n07 as quorum nodes, designating k164n06 to be available as a manager
node, issue this command:
mmaddnode -N k164n06:quorum-manager,k164n07:quorum
See also
Location
/usr/lpp/mmfs/bin
Synopsis
mmapplypolicy {Device | Directory} [-A IscanBuckets] [-a IscanThreads] [-B MaxFiles] [-D yyyy-mm-dd
[@hh:mm[:ss]]] [-e] [-f FileListPrefix] [-g GlobalWorkDirectory] [-I {yes | defer | test | prepare}] [-i
InputFileList] [-L n] [-M name=value...] [-m ThreadLevel] [-N {all | mount | Node[,Node...] | NodeFile |
NodeClass}] [-n DirThreadLevel] [-P PolicyFile] [-q] [-r FileListPathname...] [-S SnapshotName] [-s
| LocalWorkDirectory] [--single-instance] [--sort-buffer-size Size]
Description
Use the mmapplypolicy command to manage migration and replication of the data to and from GPFS
storage pools. It can also be used to delete files from GPFS. You can issue the mmapplypolicy command
from any node in the GPFS cluster that has the file system mounted.
The mmapplypolicy command does not affect placement rules (for example, the SET POOL and
RESTORE rule) that are installed for a file system by the mmchpolicy command. To display the currently
installed rules, issue the mmlspolicy command.
A given file can match more than one list rule, but will be included in a given list only once. ListName
provides the binding to an EXTERNAL LIST rule that specifies the executable program to use when
processing the generated list.
The EXTERNAL POOL rule defines an external storage pool. This rule does not match files, but serves to
define the binding between the policy language and the external storage manager that implements the
external storage.
Any given file is a potential candidate for at most one MIGRATE or DELETE operation during one
invocation of the mmapplypolicy command. That same file may also match the first applicable LIST rule.
A file that matches an EXCLUDE rule is not subject to any subsequent MIGRATE, DELETE, or LIST
rules. You should carefully consider the order of rules within a policy to avoid unintended consequences.
For detailed information on GPFS policies, see the GPFS: Advanced Administration Guide.
This command cannot be run from a Windows node. The GPFS API, documented functions in gpfs.h are
not implemented on Windows, however the policy language does support the Windows file attributes, so
you can manage your GPFS Windows files using the mmapplypolicy command running on an AIX or
Linux node.
Parameters
Device Specifies the device name of the file system from which files will have the policy rules applied.
File system names need not be fully-qualified. fs0 is just as acceptable as /dev/fs0. If specified,
this must be the first parameter.
Directory
Specifies the fully-qualified path name of a GPFS file system subtree from which files will have
the policy rules applied. If specified, this must be the first parameter.
-I {yes | defer | test | prepare}
Specifies what actions the mmapplypolicy command performs on files:
Options
-A IscanBuckets
Specifies the number of buckets of inode numbers (number of inode/filelists) to be created by the
parallel directory scan and processed by the parallel inode scan. Affects the execution of the
high-performance protocol that is used when both -g and -N are specified. The default is 17.
Tip: Set this parameter to a prime number that is close in value to the expected number of the
files to be scanned divided by one million. Then each bucket will have about one million files.
-a IscanThreads
Specifies the number of threads and sort pipelines each node will run during the parallel inode
scan and policy evaluation. It affects the execution of the high-performance protocol that is used
when both -g and -N are specified. The default is 2. Using a moderately larger number can
significantly improve performance, but might "strain" the resources of the node. In some
environments a large value for this parameter can lead to a command failure.
Tip: Set this parameter to the number of CPU "cores" implemented on a typical node in your
GPFS cluster.
-B MaxFiles
Specifies how many files are passed for each invocation of the EXEC script. The default value is
100.
If the number of files exceeds the value specified for MaxFiles, mmapplypolicy invokes the
external program multiple times.
Note: The -e option is particularly useful with -r, but can be used apart from it. It is useful
because in the time that elapses after the policy evaluation and up to the policy execution, it is
possible that the chosen pathname no longer refers to the same inode (for example the original
file was removed or renamed), or that some of the attributes of the chosen file have changed in
some way so that the chosen file no longer satisfies the conditions of the rule. In general, the
longer the elapsed time, the more likely it is that conditions have changed (depending on how
the file system is being used). For example, if files are only written once and never renamed or
erased, except by policy rules that call for deletion after an expiration interval, then it is probably
not necessary to re-evaluate with the -e option.
For more information about -r, refer to the General Parallel File System: Advanced Administration
Guide.
-f FileListPrefix
Specifies the location (a path name or file name prefix or directory) in which the file lists for
external pool and list operations are stored when the either the -I defer or -I prepare option has
been chosen. The default location is LocalWorkDirectory/mmapplypolicy.processid.
-g GlobalWorkDirectory
Specifies a global directory to be used for temporary storage during mmapplypolicy command
processing. The specified directory must exist within a shared file system. It must also be
mounted and available for writing and reading from each of the nodes specified by the -N
option. When both -N and -g are specified, mmapplypolicy uses high performance and
fault-tolerant protocols during execution.
Note: The -g option should specify a directory (for temporary or work files) within a GPFS file
system that is accessible from each node specified with the -N option. The directory can either be
in the file system being operated upon by mmapplypolicy or in another file system.
There is no default value for -g.
If the -g option is specified, but not the -s option, the directory specified by -g is used for all
temporary files required by mmapplypolicy. If both the -g and -s options are specified,
temporary files may be stored in each. In general, temporary files that are only written and read
by a single node are stored in the local work directory specified by the -s option, while
temporary files that must be accessed by more than one node are stored in the global work
directory specified by the -g option.
-i InputFileList
| Specifies the path name for a user-provided input file list. This file list enables you to specify
| multiple starter directories or files. It can be in either of the following formats:
| Each record represents either a single file or a directory. When a directory is specified, the
| command processes the entire subtree that is rooted at the specified path name
| File names can contain spaces and special characters; however, the special characters '\'
| and '\n' must be escaped with the '\' character similarly to the way mmapplypolicy
| writes path names in file lists for external pool and list operations.
| The end-of-record character must be \n.
| Example:
| /mak/ea
| /mak/old news
| /mak/special\\stuff
| /usr/lpp/mmfs/samples/ilm/mmglobexpf.sample is an example of a script that can be
| used to generate simple format file lists.
| expert format file list
| A list of records with the following format:
| INODE:GENERATION:path-length!PATH_NAME end-of-record-character
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
1. This command displays the actions that would occur if a policy were applied, but does not apply the
policy at this time:
mmapplypolicy fs1 -P policyfile -I test
/* Migration rule */
RULE ’migration to system pool’ MIGRATE FROM POOL ’sp1’ TO POOL ’system’ WHERE NAME LIKE ’%sp1%’
[I] Directories scan: 11 files, 1 directories, 0 other objects, 0 ’skipped’ files and/or errors.
[I] Inodes scan: 11 files, 1 directories, 0 other objects, 0 ’skipped’ files and/or errors.
[I] Summary of Rule Applicability and File Choices:
Rule# Hit_Cnt KB_Hit Chosen KB_Chosen KB_Ill Rule
0 3 1536 0 0 0 RULE ’exclude *.save files’ EXCLUDE WHERE(.)
1 3 1536 3 1536 0 RULE ’delete’ DELETE FROM POOL ’sp1’ WHERE(.)
2 2 1024 2 1024 0 RULE ’migration to system pool’ MIGRATE FROM POOL \
’sp1’ TO POOL ’system’ WHERE(.)
/* Deletion rule */
RULE ’delete’ DELETE FROM POOL ’sp1’ WHERE NAME LIKE ’%.tmp’
/* Migration rule */
RULE ’migration to system pool’ MIGRATE FROM POOL ’sp1’ TO POOL ’system’ WHERE NAME LIKE ’%sp1%’
[I] Directories scan: 11 files, 1 directories, 0 other objects, 0 ’skipped’ files and/or errors.
[I] Inodes scan: 11 files, 1 directories, 0 other objects, 0 ’skipped’ files and/or errors.
[I] Summary of Rule Applicability and File Choices:
Rule# Hit_Cnt KB_Hit Chosen KB_Chosen KB_Ill Rule
0 3 3072 0 0 0 RULE ’exclude *.save files’ EXCLUDE WHERE(.)
1 3 3072 3 3072 0 RULE ’delete’ DELETE FROM POOL ’sp1’ WHERE(.)
2 2 2048 2 2048 0 RULE ’migration to system pool’MIGRATE FROM POOL \
’sp1’ TO POOL ’system’ WHERE(.)
| Additional examples of GPFS policies and using the mmapplypolicy command are in the GPFS: Advanced
| Administration Guide.
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Or,
Or,
Or,
Or,
mmauth grant {RemoteClusterName | all } -f { Device | all } [-a {rw | ro} ] [-r {uid:gid | no}]
Or,
Or,
Description
The mmauth command prepares a cluster to grant secure access to file systems owned locally. The
mmauth command also prepares a cluster to receive secure access to file systems owned by another
cluster. Use the mmauth command to generate a public/private key pair for the local cluster. A
public/private key pair must be generated on both the cluster owning the file system and the cluster
desiring access to the file system. The administrators of the clusters are responsible for exchanging the
public portion of the public/private key pair. Use the mmauth command to add or delete permission for
a cluster to mount file systems owned by the local cluster.
When a cluster generates a new public/private key pair, administrators of clusters participating in remote
file system mounts are responsible for exchanging their respective public key file /var/mmfs/ssl/
id_rsa.pub generated by this command.
The administrator of a cluster desiring to mount a file system from another cluster must provide the
received key file as input to the mmremotecluster command. The administrator of a cluster allowing
another cluster to mount a file system must provide the received key file to the mmauth command.
Parameters
-N {Node[,Node...] | NodeFile | NodeClass}
Specifies the nodes on which the key files should be restored. The default is -N all.
For general information on how to specify node names, see “Specifying nodes as input to GPFS
commands” on page 3.
This command does not support a NodeClass of mount.
RemoteClusterName
Specifies the remote cluster name requesting access to local GPFS file systems. The value all
indicates all remote clusters defined to the local cluster.
Options
-a {rw | ro}
The type of access allowed:
ro Specifies read-only access.
rw Specifies read/write access. This is the default.
-C NewClusterName
| Specifies a new, fully-qualified cluster name for the already-defined cluster RemoteClusterName.
-f Device
The device name for a file system owned by this cluster. The Device argument is required. If all is
specified, the command applies to all file systems owned by this cluster at the time that the
command is issued.
Exit status
0 Successful completion. After a successful completion of the mmauth command, the configuration
change request will have been propagated to all nodes in the cluster.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
1. This is an example of an mmauth genkey new command:
mmauth genkey new
See also
Accessing GPFS file systems from other GPFS clusters in General Parallel File System: Advanced Administration
Guide.
Location
/usr/lpp/mmfs/bin
Synopsis
mmbackup {Device | Directory} [-f] [-g GlobalWorkDirectory] [-N {Node[,Node...] | NodeFile | NodeClass}] [-s
LocalWorkDirectory] [-S SnapshotName] [-t {full | incremental}]
Description
Use the mmbackup command to back up the user data from a GPFS file system, directory, or snapshot to
a backup TSM server.
Note: The TSM Backup-Archive client must be installed on the nodes that will be executing the
mmbackup command.
| You can run multiple instances of mmbackup, as long as they are on different filesystems.
Parameters
Device The device name for the file system to be backed up. File system names need not be
fully-qualified. fs0 is as acceptable as /dev/fs0.
Directory
Specifies the directory or a snapshot to be backed up. The contents of the .snapshots directory
would be backed up as if they were regular files in the file system.
-N {Node[,Node...] | NodeFile | NodeClass}
Specifies the list of nodes that will run parallel instances of the backup process. The TSM
Backup-Archive client must be installed on all nodes specified with this parameter. This
command supports all defined node classes. The default is to run only on the node where the
| mmbackup command is running or the current value of the defaultHelperNodes parameter of
| the mmchconfig command.
For general information on how to specify node names, see “Specifying nodes as input to GPFS
commands” on page 3.
Options
-f Specifies that processing should continue when unlinked filesets are detected. All files that belong
to unlinked filesets will be ignored.
-g GlobalWorkDirectory
The directory to be used for temporary files that need to be shared between the mmbackup
worker nodes. Defaults to the value specified with the -s option or /tmp.
-s TempDirectory
Specifies the directory to be used for temporary storage during mmbackup command processing.
The default directory is /tmp.
-S SnapshotName
Specifies the name of a snapshot for file system backup operations. The name appears as a
subdirectory of the .snapshots directory in the file system root and can be found with the
mmlssnapshot command.
Exit status
0 Successful completion.
1 Partially successful completion. Not all of the eligible files were successfully backed up. Invoke
mmbackup again to backup the files not backed up by the previous mmbackup.
2 A failure occurred. Invoke mmbackup again to backup the files not backed up by the previous
mmbackup.
Security
| The node on which the command is issued, as well as all other TSM Backup-Archive client nodes, must
| be able to execute remote shell commands on any other node in the cluster without the use of a
| password and without producing any extraneous messages. See “Requirements for administering a GPFS
| file system” on page 1.
Examples
To perform an incremental backup of the file system gpfs5, issue this command:
mmbackup gpfs5
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
The mmbackupconfig command, in conjunction with the mmrestoreconfig command, can be used to
collect basic file system configuration information that can later be used to restore the file system. The
configuration information backed up by this command includes block size, replication factors, number
and size of disks, storage pool layout, filesets and junction points, policy rules, quota information, and a
number of other file system attributes.
This command does not back up user data or individual file attributes.
Parameters
Device The device name of the file system to be backed up. File system names need not be
fully-qualified. fs0 is as acceptable as /dev/fs0.
This must be the first parameter.
-o OutputFile
The path name of a file to which the file system information is to be written. This file must be
provided as input to the subsequent mmrestoreconfig command.
Exit status
0 Successful completion.
1 Partially successful completion.
2 A failure has occurred that cannot be corrected. A new mmbackupconfig run is required.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
See also
/usr/lpp/mmfs/bin
Synopsis
mmchattr [-m MetadataReplicas] [-M MaxMetadataReplicas] [-r DataReplicas] [-R MaxDataReplicas] [-P
| DataPoolName] [-D {yes | no} ] [-I {yes | defer}] [-i {yes | no}] [-l] [-a {yes | no}] Filename [Filename...]
Description
Use the mmchattr command to change the replication attributes, storage pool assignment, retention and
immutability attributes, and I/O caching policy for files in the GPFS file system.
The replication factor must be less than or equal to the maximum replication factor for the file. If
insufficient space is available in the file system to increase the number of replicas to the value requested,
the mmchattr command ends. However, some blocks of the file may have their replication factor
increased after the mmchattr command ends. If additional free space becomes available in the file system
at a later time (when, for example, you add another disk to the file system), you can then issue the
mmrestripefs command with the -r or -b option to complete the replication of the file. The
mmrestripefile command can be used in a similar manner. You can use the mmlsattr command to
display the replication values.
Data of a file is stored in a specific storage pool. A storage pool is a collection of disks or RAIDs with
similar properties. Because these storage devices have similar properties, you can manage them as a
groups. You can use storage pools to:
v Partition storage for the file system
v Assign file storage locations
v Improve system performance
v Improve system reliability
The Direct I/O caching policy bypasses file cache and transfers data directly from disk into the user
space buffer, as opposed to using the normal cache policy of placing pages in kernel memory.
Applications with poor cache hit rates or very large I/Os may benefit from the use of Direct I/O.
You must have write permission for the files whose attributes you are changing.
Parameters
Filename [Filename ...]
The name of one or more files to be changed. Delimit each file name by a space. Wildcard
characters are supported in file names, for example, project*.sched.
Options
| -a {yes | no}
| Specifies whether the file is in appendOnly mode (-a yes) or not (-a no).
-D {yes | no}
Enable or disable the Direct I/O caching policy for files.
-i {yes | no}
Specifies whether the file is immutable (-i yes) or not immutable (-i no).
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
You must have write access to the file to run the mmchattr command.
You may issue the mmchattr command only from a node in the GPFS cluster where the file system is
mounted.
Examples
1. To change the metadata replication factor to 2 and the data replication factor to 2 for the
project7.resource file in file system fs1, issue this command:
mmchattr -m 2 -r 2 /fs1/project7.resource
To confirm the change, issue this command:
| mmlsattr project7.resource
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Or,
mmchcluster -p LATEST
Or,
Or,
mmchcluster -C ClusterName
Description
The mmchcluster command serves several purposes. You can use it to:
1. Change the primary or secondary GPFS cluster configuration server.
2. Synchronize the primary GPFS cluster configuration server.
3. Change the remote shell and remote file copy programs to be used by the nodes in the cluster.
4. Change the cluster name.
5. Specify node interfaces to be used by the GPFS administration commands.
6. The mmchnode command replaces the mmchcluster -N command for changes to node data related to
the cluster configuration.
To display current system information for the cluster, issue the mmlscluster command.
For general information on how to specify node names, see “Specifying nodes as input to GPFS
commands” on page 3.
When issuing the mmchcluster command with the -p or -s options, the specified nodes must be available
in order for the command to succeed. If any of the nodes listed are not available when the command is
issued, a message listing those nodes is displayed. You must correct the problem on each node and
reissue the command.
Attention: The mmchcluster command, when issued with either the -p or -s option, is designed to
operate in an environment where the current primary and secondary cluster configuration servers are not
available. As a result, the command can run without obtaining its regular serialization locks. To assure
smooth transition to a new cluster configuration server, no other GPFS commands (mm commands)
should be running when the command is issued, nor should any other command be issued until the
mmchcluster command has successfully completed.
Parameters
-C ClusterName
Specifies a new name for the cluster. If the user-provided name contains dots, it is assumed to be
a fully-qualified domain name. Otherwise, to make the cluster name unique, the domain of the
primary configuration server will be appended to the user-provided name.
Options
-R RemoteFileCopy
Specifies the fully-qualified path name for the remote file copy program to be used by GPFS.
The remote copy command must adhere to the same syntax format as the rcp command, but may
implement an alternate authentication mechanism.
-r RemoteShellCommand
Specifies the fully-qualified path name for the remote shell program to be used by GPFS.
The remote shell command must adhere to the same syntax format as the rsh command, but may
implement an alternate authentication mechanism.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
To change the primary GPFS server for the cluster, issue this command:
mmchcluster -p k164n06
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
Use the mmchconfig command to change the GPFS configuration attributes on a single node, a set of
nodes, or globally for the entire cluster.
When changing both maxblocksize and pagepool, the command fails unless these conventions are
followed:
v When increasing the values, pagepool must be specified first.
v When decreasing the values, maxblocksize must be specified first.
Results
Parameters
-N {Node[,Node...] | NodeFile | NodeClass}
Specifies the set of nodes to which the configuration changes apply. For general information on
how to specify node names, see “Specifying nodes as input to GPFS commands” on page 3.
| To see a complete list of the attributes for which the -N flag is valid, refer to the "list of
| NodeNames allowed" column of Table 3 on page 9, or check the individual attribute descriptions
| that follow.
This command does not support a NodeClass of mount.
Options
-I Specifies that the changes take effect immediately, but do not persist when GPFS is restarted. This
option is valid only for the dataStructureDump, dmapiEventTimeout, dmapiMountTimeoout,
dmapiSessionFailureTimeout, maxMBpS, pagepool, unmountOnDiskFail, and verbsRdma
attributes.
-i Specifies that the changes take effect immediately and are permanent. This option is valid only
for the dataStructureDump, dmapiEventTimeout, dmapiMountTimeoout,
dmapiSessionFailureTimeout, maxMBpS, pagepool, unmountOnDiskFail, and verbsRdma
attributes.
| Attribute=value
| Specifies the name of the attribute to be changed and its associated value. More than one attribute
| and value pair, in a comma-separated list, can be changed with one invocation of the command.
| To restore the GPFS default setting for any given attribute, specify DEFAULT as its value.
adminMode
Specifies whether all nodes in the cluster will be used for issuing GPFS administration commands
or just a subset of the nodes. Valid values are:
| Note: After defaultHelperNodes has been set, that setting can be overridden by specifying the
| -N flag on any of the above commands.
defaultMountDir
Specifies the default parent directory for GPFS file systems. The default value is /gpfs. If an
explicit mount directory is not provided with the mmcrfs, mmchfs, or mmremotefs command,
the default mount point will be set to DefaultMountDir/DeviceName.
dmapiDataEventRetry
Controls how GPFS handles data events that are enabled again right after the event is handled by
the DMAPI application. Valid values are:
-1 Specifies that GPFS will always regenerate the event as long as it is enabled. This value
should only be used when the DMAPI application recalls and migrates the same file in
parallel by many processes at the same time.
0 Specifies to never regenerate the event. This value should not be used if a file could be
migrated and recalled at the same time.
RetryCount
Specifies the number of times the data event should be retried. The default is 2.
For further information regarding DMAPI for GPFS, see the General Parallel File System: Data
Management API Guide.
dmapiEventTimeout
Controls the blocking of file operation threads of NFS, while in the kernel waiting for the
handling of a DMAPI synchronous event. The parameter value is the maximum time, in
milliseconds, the thread will block. When this time expires, the file operation returns
ENOTREADY, and the event continues asynchronously. The NFS server is expected to repeatedly
retry the operation, which eventually will find the response of the original event and continue.
This mechanism applies only to read, write, and truncate event types, and only when such events
come from NFS server threads. The timeout value is given in milliseconds. The value 0 indicates
immediate timeout (fully asynchronous event). A value greater than or equal to 86400000 (which
is 24 hours) is considered infinity (no timeout, fully synchronous event). The default value is
86400000.
For further information regarding DMAPI for GPFS, see the General Parallel File System: Data
Management API Guide.
| The -N flag is valid for this attribute.
This will allow more lenient intranode locking, but impose internode whole file range tokens on
files using mmap while writing.
nsdServerWaitTimeForMount
When mounting a file system whose disks depend on NSD servers, this option specifies the
number of seconds to wait for those servers to come up. The decision to wait is controlled by the
criteria managed by the nsdServerWaitTimeWindowOnMount option.
Valid values are between 0 and 1200 seconds. The default is 300. A value of zero indicates that no
waiting is done. The interval for checking is 10 seconds. If nsdServerWaitTimeForMount is 0,
nsdServerWaitTimeWindowOnMount has no effect.
The mount thread waits when the daemon delays for safe recovery. The mount wait for NSD
servers to come up, which is covered by this option, occurs after expiration of the recovery wait
allows the mount thread to proceed.
| The -N flag is valid for this attribute.
The order in which you specify the subnets determines the order that GPFS uses these subnets to
establish connections to the nodes within the cluster. For example, subnets="192.168.2.0" refers to
IP addresses 192.168.2.0 through 192.168.2.255.
This feature cannot be used to establish fault tolerance or automatic failover. If the interface
corresponding to an IP address in the list is down, GPFS does not use the next one on the list.
For more information about subnets, see General Parallel File System: Advanced Administration Guide
and search on Using remote access with public and private IP addresses.
tiebreakerDisks
Controls whether GPFS will use the node quorum with tiebreaker algorithm in place of the
regular node based quorum algorithm. See General Parallel File System: Concepts, Planning, and
Installation Guide and search for node quorum with tiebreaker. To enable this feature, specify the
names of one or three disks. Separate the NSD names with semicolon (;) and enclose the list in
quotes. The disks do not have to belong to any particular file system, but must be directly
accessible from the quorum nodes. For example:
tiebreakerDisks="gpfs1nsd;gpfs2nsd;gpfs3nsd"
In this format, device is the HCA device name (such as mthca0) and port is the one-based port
number (such as 1 or 2). If you do not specify a port number, GPFS uses port 1 as the default.
For example:
verbsPorts="mthca0/1 mthca0/2"
will create two RDMA connections between the NSD client and server using both ports of a dual
ported adapter.
| The -N flag is valid for this attribute.
verbsRdma
Enables or disables InfiniBand RDMA using the Verbs API for data transfers between an NSD
client and NSD server. Valid values are enable or disable.
| The -N flag is valid for this attribute.
Note: InfiniBand RDMA for Linux X86_64 is supported only on GPFS V3.2 Multiplatform. For
the latest support information, see the GPFS Frequently Asked Questions at: http://
publib.boulder.ibm.com/infocenter/clresctr/vxrx/index.jsp?topic=/com.ibm.cluster.gpfs.doc/
gpfs_faqs/gpfsclustersfaq.html.
worker1Threads
Controls the maximum number of concurrent file operations at any one instant. If there are more
requests than that, the excess will wait until a previous request has finished.
This attribute is primarily used for random read or write requests that cannot be pre-fetched,
random I/O requests, or small file activity. The default value is 48. The minimum value is 1. The
maximum value of prefetchThreads plus worker1Threads is:
v 164 on 32-bit kernels
v 550 on 64-bit kernels
| The -N flag is valid for this attribute.
Exit status
0 Successful completion.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
To change the maximum file system block size allowed to 4 MB, issue this command:
mmchconfig maxblocksize=4M
See also
Location
/usr/lpp/mmfs/bin
Synopsis
mmchdisk Device {suspend | resume | stop | start | change} -d "DiskDesc [;DiskDesc...]" | -F {DescFile}
[-N {Node [,Node...] | NodeFile | NodeClass}]
Or,
Description
Use the mmchdisk command to change the state or the parameters of one or more disks in a GPFS file
system.
The state of a disk is a combination of its status and availability, displayed with the mmlsdisk command.
Disk status is normally either ready or suspended. A transitional status such as replacing, replacement,
or being emptied might also appear if a disk is being deleted or replaced. A suspended disk is one that
the user has decided not to place any new data on. Existing data on a suspended disk may still be read
or updated. Typically, a disk is suspended prior to restriping a file system. Suspending a disk tells the
mmrestripefs command that data is to be migrated off that disk. Disk availability is either up or down.
Be sure to use stop before you take a disk offline for maintenance. You should also use stop when a disk
has become temporarily inaccessible due to a disk failure that is repairable without loss of data on that
disk (for example, an adapter failure or a failure of the disk electronics).
The Disk Usage (dataAndMetadata, dataOnly, metadataOnly, or descOnly) and Failure Group parameters
of a disk are adjusted with the change option. See the General Parallel File System: Concepts, Planning, and
Installation Guide and search for recoverability considerations. The mmchdisk change command does not
move data or metadata that resides on the disk. After changing disk parameters, in particular, Disk Usage,
you may have to issue the mmrestripefs command with the -r option to relocate data so that it conforms
to the new disk parameters.
The mmchdisk command can be issued for a mounted or unmounted file system. When maintenance is
complete or the failure has been repaired, use the mmchdisk command with the start option. If the
failure cannot be repaired without loss of data, you can use the mmdeldisk command.
Note:
1. The mmchdisk command cannot be used to change the NSD servers associated with the disk. Use the
mmchnsd command for this purpose.
2. Similarly, the mmchdisk command cannot be used to change the storage pool for the disk. Use the
mmdeldisk and mmadddisk commands to move a disk from one storage pool to another.
Parameters
Device The device name of the file system to which the disks belong. File system names need not be
fully-qualified. fs0 is as acceptable as /dev/fs0.
This must be the first parameter.
-d "DiskDesc[;DiskDesc... ]"
A descriptor for each disk to be changed.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
1. To suspend active disk gpfs2nsd, issue this command:
mmchdisk fs0 suspend -d gpfs2nsd
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Or,
Description
The mmcheckquota command counts inode and space usage for a file system and writes the collected
data into quota files. Indications leading you to the conclusion you should run the mmcheckquota
command include:
v MMFS_QUOTA error log entries. This error log entry is created when the quota manager has a
problem reading or writing the quota file.
v Quota information is lost due to a node failure. A node failure could leave users unable to open files or
deny them disk space that their quotas should allow.
v The in-doubt value is approaching the quota limit.
The sum of the in-doubt value and the current usage may not exceed the hard limit. Consequently, the
actual block space and number of files available to the user of the group may be constrained by the
in-doubt value. Should the in-doubt value approach a significant percentage of the quota, use the
mmcheckquota command to account for the lost space and files.
v User, group, or fileset quota files are corrupted.
The mmcheckquota command is I/O intensive and should be run when the system load is light. When
issuing the mmcheckquota command on a mounted file system, negative in-doubt values may be
reported if the quota server processes a combination of up-to-date and back-level information. This is a
transient situation and may be ignored.
If a file system is ill-replicated, the mmcheckquota command will not be able to determine exactly how
many valid replicas actually exist for some of the blocks. If this happens, the used block count results
from mmcheckquota will not be accurate. It is recommended that you run mmcheckquota to restore
accurate usage count after the file system is no longer ill-replicated.
Parameters
-a Checks all GPFS file systems in the cluster from which the command is issued.
Device The device name of the file system. File system names need not be fully-qualified. fs0 is as
acceptable as /dev/fs0.
-g GroupQuotaFileName
Replace the current group quota file with the file indicated.
Options
-v Reports discrepancies between calculated and recorded disk quotas.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
GPFS must be running on the node from which the mmcheckquota command is issued.
Examples
1. To check quotas for file system fs0, issue this command:
mmcheckquota fs0
The system displays information only if a problem is found or if quota management is not enabled for
a file system:
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
The mmchfileset command changes the name or comment for an existing GPFS fileset.
For information on GPFS filesets, see the GPFS: Advanced Administration Guide.
Parameters
Device The device name of the file system that contains the fileset.
File system names need not be fully-qualified. fs0 is as acceptable as /dev/fs0.
FilesetName
Specifies the name of the fileset.
-J JunctionPath
Specifies the junction path name for the fileset.
A junction is a special directory entry that connects a name in a directory of one fileset to the root
directory of another fileset.
-j NewFilesetName
Specifies the new name that is to be given to the fileset. This name must be less than 256
characters in length. This flag may be specified along with the -t flag.
-t NewComment
Specifies an optional comment that appears in the output of the mmlsfileset command. This
comment must be less than 256 characters in length. This flag may be specified along with the -j
flag.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
| Examples
| This command renames fileset fs1 to fs2and gives it the comment "first fileset":
| mmchfileset gpfs1 fs1 -j fs2 -t ’first fileset’
| See also
Location
/usr/lpp/mmfs/bin
Synopsis
mmchfs Device
[-A {yes | no | automount}]
| [-D {posix | nfs4}]
| [-E {yes | no}]
[-k {posix | nfs4 | all}]
[-K {no | whenpossible | always}]
[-m DefaultMetadataReplicas]
| [-n NumNodes]
[-o MountOptions]
| [-Q {yes | no }]
[-r DefaultDataReplicas]
[-S {yes | no}]
[-T Mountpoint]
[-t DriveLetter]
[-V {full | compat}]
[-z {yes | no}]
| [--filesetdf | --nofilesetdf]
| [--inode-limit MaxNumInodes[:NumInodesToPreallocate]]
| [--mount-priority Priority]
Or,
Description
Use the mmchfs command to change the attributes of a GPFS file system.
Parameters
Device The device name of the file system to be changed.
File system names need not be fully-qualified. fs0 is as acceptable as /dev/fs0. However, file
system names must be unique across GPFS clusters.
This must be the first parameter.
Options
-A {yes | no | automount}
Indicates when the file system is to be mounted:
yes When the GPFS daemon starts.
no Manual mount.
automount
On non-Windows nodes, when the file system is first accessed. On Windows nodes, when
the GPFS daemon starts.
Note: The file system must be unmounted prior to changing the automount settings.
-D {nfs4 | posix}
Specifies whether a 'deny-write open lock' will block writes, which is expected and required by
Important: On clusters running GPFS 3.1 or GPFS 3.2, this option should only be used if all the
nodes have installed the code to handle it: 3.1.0.24 or later or 3.2.1.8 or later.
Note: The -V option cannot be used to make file systems created prior to GPFS 3.2.1.5 available
to Windows nodes. Windows nodes can mount only file systems that are created with GPFS
3.2.1.5 or later.
Before issuing the -V option, see the Migration, coexistence and compatibility topic in the General
Parallel File System: Concepts, Planning, and Installation Guide. You must ensure that all nodes in the
cluster have been migrated to the latest level of GPFS code and that you have successfully run
the mmchconfig release=LATEST command.
For information about specific file system format and function changes when upgrading to the
current release, see Chapter 12, “File system format changes between versions of GPFS,” on page
469.
full Enables all new functionality that requires different on-disk data structures. Nodes in
remote clusters running an older GPFS version will no longer be able to mount the file
system. If there are any nodes running an older GPFS version that have the file system
mounted at the time the command is issued, the mmchfs command will fail.
compat
Enables only backwardly compatible format changes. Nodes in remote clusters that are
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
To change the default replicas for metadata to 2 and the default replicas for data to 2 for new files created
in the fs0 file system, issue this command:
mmchfs fs0 -m 2 -r 2
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
Use the mmchlicense command to change the type of GPFS license associated with the nodes in the
cluster.
For information on GPFS license designation, see the topic "GPFS license designation" in the GPFS:
Concepts, Planning, and Installation Guide.
Parameters
server | client
The type of GPFS license to be assigned to the nodes specified with the -N parameter.
server
The GPFS server license permits the licensed node to mount GPFS file systems and access
data from operating system block devices. The GPFS server license permits the licensed
node to perform GPFS management functions such as cluster configuration manager,
quorum node, manager node, and NSD server.
The GPFS server license also permits the licensed node to share GPFS data through any
application, service, protocol, or method, such as Network File System (NFS), Common
Internet File System (CIFS), File Transfer Protocol (FTP), or Hypertext Transfer Protocol
(HTTP).
client The GPFS client license permits a node to mount GPFS file systems and access data from
operating system block devices as well as NSD servers.
The GPFS client license only permits exchange of data between nodes that locally mount
the same file system. No other export of the data is permitted.
--accept
Indicates that you accept the applicable licensing terms. The license acceptance prompt will be
suppressed.
-N {Node[,Node...] | NodeFile | NodeClass}
Specifies the nodes that are to be assigned the specified license type. For general information on
how to specify node names, see “Specifying nodes as input to GPFS commands” on page 3.
Options
NONE
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
Examples
To designate nodes k145n04 and k145n05 as possessing a GPFS server license, issue this command:
mmchlicense server --accept -N k145n04,k145n05
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
The mmchmgr command assigns a new file system manager node or cluster manager node.
Parameters
Device The device name of the file system for which the file system manager node is to be changed. File
system names need not be fully-qualified. fs0 is just as acceptable as /dev/fs0.
-c Changes the cluster manager node.
Node
The target node to be appointed as either the new file system manager or cluster manager node.
Target nodes for manager functions are selected according to use:
v Target nodes for cluster manager function must be specified from the list of quorum nodes
v Target nodes for file system manager function should be specified from the list of manager
nodes
If Node is not specified, the new manager is selected automatically.
For general information on how to specify node names, see “Specifying nodes as input to GPFS
commands” on page 3.
Options
NONE
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
1. Assume the file system manager for the file system gpfs1 is currently k164n05. To migrate the file
system manager responsibilities to k164n06, issue this command:
mmchmgr gpfs1 k164n06
The system displays information similar to:
GPFS: 6027-628 Sending migrate request to current manager node
89.116.68.69 (k164n05).
GPFS: 6027-629 Node 89.116.68.69 (k164n05) resigned as manager for gpfs1
GPFS: 6027-630 Node 89.116.68.70 (k164n06) appointed as manager for gpfs1
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Or,
Description
Use the mmchnode command to change one or more attributes on a single node or on a set of nodes. If
conflicting node designation attributes are specified for a given node, the last value is used. If any of the
attributes represent a node-unique value, the -N option must resolve to a single node.
Parameters
-N { Node[,Node...] | NodeFile | NodeClass}
Specifies the nodes whose states are to be changed.
For general information on how to specify node names, see “Specifying nodes as input to GPFS
commands” on page 3.
-S Filename | --spec-file=Filename
Specifies a file with a detailed description of the changes to be made. Each line represents the
changes to an individual node and has the following format:
node-identifier change-options
change-options
A blank-separated list of attribute[=value] pairs. The following attributes can be specified:
--admin-interface={hostname | ip_address}
Specifies the name of the node to be used by GPFS administration commands when
communicating between nodes. The admin node name must be specified as an IP address
or a hostname that is resolved by the host command to the desired IP address. If the
keyword DEFAULT is specified, the admin interface for the node is set to be equal to the
daemon interface for the node.
--client
Specifies that the node should not be part of the pool of nodes from which cluster
managers, file system managers, and token managers are selected.
--cnfs-disable
Disables the CNFS functionality of a CNFS member node.
--cnfs-enable
Enables a previously-disabled CNFS member node.
--cnfs-groupid=groupid
Specifies a failover recovery group for the node. If the keyword DEFAULT is specified,
the CNFS recovery group for the node is set to zero.
For additional information, refer to "Implementing a clustered NFS using GPFS on Linux"
in the GPFS: Advanced Administration Guide.
--cnfs-interface=ip_address_list
A comma-separated list of host names or IP addresses to be used for GPFS cluster NFS
serving.
Note: If you are designating a node as a quorum node, and adminMode central is in
effect for the cluster, you must ensure that GPFS is up and running on that node
(mmgetstate reports the state of the node as active).
--snmp-agent
Designates the node as an SNMP collector node. If the GPFS daemon is active on this
node, the SNMP subagent will be started as well. For additional information see the topic
"GPFS SNMP support" in the General Parallel File System: Advanced Administration Guide.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
Examples
1. To change nodes k145n04 and k145n05 to be both quorum and manager nodes, issue this command:
mmchnode --quorum --manager -N k145n04,k145n05
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
The mmchnsd command serves several purposes. You can use it to:
v Specify a server list for an NSD that does not have one.
v Change the NSD server nodes specified in the server list.
v Delete the server list. The disk must now be SAN-attached to all nodes in the cluster on which the file
system will be mounted.
Parameters
DiskDesc
A descriptor for each NSD to be changed. Each descriptor is separated by a semicolon (;). The
entire list must be enclosed in single or double quotation marks.
-F DescFile
Specifies a file containing a list of disk descriptors, one per line.
Each disk descriptor must be specified in the form:
DiskName:ServerList
DiskName
Is the NSD name that was given to the disk by the mmcrnsd command.
ServerList
Is a comma-separated list of NSD server nodes. You can specify up to eight NSD servers
in this list. The defined NSD will preferentially use the first server on the list. If the first
server is not available, the NSD will use the next available server on the list.
When specifying server nodes for your NSDs, the output of the mmlscluster command
lists the host name and IP address combinations recognized by GPFS. The utilization of
aliased host names not listed in the mmlscluster command output may produce
undesired results.
If you do not define a ServerList, GPFS assumes that the disk is SAN-attached to all nodes
in the cluster. If all nodes in the cluster do not have access to the disk, or if the file
system to which the disk belongs is to be accessed by other GPFS clusters, you must
specify a value for ServerList.
Options
NONE
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
If the disk gpfs1nsd is currently defined with k145n05 as the first server and k145n07 as the second
server, and you want to replace k145n05 with k145n09, issue this command:
mmchnsd "gpfs1nsd:k145n09,k145n07"
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
Use the mmchpolicy command to establish the rules for policy-based lifecycle management of the files in
a given GPFS file system. Some of the things that can be controlled with the help of policy rules are:
v File placement at creation time
v Replication factors
v Movement of data between storage pools
v File deletion
The mmapplypolicy command must be run to move data between storage pools or delete files.
Policy changes take effect immediately on all nodes that have the affected file system mounted. For nodes
that do not have the file system mounted, policy changes take effect upon the next mount of the file
system.
For information on GPFS policies, see the GPFS: Advanced Administration Guide.
Parameters
Device Specifies the device name of the file system for which policy information is to be established or
changed. File system names need not be fully-qualified. fs0 is just as acceptable as /dev/fs0. This
must be the first parameter.
PolicyFileName
Specifies the name of the file that contains the policy rules. If you specify DEFAULT, GPFS
replaces the current policy file with a single policy rule that assigns all newly-created files to the
system storage pool.
Options
-I {yes | test}
Specifies whether to activate the rules in the policy file PolicyFileName.
yes The policy rules are validated and immediately activated. This is the default.
test The policy rules are validated, but not installed.
-t DescriptiveName
Specifies an optional descriptive name to be associated with the policy rules. The string must be
less than 256 characters in length. If not specified, the descriptive name defaults to the base name
portion of the PolicyFileName parameter.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
1. This command validates a policy before it is installed:
mmchpolicy fs2 policyfile -I test
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
Upon successful completion of the mmcrcluster command, the /var/mmfs/gen/mmsdrfs and the
/var/mmfs/gen/mmfsNodeData files are created on each of the nodes in the cluster. Do not delete these
files under any circumstances. For more information, see the Quorum topic in the GPFS: Concepts,
Planning, and Installation Guide.
Parameters
-A Specifies that GPFS daemons are to be automatically started when nodes come up. The default is
not to start daemons automatically.
-C ClusterName
Specifies a name for the cluster. If the user-provided name contains dots, it is assumed to be a
fully qualified domain name. Otherwise, to make the cluster name unique, the domain of the
primary configuration server will be appended to the user-provided name.
If the -C flag is omitted, the cluster name defaults to the name of the primary GPFS cluster
configuration server.
-c ConfigFile
Specifies a file containing GPFS configuration parameters with values different than the
documented defaults. A sample file can be found in /usr/lpp/mmfs/samples/mmfs.cfg.sample. See
the mmchconfig command for a detailed description of the different configuration parameters.
The -c ConfigFile parameter should be used only by experienced administrators. Use this file to
set up only those parameters that appear in the mmfs.cfg.sample file. Changes to any other
values may be ignored by GPFS. When in doubt, use the mmchconfig command instead.
where:
NodeName
Specifies the host name or IP address of the node for GPFS daemon-to-daemon
communication. For hosts with multiple adapters, see the GPFS: Advanced
Administration Guide and search on Using remote access with public and private IP
addresses.
The host name or IP address must refer to the communication adapter over
which the GPFS daemons communicate. Aliased interfaces are not allowed. Use
the original address or a name that is resolved by the host command to that
original address. You can specify a node using any of these forms:
v Short host name (for example, h135n01)
v Long, fully-qualified, host name (for example, h135n01.ibm.com)
v IP address (for example, 7.111.12.102)
Regardless of which form you use, GPFS will resolve the input to a host name
and an IP address and will store these in its configuration files. It is expected that
those values will not change while the node belongs to the cluster.
NodeDesignations
An optional, "-" separated list of node roles:
v manager | client – Indicates whether a node is part of the node pool from
which file system managers and token managers can be selected. The default is
client.
v quorum | nonquorum – Indicates whether a node is counted as a quorum
node. The default is nonquorum.
AdminNodeName
Specifies an optional field that consists of a node name to be used by the
administration commands to communicate between nodes. If AdminNodeName is
not specified, the NodeName value is used.
You must provide a NodeDesc for each node to be added to the GPFS cluster.
-p PrimaryServer
Specifies the primary GPFS cluster configuration server node used to store the GPFS
configuration data. This node must be a member of the GPFS cluster.
-R RemoteFileCopy
Specifies the fully-qualified path name for the remote file copy program to be used by GPFS. The
default value is /usr/bin/rcp.
The remote copy command must adhere to the same syntax format as the rcp command, but may
implement an alternate authentication mechanism.
-r RemoteShellCommand
Specifies the fully-qualified path name for the remote shell program to be used by GPFS. The
default value is /usr/bin/rsh.
Chapter 8. GPFS commands 139
The remote shell command must adhere to the same syntax format as the rsh command, but may
implement an alternate authentication mechanism.
-s SecondaryServer
Specifies the secondary GPFS cluster configuration server node used to store the GPFS cluster
data. This node must be a member of the GPFS cluster.
It is suggested that you specify a secondary GPFS cluster configuration server to prevent the loss
of configuration data in the event your primary GPFS cluster configuration server goes down.
When the GPFS daemon starts up, at least one of the two GPFS cluster configuration servers
must be accessible.
If your primary GPFS cluster configuration server fails and you have not designated a secondary
server, the GPFS cluster configuration files are inaccessible, and any GPFS administration
commands that are issued fail. File system mounts or daemon startups also fail if no GPFS cluster
configuration server is available.
-U DomainName
Specifies the UID domain name for the cluster.
A detailed description of the GPFS user ID remapping convention is contained in UID Mapping
for GPFS In a Multi-Cluster Environment at www.ibm.com/servers/eserver/clusters/library/
wp_aix_lit.html.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
To create a GPFS cluster made of all of the nodes listed in the file /u/admin/nodelist, using node k164n05
as the primary server, and node k164n04 as the secondary server, issue:
mmcrcluster -N /u/admin/nodelist -p k164n05 -s k164n04
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
The mmcrfileset command constructs a new fileset with the specified name. The new fileset is empty
except for a root directory, and does not appear in the directory name space until the mmlinkfileset
command is issued. The mmcrfileset command is separate from the mmlinkfileset command to allow
the administrator to establish policies and quotas on the fileset before it is linked into the name space.
For information on GPFS filesets, see the GPFS: Advanced Administration Guide.
| The maximum number of filesets that GPFS supports is 10,000 filesets per file system.
Parameters
Device The device name of the file system to contain the new fileset.
File system names need not be fully-qualified. fs0 is as acceptable as /dev/fs0.
FilesetName
Specifies the name of the newly created fileset.
-t Comment
Specifies an optional comment that appears in the output of the mmlsfileset command. This
comment must be less than 256 characters in length.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
1. This example creates two filesets in file system gpfs1:
mmcrfileset gpfs1 fset1
| See also
| Location
| /usr/lpp/mmfs/bin
| Synopsis
Description
Use the mmcrfs command to create a GPFS file system. The first two parameters must be Device and
either DiskDescList or DescFile and they must be in that order. The block size and replication factors
chosen affect file system performance. A maximum of 256 file systems can be mounted in a GPFS cluster
at one time, including remote file systems.
When deciding on the maximum number of files (number of inodes) in a file system, consider that for
file systems that will be doing parallel file creates, if the total number of free inodes is not greater than
5% of the total number of inodes, there is the potential for slowdown in file system access. The total
number of inodes can be increased using the mmchfs command.
When deciding on a block size for a file system, consider these points:
1. Supported block sizes are 16 KB, 64 KB, 128 KB, 256 KB, 512 KB, 1 MB, 2 MB, and 4 MB.
2. The GPFS block size determines:
v The minimum disk space allocation unit. The minimum amount of space that file data can occupy
is a sub-block. A sub-block is 1/32 of the block size.
v The maximum size of a read or write request that GPFS sends to the underlying disk driver.
3. From a performance perspective, it is recommended that you set the GPFS block size to match the
application buffer size, the RAID stripe size, or a multiple of the RAID stripe size. If the GPFS block
size does not match the RAID stripe size, performance may be severely degraded, especially for write
operations.
Results
Upon successful completion of the mmcrfs command, these tasks are completed on all GPFS nodes:
v Mount point directory is created.
v File system is formatted.
Parameters
Device The device name of the file system to be created.
File system names need not be fully-qualified. fs0 is as acceptable as /dev/fs0. However, file
system names must be unique within a GPFS cluster. Do not specify an existing entry in /dev.
-F DescFile
Specifies a file containing a list of disk descriptors, one per line. You may use the rewritten
DiskDesc file created by the mmcrnsd command, create your own file, or enter the disk
descriptors on the command line. When using the DiskDesc file created by the mmcrnsd
command, the values supplied on input to the command for Disk Usage and FailureGroup are
used. When creating your own file or entering the descriptors on the command line, you must
specify these values or accept the system defaults.
"DiskDesc[;DiskDesc...]"
A descriptor for each disk to be included. Each descriptor is separated by a semicolon (;). The
entire list must be enclosed in quotation marks (' or ").
A disk descriptor is defined as (second, third and sixth fields reserved):
DiskName:::DiskUsage:FailureGroup::StoragePool
DiskName
You must specify the name of the NSD previously created by the mmcrnsd command.
For a list of available disks, issue the mmlsnsd -F command.
DiskUsage
Specify a disk usage or accept the default:
dataAndMetadata
Indicates that the disk contains both data and metadata. This is the default for
disks in the system pool.
Chapter 8. GPFS commands 145
dataOnly
Indicates that the disk contains data and does not contain metadata. This is the
default for disks in storage pools other than the system pool.
metadataOnly
Indicates that the disk contains metadata and does not contain data.
descOnly
Indicates that the disk contains no data and no file metadata. Such a disk is used
solely to keep a copy of the file system descriptor, and can be used as a third
failure group in certain disaster recovery configurations. For more information,
see General Parallel File System: Advanced Administration and search on Synchronous
mirroring utilizing GPFS replication.
FailureGroup
A number identifying the failure group to which this disk belongs. You can specify any
value from -1 (where -1 indicates that the disk has no point of failure in common with
any other disk) to 4000. If you do not specify a failure group, the value defaults to the
node number of the first NSD server defined in the NSD server list plus 4000. If you do
not specify an NSD server list, the value defaults to -1. GPFS uses this information during
data and metadata placement to assure that no two replicas of the same block are written
in such a way as to become unavailable due to a single failure. All disks that are attached
to the same NSD server or adapter should be placed in the same failure group.
If replication of -m or -r is set to 2, storage pools must have two failure groups for the
commands to work properly.
StoragePool
Specifies the storage pool to which the disk is to be assigned. If this name is not
provided, the default is system.
Only the system pool may contain descOnly, metadataOnly or dataAndMetadata disks.
Options
-A {yes | no | automount}
Indicates when the file system is to be mounted:
yes When the GPFS daemon starts. This is the default.
no Manual mount.
automount
On non-Windows nodes, when the file system is first accessed. On Windows nodes, when
the GPFS daemon starts.
-B BlockSize
Size of data blocks. Must be 16 KB, 64 KB, 128 KB, 256 KB (the default), 512 KB, 1 MB, 2 MB, or 4
MB. Specify this value with the character K or M, for example 512K.
-D {nfs4 | posix}
Specifies whether a 'deny-write open lock' will block writes, which is expected and required by
NFS V4. File systems supporting NFS V4 must have -D nfs4 set. The option -D posix allows NFS
writes even in the presence of a deny-write open lock. If you intend to export the file system
using NFS V4 or Samba, you must use -D nfs4. For NFS V3 (or if the file system is not NFS
exported at all) use -D posix. The default is -D nfs4.
-E {yes | no}
Specifies whether to report exact mtime values (-E yes), or to periodically update the mtime value
for a file system (-E no). If it is more desirable to display exact modification times for a file
system, specify or use the default -E yes option.
Important: Using -v no on a disk that already belongs to a file system will corrupt that file
system. This will not be noticed until the next time that file system is mounted.
-z {yes | no}
Enable or disable DMAPI on the file system. Turning this option on will require an external data
management application such as Tivoli Storage Manager (TSM) hierarchical storage management
(HSM) before the file system can be mounted. The default is -z no. For further information on
DMAPI for GPFS, see General Parallel File System: Data Management API Guide.
| --filesetdf | --nofilesetdf
When this option is enabled and quotas are enforced for a fileset, the df command reports
| numbers based on the quotas for the fileset and not for the total file system.
| --inode-limit MaxNumInodes[:NumInodesToPreallocate]
Specifies the maximum number of files in the file system. This value is decided by the available
file system space, available metadata space, block size and the suggested number of nodes in the
cluster.
For file systems that will be creating parallel files, if the total number of free inodes is not greater
than 5% of the total number of inodes, file system access might slow down. Take this into
consideration when creating your file system.
The parameter NumInodesToPreallocate specifies the number of inodes that the system will
immediately preallocate. If you do not specify a value for NumInodesToPreallocate, GPFS will
dynamically allocate inodes as needed.
You can specify the NumInodes and NumInodesToPreallocate values with a suffix, for example 100K
or 2M.
| --mount-priority Priority
Controls the order in which the individual file systems are mounted at daemon startup or when
one of the all keywords is specified on the mmmount command.
File systems with higher Priority numbers are mounted after file systems with lower numbers.
File systems that do not have mount priorities are mounted last. A value of zero indicates no
| priority. This is the default.
| --version VersionString
Enable only the file system features that are compatible with the specified release. The lowest
allowed Version value is 3.1.0.0.
The default is 3.4.0.0, which will enable all currently available features but will prevent nodes
that are running earlier GPFS releases from accessing the file system. Windows nodes can mount
only file systems that are created with GPFS 3.2.1.5 or later.
Exit status
0 Successful completion.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
This example shows how to create a file system named gpfs1 using three disks, each with a block size of
512 KB, allowing metadata and data replication to be 2, turning quotas on, and creating /gpfs1 as the
mount point. To complete this task, issue the command:
mmcrfs gpfs1 "hd2n97;hd3n97;hd4n97" -B 512K -m 2 -r 2 -Q yes -T /gpfs1
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
The mmcrnsd command is used to create cluster-wide names for NSDs used by GPFS.
This is the first GPFS step in preparing disks for use by a GPFS file system. The disk descriptor file
supplied to this command is rewritten with the new NSD names and that rewritten disk descriptor file
can be supplied as input to the mmcrfs, mmadddisk or mmrpldisk commands.
The names created by the mmcrnsd command are necessary since disks connected to multiple nodes may
have different disk device names on each node. The NSD names uniquely identify each disk. This
command must be run for all disks that are to be used in GPFS file systems. The mmcrnsd command is
also used to assign each disk an NSD server list that can be used for I/O operations on behalf of nodes
that do not have direct access to the disk.
To identify that a disk has been processed by the mmcrnsd command, a unique NSD volume ID is
written on sector 2 of the disk. All of the NSD commands (mmcrnsd, mmlsnsd, and mmdelnsd) use this
unique NSD volume ID to identify and process NSDs.
| After the NSDs are created, the GPFS cluster data is updated and they are available for use by GPFS.
| On Windows, GPFS will only create NSDs from empty disk drives. mmcrnsd accepts Windows Basic
disks or Unknown/Not Initialized disks. It always re-initializes these disks so that they become Basic GPT
Disks with a single GPFS partition. NSD data is stored in GPFS partitions. This allows other operating
system components to recognize the disks are used. mmdelnsd deletes the partition tables created by
mmcrnsd.
Results
Upon successful completion of the mmcrnsd command, these tasks are completed:
v NSDs are created.
v The DescFile contains NSD names to be used as input to the mmcrfs, mmadddisk, or the mmrpldisk
commands.
v A unique NSD volume ID to identify each disk as an NSD has been written on sector 2.
v An entry for each new disk is created in the GPFS cluster data.
Parameters
-F DescFile
Specifies the file containing the list of disk descriptors, one per line. Disk descriptors have this
format:
DiskName:ServerList::DiskUsage:FailureGroup:DesiredName:StoragePool
DiskName
On UNIX, the block device name appearing in /dev for the disk you want to define as an
| NSD. Examples of disks that are accessible through a block device are SAN-attached
| disks. If server nodes are specified, DiskName must be the /dev name for the disk device
of the first listed NSD server node.
Upon successful completion of the mmcrnsd command, the DescFile file is rewritten to contain the created
NSD names in place of the device name. NSD servers defined in the ServerList and desiredName are
omitted from the rewritten disk descriptor and all other fields, if specified, are copied without
modification. The original lines, as well as descriptor lines in error, are commented out and preserved for
reference. The rewritten disk descriptor file can then be used as input to the mmcrfs, mmadddisk, or the
mmrpldisk commands. You must have write access to the directory where the DescFile file is located in
order to rewrite the created NSD information.
The Disk Usage and Failure Group specifications in the disk descriptor are preserved only if you use the
rewritten file produced by the mmcrnsd command. If you do not use this file, you must either accept the
default values or specify new values when creating disk descriptors for other commands.
Options
-v {yes |no}
Verify the disks are not already formatted as an NSD.
A value of -v yes specifies that the NSDs are to be created only if each disk has not been
formatted by a previous invocation of the mmcrnsd command, as indicated by the NSD volume
ID on sector 2 of the disk. A value of -v no specifies that the disks are to be created irrespective
of their previous state. The default is -v yes.
Important: Using -v no when a disk already belongs to a file system can corrupt that file system
by making that physical disk undiscoverable by that file system. This will not be noticed until the
next time that file system is mounted.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
and
Disk Name
sdav2
Server List
k145n06,k145n05
Disk Usage
dataAndMetadata, allowing both
Failure Group
5
NSD Name
ABC
Storage Pool
system
nsdesc is rewritten as
#sdav1:k145n05,k145n06::dataOnly:4::poolA
gpfs20nsd:::dataOnly:4::poolA
#sdav2:k145n06,k145n05::dataAndMetadata:5:ABC
ABC:::dataAndMetadata:5
Location
/usr/lpp/mmfs/bin
Synopsis
Description
Use the mmcrsnapshot command to create a snapshot of an entire GPFS file system at a single point in
time.
A snapshot is a copy of the changed user data in the file system. System data and existing snapshots are
not copied. The snapshot function allows a backup or mirror program to run concurrently with user
updates and still obtain a consistent copy of the file system as of the time the copy was created.
Snapshots also provide an online backup capability that allows easy recovery from common problems
such as accidental deletion of a file, and comparison with older versions of a file.
Snapshots are exact copies of changed data in the active files and directories of a file system. Snapshots
of a file system are read-only and they appear in a .snapshots directory located in the file system root
directory. The files and attributes of the file system can be changed only in the active copy.
There is a maximum limit of 256 snapshots per file system. Snapshots may be deleted only by issuing the
mmdelsnapshot command. The .snapshots directory cannot be deleted, though it can be renamed with
the mmsnapdir command using the -s option.
If the mmcrsnapshot command is issued while a conflicting command is running, the mmcrsnapshot
command waits for that command to complete. If the mmcrsnapshot command is running while a
conflicting command is issued, the conflicting command waits for the mmcrsnapshot command to
| complete.
| If quorum is lost before the mmcrsnapshot command completes, the snapshot is considered partial and
will be deleted when quorum is achieved again.
Because snapshots are not full, independent copies of the entire file system, they should not be used as
protection against media failures. For protection against media failures, see General Parallel File System:
Concepts, Planning, and Installation Guide and search on recoverability considerations.
For more information on snapshots, see Creating and maintaining snapshots of GPFS file systems in General
Parallel File System: Advanced Administration Guide.
Parameters
Device The device name of the file system for which the snapshot is to be created. File system names
need not be fully-qualified. fs0 is just as acceptable as /dev/fs0.
This must be the first parameter.
| SnapshotName
Specifies the name given to the snapshot. The name appears as a subdirectory of the .snapshots
directory in the file system root. Each snapshot must have a unique name.
If you do not want to traverse the file system's root to access the snapshot, a more convenient
mechanism that enables a connection in each directory of the active file system can be enabled
with the -a option of the mmsnapdir command.
NONE
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
To create a snapshot snap1, for the file system fs1, issue this command:
mmcrsnapshot fs1 snap1
Before issuing the command, the directory structure would appear similar to:
/fs1/file1
/fs1/userA/file2
/fs1/userA/file3
After the command has been issued, the directory structure would appear similar to:
/fs1/file1
/fs1/userA/file2
/fs1/userA/file3
/fs1/.snapshots/snap1/file1
/fs1/.snapshots/snap1/userA/file2
/fs1/.snapshots/snap1/userA/file3
If a second snapshot were to be created at a later time, the first snapshot would remain as is. Snapshots
are made only of active file systems, not existing snapshots. For example:
mmcrsnapshot fs1 snap2
After the command has been issued, the directory structure would appear similar to:
/fs1/.snapshots/snap1/file1
/fs1/.snapshots/snap1/userA/file2
/fs1/.snapshots/snap1/userA/file3
/fs1/.snapshots/snap2/file1
/fs1/.snapshots/snap2/userA/file2
/fs1/.snapshots/snap2/userA/file3
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
Use the mmdefedquota command to set or change default quota limits for new users, groups, and
filesets for a file system. Default quota limits for a file system may be set or changed only if the file
system was created with the -Q yes option on the mmcrfs command or changed with the mmchfs
command.
The mmdefedquota command displays the current values for these limits, if any, and prompts you to
enter new values using your default editor:
v current block usage (display only)
v current inode usage (display only)
v inode soft limit
v inode hard limit
v block soft limit
Displayed in KB, but may be specified using k, K, m, M, g, or G . If no suffix is provided, the number
is assumed to be in bytes.
v block hard limit
Displayed in KB, but may be specified using k, K, m, M, g, or G. If no suffix is provided, the number
is assumed to be in bytes.
The mmdefedquota command waits for the edit window to be closed before checking and applying new
values. If an incorrect entry is made, reissue the command and enter the correct values.
When setting quota limits for a file system, replication within the file system should be considered. GPFS
quota management takes replication into account when reporting on and determining if quota limits have
been exceeded for both block and file usage. In a file system that has either type of replication set to a
value of two, the values reported on by both the mmlsquota command and the mmrepquota command
are double the value reported by the ls command.
The EDITOR environment variable must contain a complete path name, for example:
export EDITOR=/bin/vi
Parameters
Device The device name of the file system to have default quota values set for.
File system names need not be fully-qualified. fs0 is just as acceptable as /dev/fs0.
Options
-g Specifies that the default quota value is to be applied for new groups accessing the specified file
system.
-j Specifies that the default quota value is to be applied for new filesets in the specified file system.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
GPFS must be running on the node from which the mmdefedquota command is issued.
Examples
To set default quotas for new users of the file system fs1, issue this command:
mmdefedquota -u fs1
See also
/usr/lpp/mmfs/bin
Synopsis
Description
The mmdefquotaoff command deactivates default quota limits for file systems. If default quota limits are
deactivated, new users or groups for that file system will then have a default quota limit of 0, indicating
no limit.
If neither the -u, -j or the -g option is specified, the mmdefquotaoff command deactivates all default
quotas.
If the -a option is not used, Device must be the last parameter specified.
Parameters
Device The device name of the file system to have default quota values deactivated.
If more than one file system is listed, the names must be delimited by a space. File system names
need not be fully-qualified. fs0 is just as acceptable as /dev/fs0.
Options
-a Deactivates default quotas for all GPFS file systems in the cluster. When used in combination
with the -g option, only group quotas are deactivated. When used in combination with the -u or
-j options, only user or fileset quotas, respectively, are deactivated.
-g Specifies that default quotas for groups are to be deactivated.
-j Specifies that default quotas for filesets are to be deactivated.
-u Specifies that default quotas for users are to be deactivated.
-v Prints a message for each file system in which default quotas are deactivated.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
GPFS must be running on the node from which the mmdefquotaoff command is issued.
Examples
1. To deactivate default user quotas on file system fs0, issue this command:
mmdefquotaoff -u fs0
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
The mmdefquotaon command activates default quota limits for a file system. If default quota limits are
not applied, new users, groups, or filesets for that file system will have a quota limit of 0, indicating no
limit.
To use default quotas, the file system must have been created or changed with the -Q yes option. See the
mmcrfs and mmchfs commands.
If neither the -u, -j or the -g option is specified, the mmdefquotaon command activates all default quota
limits.
If the -a option is not used, Device must be the last parameter specified.
Default quotas are established for new users, groups of users or filesets by issuing the mmdefedquota
command. Under the -d option, all users without an explicitly set quota limit will have a default quota
limit assigned.
Parameters
Device The device name of the file system to have default quota values activated.
If more than one file system is listed, the names must be delimited by a space. File system names
need not be fully-qualified. fs0 is just as acceptable as /dev/fs0.
Options
-a Activates default quotas for all GPFS file systems in the cluster. When used in combination with
the -g option, only group quotas are activated. When used in combination with the -u or -j
options, only user or fileset quotas, respectively, are activated.
-d Specifies that existing users, groups of users, or filesets with no established quota limits will have
default quota values assigned when the mmdefedquota command is issued.
If this option is not chosen, existing quota entries remain in effect and are not governed by the
default quota rules.
-g Specifies that only a default quota value for group quotas is to be activated.
-j Specifies that only a default quota value for fileset quotas is to be activated.
-u Specifies that only a default quota value for users is to be activated.
-v Prints a message for each file system in which default quotas are activated.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
GPFS must be running on the node from which the mmdefquotaon command is issued.
Examples
1. To activate default user quotas on file system fs0, issue this command:
mmdefquotaon -u fs0
To confirm the change, individually for each file system, issue this command:
mmlsfs fs1 -Q
See also
Location
/usr/lpp/mmfs/bin
Synopsis
mmdefragfs Device [-i] [-u BlkUtilPct] [-P PoolName] [-N {Node[,Node...] | NodeFile | NodeClass}]
Description
Use the mmdefragfs command to reduce fragmentation of a file system. The mmdefragfs command
moves existing file system data within a disk to make more efficient use of disk blocks. The data is
migrated to unused sub-blocks in partially allocated blocks, thereby increasing the number of free full
blocks.
The mmdefragfs command can be run against a mounted or unmounted file system. However, best
results are achieved when the file system is unmounted. When a file system is mounted, allocation status
may change causing retries to find a suitable unused sub-block.
Note: On a file system that has a very low level of fragmentation, negative numbers can be seen in the
output of mmdefragfs for free sub-blocks. This indicates that the block usage has in fact increased after
running the mmdefragfs command. If negative numbers are seen, it does not indicate a problem and you
do not need to rerun the mmdefragfs command.
Parameters
Device The device name of the file system to have fragmentation reduced. File system names need not
be fully-qualified. fs0 is as acceptable as /dev/fs0.
This must be the first parameter.
-P PoolName
Specifies the pool name to use.
-N {Node[,Node...] | NodeFile | NodeClass }
Specifies the nodes that can be used in this disk defragmentation. This parameter supports all
| defined node classes. The default is all or the current value of the defaultHelperNodes parameter
| of the mmchconfig command.
For general information on how to specify node names, see “Specifying nodes as input to GPFS
commands” on page 3.
Options
-i Specifies to query the current disk fragmentation state of the file system. Does not perform the
actual defragmentation of the disks in the file system.
-u BlkUtilPct
The average block utilization goal for the disks in the file system. The mmdefragfs command
reduces the number of allocated blocks by increasing the percent utilization of the remaining
blocks. The command automatically goes through multiple iterations until BlkUtilPct is achieved
on all of the disks in the file system or until no progress is made in achieving BlkUtilPct from one
iteration to the next, at which point it exits.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
1. To query the fragmentation state of file system fs0, issue this command:
mmdefragfs fs0 -i
The system displays information similar to:
disk disk size in full subblk in % %
name in nSubblk blocks fragments free blk blk util
--------------- --------- --------- --------- -------- -------
nsd32 327680 277504 12931 84.688 96.054
nsd33 327680 315232 580 96.201 99.823
nsd21 327680 301824 2481 92.109 99.243
nsd34 327680 275904 13598 84.199 95.850
nsd30 327680 275808 13380 84.170 95.917
nsd19 327680 278496 12369 84.990 96.225
nsd31 327680 276224 12012 84.297 96.334
--------- --------- --------- -------
(total) 2293760 2000992 67351 97.064
2. To reduce fragmentation of the file system fs0 on all defined, accessible disks that are not stopped or
suspended, issue this command:
mmdefragfs fs0
The system displays information similar to:
free subblk free
disk in full subblk in % %
name blocks blk framents free blk blk util
before after freed before after before after before after
---------- ------------------------ ------------- ------------ ------------
gpfs57nsd 28896 29888 31 1462 463 50.39 52.12 94.86 98.31
gpfs60nsd 41728 43200 46 1834 362 59.49 61.59 93.55 98.66
--------------------- -------------- -----------
(total) 70624 73088 77 3296 825 93.63 98.84
3. To reduce fragmentation of all files in the fs1 file system until the disks have 100% full block
utilization, issue this command:
mmdefragfs fs1 -u 100
The system displays information similar to:
Defragmenting file system ’fs1’...
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
Use the mmdelacl command to delete the extended entries of an access ACL of a file or directory, or to
delete the default ACL of a directory.
Parameters
Filename
The path name of the file or directory for which the ACL is to be deleted. If the -d option is
specified, Filename must contain the name of a directory.
Options
-d Specifies that the default ACL of a directory is to be deleted.
Since there can be only one NFS V4 ACL (no separate default), specifying the -d flag for a file
with an NFS V4 ACL is an error. Deleting an NFS V4 ACL necessarily removes both the ACL and
any inheritable entries contained in it.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The mmdelacl command may be issued only by the file or directory owner, the root user, or by someone
with control (c) authority in the ACL for the file.
You may issue the mmdelacl command only from a node in the GPFS cluster where the file system is
mounted.
Examples
To delete the default ACL for a directory named project2, issue this command:
mmdelacl -d project2
Location
/usr/lpp/mmfs/bin
Synopsis
mmdelcallback CallbackIdentifier[,CallbackIdentifier...]
Description
Use the mmdelcallback command to delete one or more user-defined callbacks from the GPFS system.
Parameters
CallbackIdentifier
Specifies a user-defined unique name that identifies the callback to be deleted. Use the
mmlscallback command to see the name of the callbacks that can be deleted.
Options
NONE
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
To delete the test1 callback from the GPFS system, issue this command:
mmmdelcallback test1
See also
Location
/usr/lpp/mmfs/bin
Synopsis
mmdeldisk Device {"DiskName[;DiskName...]" | -F DescFile} [-a] [-c] [-m | -r | -b] [-N {Node[,Node...] |
NodeFile | NodeClass}]
Description
The mmdeldisk command migrates all data that would otherwise be lost to the remaining disks in the
file system. It then removes the disks from the file system descriptor, preserves replication at all times,
and optionally rebalances the file system after removing the disks.
If a replacement for a failing disk is available, use the mmrpldisk command in order to keep the file
system balanced. Otherwise, use one of these procedures to delete a disk:
v If the file system is replicated, replica copies can be preserved at all times by using the default -r
option or the -b option.
v Using the -m option will not preserve replication during the deldisk step because it will only copy the
minimal amount of data off the disk being deleted so that every block has at least one copy. Also,
using the -a option will not preserve replication during the deldisk step, but will then re-establish
replication during the subsequent restripe step.
v If you want to move all data off the disk before running mmdeldisk, use mmchdisk to suspend all the
disks that will be deleted and run mmrestripefs with the -r or -b option. This step is no longer
necessary, now that mmdeldisk does the same function. If mmdeldisk fails (or is canceled), it leaves
the disks in the suspended state, and mmdeldisk can be retried when the problem that caused
mmdeldisk to stop is corrected.
v If the disk is permanently damaged and the file system is not replicated, or if the mmdeldisk
command repeatedly fails, see the General Parallel File System: Problem Determination Guide and search
for Disk media failure.
If the last disk in a storage pool is deleted, the storage pool is deleted. The mmdeldisk command is not
permitted to delete the system storage pool. A storage pool must be empty in order for it to be deleted.
Results
Upon successful completion of the mmdeldisk command, these tasks are completed:
v Data that has not been replicated from the target disks is migrated to other disks in the file system.
v Remaining disks are rebalanced, if specified.
Parameters
Device The device name of the file system to delete the disks from. File system names need not be
fully-qualified. fs0 is as acceptable as /dev/fs0. This must be the first parameter.
Options
-a Specifies that the mmdeldisk command not wait for rereplicating or rebalancing to complete
before returning. When this flag is specified, the mmdeldisk command runs asynchronously and
returns after the file system descriptor is updated and the rebalancing scan is started, but it does
not wait for rebalancing to finish. If no rebalancing is requested (-r option is not specified), this
option has no effect.
If -m is specified, this option has no effect. If -r or -b is specified (no option defaulting to -r ),
then the deldisk step is done using -m, and the restripe step is done using the specified option.
-b Rebalances the blocks onto the other disks while moving data off the disks being deleted. This
might have to move much more data than the -r operation.
Note: Rebalancing of files is an I/O intensive and time consuming operation, and is important
only for file systems with large files that are mostly invariant. In many cases, normal file update
and creation will rebalance your file system over time, without the cost of the rebalancing.
-c Specifies that processing continues even in the event that unreadable data exists on the disks
being deleted. Data that has not been replicated is lost. Replicated data is not lost as long as the
disks containing the replication are accessible.
-m Does minimal data copying to preserve any data that is located only on the disks being removed.
This is the fastest way to get a disk out of the system, but it could reduce replication of some
blocks of the files and metadata.
Note: This might be I/O intensive if there is a lot of data to be copied or rereplicated off the
disks that are being deleted.
-r Preserves replication of all files and metadata during the mmdeldisk operation (except when the
-a option is specified). This is the default.
Note: This might be I/O intensive if there is a lot of data to be copied or rereplicated off the
disks that are being deleted.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
Examples
To delete gpfs2nsd and gpfs3nsd from file system fs0 and rebalance the files across the remaining disks,
issue this command:
mmdeldisk fs2 hd34vsd3
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
The mmdelfileset command deletes a GPFS fileset. The mmdelfileset command fails if the fileset is not
empty or is currently linked into the name space, unless the -f flag is specified.
If the deleted fileset is included in a snapshot, the fileset is deleted from the active file system, but
remains part of the file system in a deleted state. Filesets in the deleted state are displayed by the
mmlsfileset command with their names in parenthesis. If the -L flag is specified, the latest including
snapshot is also displayed. A deleted fileset's contents are still available in the snapshot (that is, through
some path name containing a .snapshots component), since it was saved when the snapshot was created.
mmlsfileset command illustrates the display of a deleted fileset. When the last snapshot that includes the
fileset has been deleted, the fileset is fully removed from the file system.
For information on GPFS filesets, see the topic Information Lifecycle Management for GPFS in General Parallel
File System: Advanced Administration Guide.
Parameters
Device The device name of the file system that contains the fileset.
File system names need not be fully-qualified. fs0 is as acceptable as /dev/fs0.
FilesetName
Specifies the name of the fileset to be deleted.
Options
-f Forces the deletion of the fileset. All fileset contents are deleted. Any child filesets are first
unlinked.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
1. This sequence of commands illustrates what happens when attempting to delete a fileset that is
linked.
2. This sequence of commands illustrates what happens when attempting to delete a fileset that contains
user files.
a. Command:
mmlsfileset gpfs1
The system displays output similar to:
Filesets in file system ’gpfs1’:
Name Status Path
root Linked /gpfs1
fset1 Linked /gpfs1/fset1
fset2 Unlinked --
b. Command:
mmdelfileset gpfs1 fset2
The system displays output similar to:
Fileset ’fset2’ contains user files,
but can be deleted with the "-f" option.
c. Command:
mmdelfileset gpfs1 fset2 -f
The system displays output similar to:
Deleting user files ...
83 % complete on Tue Dec 17 11:38:25 2005
86 % complete on Tue Dec 17 11:42:47 2005
88 % complete on Tue Dec 17 11:43:13 2005
91 % complete on Tue Dec 17 11:44:15 2005
94 % complete on Tue Dec 17 11:45:20 2005
97 % complete on Tue Dec 17 11:50:14 2005
100 % complete on Tue Dec 17 11:50:47 2005
Fileset ’fset2’ deleted.
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
The mmdelfs command removes all the structures for the specified file system from the nodes in the
cluster.
Before you can delete a file system using the mmdelfs command, you must unmount it on all nodes.
Results
Upon successful completion of the mmdelfs command, these tasks are completed on all nodes:
v Deletes the character device entry from /dev.
v Removes the mount point directory where the file system had been mounted.
Parameters
Device The device name of the file system to be removed. File system names need not be fully-qualified.
fs0 is as acceptable as /dev/fs0.
This must be the first parameter.
Options
-p Indicates that the disks are permanently damaged and the file system information should be
removed from the GPFS cluster data even if the disks cannot be marked as available.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
You must have root authority to run the mmdelfs command.
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
Use the mmdelnode command to delete one or more nodes from the GPFS cluster. You may issue the
mmdelnode command on any GPFS node.
| Each GPFS cluster is managed independently, so there is no automatic coordination and propagation of
changes between clusters like there is between the nodes within a cluster. This means that if you
permanently delete nodes that are being used as contact nodes by other GPFS clusters that can mount
your file systems, you should notify the administrators of those GPFS clusters so that they can update
their own environments.
Results
Upon successful completion of the mmdelnode command, the specified nodes are deleted from the GPFS
cluster.
Options
None.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
You may issue the mmdelnode command from any node that will remain in the GPFS cluster.
Examples
1. To delete all of the nodes in the cluster, issue this command:
mmdelnode -a
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Or,
Description
NSDs being deleted cannot be part of any file system. Either the mmdeldisk or mmdelfs command must
be issued prior to deleting the NSDs from the GPFS cluster.
NSDs being deleted cannot be tiebreaker disks. Use the mmchconfig command to assign new tiebreaker
disks prior to deleting NSDs from the cluster. For information on tiebreaker disks, see the Quorum topic
in the GPFS: Concepts, Planning, and Installation Guide.
Results
Upon successful completion of the mmdelnsd command, these tasks are completed:
v All references to the disks are removed from the GPFS cluster data.
v Sector 2 of each disk is cleared of its unique NSD volume ID.
v On Windows, the disk's GPT partition table is removed leaving the disk Unknown/Not Initialized.
Parameters
DiskName[;DiskName...]
Specifies the names of the NSDs to be deleted from the GPFS cluster. Specify the names
generated when the NSDs were created. Use the mmlsnsd -F command to display disk names. If
there is more than one disk to be deleted, delimit each name with a semicolon (;) and enclose the
list of disk names in quotation marks.
-F DiskFile
Specifies a file containing the names of the NSDs, one per line, to be deleted from the GPFS
cluster.
-N Node[,Node]
Specifies the nodes to which the disk is attached. If no nodes are listed, the disk is assumed to be
directly attached to the local node.
For general information on how to specify node names, see “Specifying nodes as input to GPFS
commands” on page 3.
-p NSDId
Specifies the NSD volume ID of an NSD that needs to be cleared from the disk as indicated by
the failure of a previous invocation of the mmdelnsd command.
NONE
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
Once the mmdelsnapshot command has been issued, the snapshot is marked for deletion and cannot be
recovered.
If the node from which the mmdelsnapshot command is issued fails, reissue the command from another
node in the cluster to complete the deletion. Prior to reissuing a subsequent mmdelsnapshot command,
the file system may be recovered, mounted, and updates may continue to be made and the
mmcrsnapshot command may be issued. However, the mmrestorefs and mmdelsnapshot commands
may not be issued on other snapshots until the present snapshot is successfully deleted.
If the mmdelsnapshot command is issued while a conflicting command is running, the mmdelsnapshot
command waits for that command to complete. Conflicting operations include:
1. Other snapshot commands on the same snapshot
2. Adding, deleting, replacing disks in the file system
3. Rebalancing, repairing, reducing disk fragmentation in a file system
Any files open in the snapshot will be forcibly closed. The user will receive an errno of ESTALE on the
next file access.
Parameters
Device The device name of the file system for which the snapshot is to be deleted. File system names
need not be fully-qualified. fs0 is just as acceptable as /dev/fs0.
This must be the first parameter.
| SnapshotName
| The name of the snapshot to be deleted
-N {Node [,Node...] | NodeFile | NodeClass}
Specifies the nodes that participate in the deleting of the snapshot. This command supports all
| defined node classes. The default is all or the current value of the defaultHelperNodes parameter
| of the mmchconfig command.
For general information on how to specify node names, see “Specifying nodes as input to GPFS
commands” on page 3.
Options
NONE
Exit status
0 Successful completion.
nonzero
A failure has occurred.
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
To delete the snapshot snap1, for the file system fs1, issue this command:
mmdelsnapshot fs1 snap1
Before issuing the command, the directory structure would appear similar to:
/fs1/file1
/fs1/userA/file2
/fs1/userA/file3
/fs1/.snapshots/snap1/file1
/fs1/.snapshots/snap1/userA/file2
/fs1/.snapshots/snap1/userA/file3
After the command has been issued, the directory structure would appear similar to:
/fs1/file1
/fs1/userA/file2
/fs1/userA/file3
/fs1/.snapshots
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
Use the mmdf command to display available file space on a GPFS file system. For each disk in the GPFS
file system, the mmdf command displays this information, by failure group and storage pool:
v The size of the disk.
v The failure group of the disk.
v Whether the disk is used to hold data, metadata, or both.
v Available space in full blocks.
v Available space in fragments.
Displayed values are rounded down to a multiple of 1024 bytes. If the fragment size used by the file
system is not a multiple of 1024 bytes, then the displayed values may be lower than the actual values.
This can result in the display of a total value that exceeds the sum of the rounded values displayed for
individual disks. The individual values are accurate if the fragment size is a multiple of 1024 bytes.
The mmdf command may be run against a mounted or unmounted file system.
Notes:
1. This command is I/O intensive and should be run when the system load is light.
2. An asterisk at the end of a line means that this disk is in a state where it is not available for new
block allocation.
Parameters
Device The device name of the file system to be queried for available file space. File system names need
not be fully-qualified. fs0 is as acceptable as /dev/fs0.
This must be the first parameter.
Options
-d List only disks that can hold data.
-F List the number of inodes and how many of them are free.
-m List only disks that can hold metadata.
-P PoolName
Lists only disks that belong to the requested storage pool.
Exit status
0 Successful completion.
Security
If you are a root user, the node on which the command is issued must be able to execute remote shell
commands on any other node in the cluster without the use of a password and without producing any
extraneous messages. See “Requirements for administering a GPFS file system” on page 1.
If you are a non-root user, you may specify only file systems that belong to the same cluster as the node
on which the mmdf command was issued.
Examples
1. To query all disks in the fs2 file system that can hold data, issue this command:
mmdf fs2 -d
The system displays information similar to:
disk disk size failure holds holds free KB in free KB in
name in KB group metadata data full blocks fragments
----------- --------- -------- -------- ---- ----------- ---------
Disks in storage pool: system (Maximum disk size allowed is 833 GB)
------------- -------- ----------
(pool total) 0 0 (0%) 0 (0%)
Disks in storage pool: sp1 (Maximum disk size allowed is 359 GB)
gpfs1002nsd 8897968 1 no yes 8342016 (94%) 928 (0%)
--------- ------------- ------------
(pool total) 8897968 8342016 (94%) 928 (0%)
Disks in storage pool: sp1 (Maximum disk size allowed is 359 GB)
--------- ------------ ----–----
(pool total) 0 0 (0%) 0 (0%)
See also
Location
/usr/lpp/mmfs/bin
| Synopsis
| Description
| Use the mmdiag command to query various aspects of the GPFS internal state for troubleshooting and
| tuning purposes. The mmdiag command displays information about the state of GPFS on the node where
| it is executed. The command obtains the required information by querying the GPFS daemon process
| (mmfsd), and thus will only function when the GPFS daemon is running.
| Results
| The mmdiag command displays the requested information and returns 0 if successful.
| Parameters
| None.
| Options
| --version
| Displays information about the GPFS build currently running on this node. This helps in
| troubleshooting installation problems. The information displayed here may be more
| comprehensive than version information available via the OS package management infrastructure,
| in particular when an e-fix is installed.
| --waiters
| Displays mmfsd threads waiting for events. This information can be very helpful in
| troubleshooting deadlocks and performance problems. For each thread, the thread name, wait
| time in seconds, and wait reason are typically shown. Only non-idle threads currently waiting for
| some event to occur are displayed. Note that only mmfsd threads are shown; any application IO
| threads that might be waiting in GPFS kernel code would not be present here.
| --threads
| Displays mmfsd thread statistics and the list of active threads. For each thread, its type and
| kernel thread ID are shown. All non-idle mmfsd threads are shown. For those threads that are
| currently waiting for an event, the wait reason and wait time in seconds are shown. This
| information provides more detail than the data displayed by mmdiag --waiters.
| --network
| Displays information about mmfsd network connections and pending Remote Procedure Calls
| (RPCs). Basic information and statistics about all existing mmfsd network connections to other
| nodes is displayed, including information about broken connections. If there are currently any
| RPCs pending (that is, sent but not yet replied to), the information about each one is shown,
| including the list of RPC destinations and the status of the request for each destination. This
| information can be very helpful in following a multinode chain of dependencies during a
| deadlock or performance-problem troubleshooting.
| --config
| Displays configuration parameters and their settings. The list of configuration parameters shown
| here consists of configuration parameters known to mmfsd. Note that some configuration
| Exit status
| 0 Successful completion.
| nonzero
| A failure has occurred.
| Security
| Examples
| 1. To display a list of waiters, issue this command:
| mmdiag --waiters
| The system displays output similar to the following:
| === mmdiag: waiters ===
| 0x11DA520 waiting 0.001147000 seconds, InodePrefetchWorker:
| for I/O completion
| 0x2AAAAAB02830 waiting 0.002152000 seconds, InodePrefetchWorker:
| for I/O completion
| 0x2AAAAB103990 waiting 0.000593000 seconds, InodePrefetchWorker:
| for I/O completion
| 0x11F51E0 waiting 0.000612000 seconds, InodePrefetchWorker:
| for I/O completion
| 0x11EDE60 waiting 0.005736500 seconds, InodePrefetchWorker:
| on ThMutex 0x100073ABC8 (0xFFFFC2000073ABC8)
| (CacheReplacementListMutex)
| In this example, all waiters have a very short wait duration and represent a typical snapshot of
| normal GPFS operation.
| 2. To display information about GPFS memory utilization, issue this command:
| mmdiag --memory
| The system displays output similar to the following:
| See also
| Not applicable.
| Location
| /usr/lpp/mmfs/bin
Synopsis
Description
Use the mmeditacl command for interactive editing of the ACL of a file or directory. This command uses
the default editor, specified in the EDITOR environment variable, to display the current access control
information, and allows the file owner to change it. The command verifies the change request with the
user before making permanent changes.
The EDITOR environment variable must contain a complete path name, for example:
export EDITOR=/bin/vi
For information about NFS V4 ACLs, see Chapter 6, “Managing GPFS access control lists and NFS
export,” on page 51 and “NFS and GPFS” on page 59.
Users may need to see ACLs in their true form as well as how they are translated for access evaluations.
There are four cases:
1. By default, mmeditacl returns the ACL in a format consistent with the file system setting, specified
using the -k flag on the mmcrfs or mmchfs commands.
v If the setting is posix, the ACL is shown as a traditional ACL.
v If the setting is nfs4, the ACL is shown as an NFS V4 ACL.
v If the setting is all, the ACL is returned in its true form.
2. The command mmeditacl -k nfs4 always produces an NFS V4 ACL.
3. The command mmeditacl -k posix always produces a traditional ACL.
4. The command mmeditacl -k native always shows the ACL in its true form regardless of the file
system setting.
The following describes how mmeditacl works for POSIX and NFS V4 ACLs:
Command ACL mmcrfs -k Display -d (default)
------------------- ----- --------- ------------- --------------
mmeditacl posix posix Access ACL Default ACL
mmeditacl posix nfs4 NFS V4 ACL Error[1]
mmeditacl posix all Access ACL Default ACL
mmeditacl nfs4 posix Access ACL[2] Default ACL[2]
mmeditacl nfs4 nfs4 NFS V4 ACL Error[1]
mmeditacl nfs4 all NFS V4 ACL Error[1]
mmeditacl -k native posix any Access ACL Default ACL
mmeditacl -k native nfs4 any NFS V4 ACL Error[1]
mmeditacl -k posix posix any Access ACL Default ACL
mmeditacl -k posix nfs4 any Access ACL[2] Default ACL[2]
mmeditacl -k nfs4 any any NFS V4 ACL Error[1]
---------------------------------------------------------------------
[1] NFS V4 ACLs include inherited entries. Consequently, there cannot
be a separate default ACL.
[2] Only the mode entries (owner, group, everyone) are translated.
The rwx values are derived from the
NFS V4 file mode attribute. Since the NFS V4 ACL is more granular
in nature, some information is lost in this translation.
---------------------------------------------------------------------
Depending on the file system's -k setting (posix, nfs4, or all), mmeditacl may be restricted. The
mmeditacl command is not allowed to store an NFS V4 ACL if -k posix is in effect, and is not allowed to
store a POSIX ACL if -k nfs4 is in effect. For more information, see the description of the -k flag for the
mmchfs, mmcrfs, and mmlsfs commands.
Parameters
Filename
The path name of the file or directory for which the ACL is to be edited. If the -d option is
specified, Filename must contain the name of a directory.
Options
-d Specifies that the default ACL of a directory is to be edited.
-k {nfs4 | posix | native}
nfs4 Always produces an NFS V4 ACL.
posix Always produces a traditional ACL.
native Always shows the ACL in its true form regardless of the file system setting.
This option should not be used for routine ACL manipulation. It is intended to provide a way to
show the translations that are done. For example, if a posix ACL is translated by NFS V4. Beware
that if the -k nfs4 flag is used, but the file system does not allow NFS V4 ACLs, you will not be
able to store the ACL that is returned. If the file system does support NFS V4 ACLs, the -k nfs4
flag is an easy way to convert an existing posix ACL to nfs4 format.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
You may issue the mmeditacl command only from a node in the GPFS cluster where the file system is
mounted.
The mmeditacl command may be used to display an ACL. POSIX ACLs may be displayed by any user
with access to the file or directory. NFS V4 ACLs have a READ_ACL permission that is required for
non-privileged users to be able to see an ACL. To change an existing ACL, the user must either be the
owner, the root user, or someone with control permission (WRITE_ACL is required where the existing
ACL is of type NFS V4).
Examples
To edit the ACL for a file named project2.history, issue this command:
mmeditacl project2.history
The current ACL entries are displayed using the default editor, provided that the EDITOR environment
variable specifies a complete path name. When the file is saved, the system displays information similar
to:
See also
Location
/usr/lpp/mmfs/bin
Synopsis
mmedquota {-u [-p ProtoUser] User...| -g [-p ProtoGroup] Group... | -j [-p ProtoFileset ] Device:Fileset... | -d
{-u User...| -g Group... | -j Device:Fileset...} | -t {-u | -g | -j}}
Description
The mmedquota command displays the current values for these limits, if any, and prompts you to enter
new values using your default editor:
v current block usage (display only)
v current inode usage (display only)
v inode soft limit
v inode hard limit
v block soft limit
Displayed in KB, but may be specified using g, G, k, K, m, or M. If no suffix is provided, the number
is assumed to be in bytes.
v block hard limit
Displayed in KB, but may be specified using g, G, k, K, m, or M. If no suffix is provided, the number
is assumed to be in bytes.
The mmedquota command waits for the edit window to be closed before checking and applying new
values. If an incorrect entry is made, reissue the command and enter the correct values.
You can also use the mmedquota command to change the file system-specific grace periods for block and
file usage if the default of one week is unsatisfactory. The grace period is the time during which users
can exceed the soft limit. If the user, group, or fileset does not show reduced usage below the soft limit
before the grace period expires, the soft limit becomes the new hard limit.
When setting quota limits for a file system, replication within the file system should be considered. GPFS
quota management takes replication into account when reporting on and determining if quota limits have
been exceeded for both block and file usage. In a file system that has either type of replication set to a
value of two, the values reported by both the mmlsquota command and the mmrepquota command are
double the value reported by the ls command.
The EDITOR environment variable must contain a complete path name, for example:
export EDITOR=/bin/vi
Parameters
User Name or user ID of target user for quota editing.
Options
-d Reestablish default quota limits for a specific user, group, or fileset that has had an explicit quota
limit set by a previous invocation of the mmedquota command.
-g Sets quota limits or grace times for groups.
-j Sets quota limits or grace times for filesets.
-p Applies already-established limits to a particular user, group or fileset.
When invoked with the -u option, ProtoUser limits are automatically applied to the specified User
or space-delimited list of users.
When invoked with the -g option, ProtoGroup limits are automatically applied to the specified
Group or space-delimited list of groups.
When invoked with the -j option, ProtoFileset limits are automatically applied to the specified
fileset or space-delimited list of fileset names.
You can specify any user as a ProtoUser for another User, or any group as a ProtoGroup for
another Group, or any fileset as a ProtoFileset for another Fileset.
-t Sets grace period during which quotas can exceed the soft limit before it is imposed as a hard
limit. The default grace period is one week.
This flag is followed by one of the following flags: -u, -g or -j, to specify whether the changes
apply to users, groups, or filesets respectively.
-u Sets quota limits or grace times for users.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
GPFS must be running on the node from which the mmedquota command is issued.
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
1. To set user quotas for userid paul, issue this command:
mmedquota -u paul
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
The mmexportfs command, in conjunction with the mmimportfs command, can be used to move one or
more GPFS file systems from one GPFS cluster to another GPFS cluster, or to temporarily remove file
systems from the cluster and restore them at a later time. The mmexportfs command retrieves all relevant
file system and disk information and stores it in the file specified with the -o parameter. This file must
later be provided as input to the mmimportfs command. When running the mmexportfs command, the
file system must be unmounted on all nodes.
When all is specified in place of a file system name, any disks that are not associated with a file system
will be exported as well.
Exported file systems remain unusable until they are imported back with the mmimportfs command to
the same or a different GPFS cluster.
Results
Upon successful completion of the mmexportfs command, all configuration information pertaining to the
exported file system and its disks is removed from the configuration data of the current GPFS cluster and
is stored in the user specified file ExportFilesysData.
Parameters
Device | all
The device name of the file system to be exported. File system names need not be fully-qualified.
fs0 is as acceptable as /dev/fs0. Specify all to export all GPFS file systems, as well as all disks that
do not belong to a file system yet. This must be the first parameter.
-o ExportFilesysData
The path name of a file to which the file system information is to be written. This file must be
provided as input to the subsequent mmimportfs command.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
To export all file systems in the current cluster, issue this command:
mmexportfs all -o /u/admin/exportfile
mmexportfs: Processing disks that do not belong to any file system ...
mmexportfs: 6027-1371 Propagating the changes to all affected
nodes. This is an asynchronous process.
See also
Location
/usr/lpp/mmfs/bin
Synopsis
| mmfsck Device [-n | -y] [-c | -o] [-t Directory] [ -s |-v | -V] [-N {Node[,Node...] | NodeFile | NodeClass}]
The file system must be unmounted before you can run the mmfsck command with any option other
than -o.
Description
The mmfsck command in offline mode is intended to be used only in situations where there have been
disk or communications failures that have caused MMFS_FSSTRUCT error log entries to be issued, or
where it is known that disks have been forcibly removed or otherwise permanently unavailable for use in
the file system, and other unexpected symptoms are seen by users. In general it is unnecessary to run
mmfsck in offline mode unless under the direction of the IBM Support Center.
If neither the -n nor -y flag is specified, the mmfsck command runs interactively prompting you for
permission to repair each consistency error as reported. It is suggested that in all but the most severely
damaged file systems, you run the mmfsck command interactively (the default).
The occurrence of I/O errors, or the appearance of a message telling you to run the mmfsck command,
may indicate file system inconsistencies. If either situation occurs, use the mmfsck command to check file
system consistency and interactively repair the file system.
For information about file system maintenance and repair, see “Checking and repairing a file system” on
page 20. The mmfsck command checks for these inconsistencies:
v Blocks marked allocated that do not belong to any file. The corrective action is to mark the block free
in the allocation map.
v Files for which an inode is allocated and no directory entry exists (orphaned files). The corrective
action is to create directory entries for these files in a lost+found subdirectory of the fileset to which
the orphaned file or directory belongs. The index number of the inode is assigned as the name. If you
do not allow the mmfsck command to reattach an orphaned file, it asks for permission to delete the
file.
v Directory entries pointing to an inode that is not allocated. The corrective action is to remove the
directory entry.
v Incorrectly formed directory entries. A directory file contains the inode number and the generation
number of the file to which it refers. When the generation number in the directory does not match the
generation number stored in the file's inode, the corrective action is to remove the directory entry.
v Incorrect link counts on files and directories. The corrective action is to update them with accurate
counts.
v Policy files are not valid. The corrective action is to delete the file.
v Various problems related to filesets: missing or corrupted fileset metadata, inconsistencies in directory
structure related to filesets, missing or corrupted fileset root directory, other problems in internal data
structures.
If you are repairing a file system due to node failure and the file system has quotas enabled, it is
suggested that you run the mmcheckquota command to recreate the quota files.
Indications leading you to the conclusion that you should run the mmfsck command include:
v An MMFS_FSSTRUCT along with an MMFS_SYSTEM_UNMOUNT error log entry on any node
indicating some critical piece of the file system is inconsistent.
For further information on recovery actions and how to contact the IBM Support Center, see the General
Parallel File System: Problem Determination Guide.
If you are running the online mmfsck command to free allocated blocks that do not belong to any files,
plan to make file system repairs when system demand is low. This is an I/O intensive activity and it can
affect system performance.
Results
If the file system is inconsistent, the mmfsck command displays information about the inconsistencies
and (depending on the option entered) may prompt you for permission to repair them. The mmfsck
command tries to avoid actions that may result in loss of data. In some cases, however, it may indicate
| the destruction of a damaged file.
| All corrective actions, with the exception of recovering lost disk blocks (blocks that are marked as
allocated but do not belong to any file), require that the file system be unmounted on all nodes. If the
mmfsck command is run on a mounted file system, lost blocks are recovered but any other
inconsistencies are only reported, not repaired.
If a bad disk is detected, the mmfsck command stops the disk and writes an entry to the error log. The
operator must manually start and resume the disk when the problem is fixed.
The file system must be unmounted on all nodes before the mmfsck command can repair file system
inconsistencies.
Parameters
Device The device name of the file system to be checked and repaired. File system names need not be
fully-qualified. fs0 is as acceptable as /dev/fs0.
This must be the first parameter.
-N {Node[,Node...] | NodeFile | NodeClass}
Specify the nodes to participate in the check and repair of the file system. This command
| supports all defined node classes. The default is all or the current value of the
| defaultHelperNodes parameter of the mmchconfig command.
For general information on how to specify node names, see “Specifying nodes as input to GPFS
commands” on page 3.
Options
-c When the file system log has been lost and the file system is replicated, this option specifies that
the mmfsck command attempt corrective action by comparing the replicas of metadata and data.
If this error condition occurs, it is indicated by an error log entry. The -c and -o flags are mutually
exclusive.
-n Specifies a no response to all prompts from the mmfsck command. The option reports
inconsistencies but it does not change the file system. To save this information, redirect it to an
output file when you issue the mmfsck command.
-o Specifies that the file system can be mounted during the operation of the mmfsck command.
Online mode does not perform a full file system consistency check, but blocks marked as
allocated that do not belong to a file are recovered. The -c and -o flags are mutually exclusive.
Exit status
0 Successful completion.
2 The command was interrupted before it completed checks or repairs.
4 The command changed the file system and it must now be restarted.
8 The file system contains damage that has not been repaired.
16 The problem cannot be fixed.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
1. To run the mmfsck command on the fs1 file system, receive a report, but not fix inconsistencies, issue
this command:
mmfsck fs1 -n
1220704 subblocks
18277 allocated
0 unreferenced
0 deletable
0 deallocated
687 addresses
0 suspended
40960 inodes
40 allocated
3 repairable
2 repaired
0 damaged
0 deallocated
2 orphaned
2 attached
1220704 subblocks
18791 allocated
2 unreferenced
0 deletable
2 deallocated
705 addresses
0 suspended
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Or,
Or,
Description
Use the mmfsctl command to issue control requests to a particular GPFS file system. The command is
used to temporarily suspend the processing of all application I/O requests, and later resume them, as
well as to synchronize the file system's configuration state between peer clusters in disaster recovery
environments.
See Establishing disaster recovery for your GPFS cluster in General Parallel File System: Advanced Administration
Guide.
Before creating a FlashCopy® image of the file system, the user must run mmfsctl suspend to temporarily
quiesce all file system activity and flush the internal buffers on all nodes that mount this file system. The
on-disk metadata will be brought to a consistent state, which provides for the integrity of the FlashCopy
snapshot. If a request to the file system is issued by the application after the invocation of this command,
GPFS suspends this request indefinitely, or until the user issues mmfsctl resume.
Once the FlashCopy image has been taken, the mmfsctl resume command can be issued to resume the
normal operation and complete any pending I/O requests.
The mmfsctl syncFSconfig command extracts the file system's related information from the local GPFS
configuration data, transfers this data to one of the nodes in the peer cluster, and attempts to import it
there.
Once the GPFS file system has been defined in the primary cluster, users run this command to import the
configuration of this file system into the peer recovery cluster. After producing a FlashCopy image of the
file system and propagating it to the peer cluster using Peer-to-Peer Remote Copy (PPRC), users similarly
run this command to propagate any relevant configuration changes made in the cluster after the previous
snapshot.
The primary cluster configuration server of the peer cluster must be available and accessible using remote
shell and remote copy at the time of the invocation of the mmfsctl syncFSconfig command. Also, the
peer GPFS clusters should be defined to use the same remote shell and remote copy mechanism, and
they must be set up to allow nodes in peer clusters to communicate without the use of a password.
Not all administrative actions performed on the file system necessitate this type of resynchronization. It is
required only for those actions that modify the file system information maintained in the local GPFS
configuration data, which includes:
v Additions, removals, and replacements of disks (commands mmadddisk, mmdeldisk, mmrpldisk)
v Modifications to disk attributes (command mmchdisk)
The process of synchronizing the file system configuration data can be automated by utilizing the
syncfsconfig user exit.
The mmfsctl exclude command is to be used only in a disaster recovery environment, only after a
disaster has occurred, and only after ensuring that the disks in question have been physically
disconnected. Otherwise, unexpected results may occur.
The mmfsctl exclude command can be used to manually override the file system descriptor quorum after
a site-wide disaster. See Establishing disaster recovery for your GPFS cluster in General Parallel File System:
Advanced Administration Guide. This command enables users to restore normal access to the file system
with less than a quorum of available file system descriptor replica disks, by effectively excluding the
specified disks from all subsequent operations on the file system descriptor. After repairing the disks, the
mmfsctl include command can be issued to restore the initial quorum configuration.
Parameters
Device The device name of the file system. File system names need not be fully-qualified. fs0 is just as
acceptable as /dev/fs0. If all is specified with the syncFSconfig option, this command is
performed on all GPFS file systems defined in the cluster.
exclude
Instructs GPFS to exclude the specified group of disks from all subsequent operations on the file
system descriptor, and change their availability state to down, if the conditions in the Note below
are met.
If necessary, this command assigns additional disks to serve as the disk descriptor replica holders,
and migrate the disk descriptor to the new replica set. The excluded disks are not deleted from
the file system, and still appear in the output of the mmlsdisk command.
Note: The mmfsctl exclude command is to be used only in a disaster recovery environment, only
after a disaster has occurred, and only after ensuring that the disks in question have been
physically disconnected. Otherwise, unexpected results may occur.
include
Informs GPFS that the previously excluded disks have become operational again. This command
writes the up-to-date version of the disk descriptor to each of the specified disks, and clears the
excl tag.
resume
Instructs GPFS to resume the normal processing of I/O requests on all nodes.
suspend
Instructs GPFS to flush the internal buffers on all nodes, bring the file system to a consistent state
on disk, and suspend the processing of all subsequent application I/O requests.
syncFSconfig
Synchronizes the configuration state of a GPFS file system between the local cluster and its peer
in two-cluster disaster recovery configurations.
| -C RemoteClusterName
Specifies the name of the GPFS cluster that owns the remote GPFS file system.
-d "DiskName[;DiskName...]"
Specifies the names of the NSDs to be included or excluded by the mmfsctl command. Separate
the names with semicolons (;) and enclose the list of disk names in quotation marks.
Options
None.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Results
Security
You must have root authority to run the mmfsctl command.
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
This sequence of commands creates a FlashCopy image of the file system and propagates this image to
the recovery cluster using the Peer-to-Peer Remote Copy technology. The following configuration is
assumed:
Site LUNs
Primary cluster (site A) lunA1, lunA2
Recovery cluster (site B) lunB1
Location
/usr/lpp/mmfs/bin
Synopsis
Description
For information about NFS V4 ACLs, see Chapter 6, “Managing GPFS access control lists and NFS
export,” on page 51 and “NFS and GPFS” on page 59.
Users may need to see ACLs in their true form as well as how they are translated for access evaluations.
There are four cases:
1. By default, mmgetacl returns the ACL in a format consistent with the file system setting, specified
using the -k flag on the mmcrfs or mmchfs commands.
If the setting is posix, the ACL is shown as a traditional ACL.
If the setting is nfs4, the ACL is shown as an NFS V4 ACL.
If the setting is all, the ACL is returned in its true form.
2. The command mmgetacl -k nfs4 always produces an NFS V4 ACL.
3. The command mmgetacl -k posix always produces a traditional ACL.
4. The command mmgetacl -k native always shows the ACL in its true form regardless of the file
system setting.
The following describes how mmgetacl works for POSIX and NFS V4 ACLs:
Command ACL mmcrfs -k Display -d (default)
------------------- ----- --------- ------------- --------------
mmgetacl posix posix Access ACL Default ACL
mmgetacl posix nfs4 NFS V4 ACL Error[1]
mmgetacl posix all Access ACL Default ACL
mmgetacl nfs4 posix Access ACL[2] Default ACL[2]
mmgetacl nfs4 nfs4 NFS V4 ACL Error[1]
mmgetacl nfs4 all NFS V4 ACL Error[1]
mmgetacl -k native posix any Access ACL Default ACL
mmgetacl -k native nfs4 any NFS V4 ACL Error[1]
mmgetacl -k posix posix any Access ACL Default ACL
mmgetacl -k posix nfs4 any Access ACL[2] Default ACL[2]
mmgetacl -k nfs4 any any NFS V4 ACL Error[1]
---------------------------------------------------------------------
[1] NFS V4 ACLs include inherited entries. Consequently, there cannot
be a separate default ACL.
[2] Only the mode entries (owner, group, everyone) are translated.
The rwx values are derived from the
NFS V4 file mode attribute. Since the NFS V4 ACL is more granular
in nature, some information is lost in this translation.
---------------------------------------------------------------------
Parameters
Filename
The path name of the file or directory for which the ACL is to be displayed. If the -d option is
specified, Filename must contain the name of a directory.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
You must have read access to the directory where the file exists to run the mmgetacl command.
You may issue the mmgetacl command only from a node in the GPFS cluster where the file system is
mounted.
Examples
1. To display the ACL for a file named project2.history, issue this command:
mmgetacl project2.history
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
Use the mmgetstate command to show the state of the GPFS daemon on one or more nodes.
Parameters
-a Show the state of the GPFS daemon on all nodes in the cluster.
-N {Node[,Node...] | NodeFile | NodeClass}
Directs the mmgetstate command to return GPFS daemon information for a set of nodes. For
general information on how to specify node names, see “Specifying nodes as input to GPFS
commands” on page 3.
This command does not support a NodeClass of mount.
Options
-L Display quorum, number of nodes up, total number of nodes, and other extended node
information.
-s Display summary information such as: number of local and remote nodes that have joined in the
cluster, number of quorum nodes.
-v Display intermediate error messages.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Node number Node name Quorum Nodes up Total nodes GPFS state Remarks
----------------------------------------------------------------------------
1 c5n92 3 5 12 active
2 c5n94 3 5 12 active
3 c5n95 3 5 12 active quorum node
4 c5n96 3 5 12 active
5 c5n97 3 5 12 active quorum node
6 c5n98 3 5 12 active
7 c5n107 3 5 12 active quorum node
8 c5n108 3 5 12 active
9 c5n109 3 5 12 active quorum node
10 c5n110 3 5 12 down
11 c5n111 3 5 12 active quorum node
12 c5n112 3 5 12 active
The 3 under the Quorum column means that you must have three quorum nodes up to achieve
quorum.
2. This is an example of a cluster using node quorum with tiebreaker disks. Note the * in the Quorum
field, which indicates that tiebreaker disks are being used:
mmgetstate -a -L
Summary information
---------------------
Number of nodes defined in the cluster: 12
Number of local nodes active in the cluster: 12
Number of remote nodes joined in this cluster: 0
Number of quorum nodes defined in the cluster: 5
Number of quorum nodes active in the cluster: 5
Quorum = 3, Quorum achieved
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
The mmimportfs command, in conjunction with the mmexportfs command, can be used to move into the
current GPFS cluster one or more file systems that were created in another GPFS cluster. The
mmimportfs command extracts all relevant file system and disk information from the ExportFilesysData
file specified with the -i parameter. This file must have been created by the mmexportfs command.
When all is specified in place of a file system name, any disks that are not associated with a file system
will be imported as well.
If the file systems being imported were created on nodes that do not belong to the current GPFS cluster,
the mmimportfs command assumes that all disks have been properly moved, and are online and
available to the appropriate nodes in the current cluster.
If any node in the cluster, including the node on which you are running the mmimportfs command, does
not have access to one or more disks, use the -S option to assign NSD servers to those disks.
The mmimportfs command attempts to preserve any NSD server assignments that were in effect when
| the file system was exported.
| After the mmimportfs command completes, use mmlsnsd to display the NSD server names that are
assigned to each of the disks in the imported file system. Use mmchnsd to change the current NSD
server assignments as needed.
After the mmimportfs command completes, use mmlsdisk to display the failure groups to which each
disk belongs. Use mmchdisk to make adjustments if necessary.
If you are importing file systems into a cluster that already contains GPFS file systems, it is possible to
encounter name conflicts. You must resolve such conflicts before the mmimportfs command can succeed.
You can use the mmchfs command to change the device name and mount point of an existing file
system. If there are disk name conflicts, use the mmcrnsd command to define new disks and specify
unique names (rather than let the command generate names). Then replace the conflicting disks using
mmrpldisk and remove them from the cluster using mmdelnsd.
Results
Upon successful completion of the mmimportfs command, all configuration information pertaining to the
file systems being imported is added to configuration data of the current GPFS cluster.
Parameters
Device | all
The device name of the file system to be imported. File system names need not be fully-qualified.
fs0 is as acceptable as /dev/fs0. Specify all to import all GPFS file systems, as well as all disks
that do not belong to a file system yet. This must be the first parameter.
-i ImportfsFile
The path name of the file containing the file system information. This file must have previously
been created with the mmexportfs command.
Note:
1. You cannot change the name of a disk. You cannot change the disk usage or failure group
assignment with the mmimportfs command. Use the mmchdisk command for this purpose.
2. All disks that do not have descriptors in ChangeSpecFile are assigned the NSD servers that
they had at the time the file system was exported. All disks with NSD servers that are not
valid are assumed to be SAN-attached to all nodes in the cluster. Use the mmchnsd command
to assign new or change existing NSD server nodes.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
To import all file systems in the current cluster, issue this command:
mmimportfs all -i /u/admin/exportfile
mmimportfs: Processing disks that do not belong to any file system ...
mmimportfs: Processing disk gpfs6nsd
mmimportfs: Processing disk gpfs1001nsd
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
The mmlinkfileset command creates a junction at JunctionPath that references the root directory of
FilesetName. The junction is a special directory entry, much like a POSIX hard link, that connects a name
in a directory of one fileset, the parent, to the root directory of a child fileset. From the user's viewpoint,
a junction always appears as if it were a directory, but the user is not allowed to issue the unlink or
rmdir commands on a junction. Instead, the mmunlinkfileset command must be used to remove a
junction.
If JunctionPath is not specified, the junction is created in the current directory with the name FilesetName.
The user may use the mv command on the directory to move to a new location in the parent fileset, but
the mv command is not allowed to move the junction to a different fileset.
For information on GPFS filesets, see the GPFS: Advanced Administration Guide.
Parameters
Device The device name of the file system that contains the fileset.
File system names need not be fully-qualified. fs0 is as acceptable as /dev/fs0.
FilesetName
Specifies the name of the fileset to be linked. It must not already be linked into the namespace.
-J JunctionPath
Specifies the name of the junction. The name must not refer to an existing file system object.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
This command links fileset fset1 in file system gpfs1 to junction path /gpfs1/fset1:
mmlinkfileset gpfs1 fset1 -J /gpfs1/fset1
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
Results
Parameters
FileName
The name of the file to be queried. You must enter at least one file name.
Options
-l Specifies that this command works only with regular files and directories and does not follow
symlinks. The default is to follow symlinks.
-L Displays additional file attributes:
v The file's assigned storage pool name.
v The name of the fileset that includes the file.
v The name of the snapshot that includes the file.
If the file is not part of a snapshot, an empty string is displayed.
| v Whether the file is exposed, ill replicated, ill placed, or unbalanced (displayed under the flags
| heading).
v Whether the file is immutable.
| v Whether the file is in appendOnly mode.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
You may issue the mmlsattr command only from a node in the GPFS cluster where the file system is
mounted.
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
Use the mmlscallback command to list some or all of the callbacks that are currently registered in the
GPFS system.
Parameters
| CallbackIdentifier
| Indicates the callback for which information is displayed.
| user Indicates all user-defined callbacks. This is the default.
| system
| Indicates all system-defined callbacks.
| all Indicates all callbacks currently registered with the system.
Options
NONE
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
To list all of the callbacks that are currently in the GPFS system, issue this command:
mmlscallback
test2
command = /tmp/myScript2
event = shutdown
parms = %upNodes
To list a specific callback (for example, test2) that is currently in the GPFS system, issue this command:
See also
Location
/usr/lpp/mmfs/bin
Synopsis
mmlscluster [--cnfs]
Description
Use the mmlscluster command to display the current configuration information for a GPFS cluster.
Parameters
NONE
Options
--cnfs Displays information about clustered NFS.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
To display the current configuration information for the GPFS cluster, issue this command:
mmlscluster
See also
Location
/usr/lpp/mmfs/bin
Synopsis
mmlsconfig
Description
Use the mmlsconfig command to display the current configuration data for a GPFS cluster.
Depending on your configuration, additional information that is set by GPFS may be displayed to assist
in problem determination when contacting the IBM Support Center. If a configuration parameter is not
shown in the output of this command, the default value for that parameter, as documented in the
mmchconfig command, is in effect.
Parameters
NONE
Options
NONE
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
To display the current configuration data for the GPFS cluster that you are running on, issue this
command:
mmlsconfig
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Or,
Description
Use the mmlsdisk command to display the current state of the disks in the file system.
The mmlsdisk command may be run against a mounted or unmounted file system.
Options
-e Display all of the disks in the file system that do not have an availability of up and a status of
ready. If all disks in the file system are up and ready, the message displayed is:
6027-623 All disks up and ready
-L Displays an extended list of the disk parameters, including the disk ID field and the remarks
field. The remarks column shows the current file system descriptor quorum assignments, and
displays the excluded disks. The remarks field contains desc for all disks assigned as the file
system descriptor holders and excl for all excluded disks.
-M Displays whether I/O requests to the disk are satisfied on the local node, or using an NSD server.
If the I/O is done using an NSD server, shows the NSD server name and the underlying disk
name on that server node.
-m Displays whether I/O requests to the disk are satisfied on the local node, or using an NSD server.
This scope of this options is the node on which the mmlsdisk command is issued.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
If you are a root user, the node on which the command is issued must be able to execute remote shell
commands on any other node in the cluster without the use of a password and without producing any
extraneous messages. See “Requirements for administering a GPFS file system” on page 1.
If you are a non-root user, you may specify only file systems that belong to the same cluster as the node
on which the mmlsdisk command was issued.
Examples
1. To display the current state of gpfs2nsd, issue this command:
mmlsdisk /dev/fs0 -d gpfs2nsd
See also
Location
/usr/lpp/mmfs/bin
Synopsis
mmlsfileset Device {[[Fileset [,Fileset ...]] [-J Junction [,Junction ...]] | -F FileName} [-L] [-d] [-i]
Description
Use the mmlsfileset command to display information for the filesets that belong to a given GPFS file
system. The default is to display information for all filesets in the file system. You may choose to display
information for only a subset of the filesets.
The operation of the -L flag omits the attributes listed without it, namely status and junction path. In
addition, if the fileset has status 'deleted', then -L also displays the name of the latest snapshot that
includes the fileset in place of the root inode number and parent fileset identifier.
For information on GPFS filesets, see the GPFS: Advanced Administration Guide.
Parameters
Device The device name of the file system that contains the fileset.
File system names need not be fully-qualified. fs0 is as acceptable as /dev/fs0.
This must be the first parameter.
Fileset Specifies a comma-separated list of fileset names.
-J Junction
Specifies a comma-separated list of path names. They are not restricted to fileset junctions, but
may name any file or directory within the filesets to be listed.
-F FileName
Specifies the name of a file containing either fileset names or path names. Each line must contain
a single entry. All path names must be fully-qualified.
Options
-d Display the number of blocks in use for the fileset.
-i Display the number of inodes in use for the fileset.
-L Display additional information for the fileset. This includes:
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
1. This command displays fileset information for all filesets in file system gpfs1:
mmlsfileset gpfs1
| See also
| Location
| /usr/lpp/mmfs/bin
| Synopsis
| mmlsfs {Device | all | all_local | all_remote} [-A] [-B] [-D] [-d] [-E] [-f] [-I] [-i] [-j] [-k] [-K] [-L] [-M]
| [-m] [-n] [-o] [-P] [-Q] [-R] [-r] [-S] [-t] [-T] [-u] [-V] [-z] [--create-time] [--fastea] [--filesetdf]
| [--inode-limit] [--mount-priority]
Description
Depending on your configuration, additional information that is set by GPFS may be displayed to assist
in problem determination when contacting the IBM Support Center.
Results
If you do not specify any options, all attributes of the file system are displayed. When you specify
options, only those attributes specified are listed, in the order issued in the command. Some parameters
are preset for optimum performance and, although they display in the mmlsfs command output, you
cannot change them.
Parameters
Device | all | all_local | all_remote
Device Indicates the device name of the file system for which information is displayed. File
system names do not need to be fully qualified. fs0 is as acceptable as /dev/fs0.
all Indicates all file systems that are known to this cluster.
all_local
Indicates all file systems that are owned by this cluster.
all_remote
Indicates all file systems that are owned by another cluster.
This must be the first parameter.
Options
| -A Automatically mounts the file system when the GPFS daemon starts.
| -B Displays the size of the data block, in bytes.
-D Displays the type of file locking semantics that are in effect (nfs4 or posix).
-d Displays the names of all of the disks in the file system.
-E Displays the exact mtime values reported.
-f Displays the minimum fragment size, in bytes.
-I Displays the indirect block size, in bytes.
-i Displays the inode size, in bytes.
-j Displays the block allocation type.
-k Displays the type of authorization supported by the file system.
-K Displays the strict replication enforcement.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
If you are a root user, the node on which the command is issued must be able to execute remote shell
commands on any other node in the cluster without the use of a password and without producing any
extraneous messages. See “Requirements for administering a GPFS file system” on page 1.
As root, a user can also issue the mmlsfs on remote file systems.
If you are a non-root user, you may specify only file systems that belong to the same cluster as the node
on which the mmlsfs command was issued.
If you issue the mmlsfs command with no options for the file system gpfs2:
mmlsfs gpfs2
The system displays information similar to this (output appears in the order that the options were
specified in the command):
| flag value description
| ----------------- ------------------------ ------------------------------------
| -f 262144 Minimum fragment size in bytes
| -i 512 Inode size in bytes
| -I 32768 Indirect block size in bytes
| -m 2 Default number of metadata replicas
| -M 2 Maximum number of metadata replicas
| -r 1 Default number of data replicas
| -R 2 Maximum number of data replicas
| -j scatter Block allocation type
| -D nfs4 File locking semantics in effect
| -k all ACL semantics in effect
| -n 32 Estimated number of nodes that will mount file system
| -B 8388608 Block size
| -Q user;group;fileset Quotas enforced
| user;group Default quotas enabled
| -V 12.02 (3.4.0.0) File system version
| -u yes Support for large LUNs?
| -z no Is DMAPI enabled?
| -L 262144 Logfile size
| -E yes Exact mtime mount option
| -S yes Suppress atime mount option
| -K whenpossible Strict replica allocation option
| --create-time Tue Mar 30 14:56:59 2010 File system creation time
| --fastea yes Fast external attributes enabled?
| --filesetdf no Fileset df enabled?
| --inode-limit 2015232 Maximum number of inodes
| -P system;sp1 Disk storage pools in file system
| -d dm13nsd;dm10nsd;dm11nsd;dm12nsd;dm9;dm14nsd;dm3nsd Disks in file system
| -A yes Automatic mount option
| -o none Additional mount options
| -T /gpfs2 Default mount point
| --mount-priority 10 Mount priority
Location
/usr/lpp/mmfs/bin
Synopsis
mmlslicense [-L]
Description
Use the mmlslicense command to display the number of GPFS client and server licenses assigned to the
nodes in the cluster.
For information on GPFS license designation, see the topic "GPFS license designation" in the GPFS:
Concepts, Planning, and Installation Guide.
Parameters
NONE
Options
-L Displays detailed information about the license type associated with each of the nodes in the
cluster. An asterisk after the license type indicates insufficient license level for the roles that the
node performs.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
To display the summary information about the type and number of GPFS licenses associated with the
nodes in the cluster, issue this command:
mmlslicense
To display detailed information about the type of GPFS licenses associated with each of the nodes in the
cluster, issue this command:
Summary information
---------------------
Number of nodes defined in the cluster: 4
Number of nodes with server license designation: 1
Number of nodes with client license designation: 2
Number of nodes still requiring server license designation: 1
Number of nodes still requiring client license designation: 1
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Or,
mmlsmgr -C ClusterName
Or,
mmlsmgr -c
Description
Use the mmlsmgr command to display which node is the file system manager or cluster manager for the
file system.
If you do not provide a Device operand, file system managers for all file systems within the current
cluster for which a file system manager has been appointed are displayed.
Parameters
Device The device names of the file systems for which the file system manager information is displayed.
If more than one file system is listed, the names must be delimited by a space. File system names
need not be fully-qualified. fs0 is just as acceptable as /dev/fs0.
If no file system is specified, information about all file systems is displayed.
-C ClusterName
Displays the name of the nodes that are file system managers in cluster ClusterName.
-c Displays the current cluster manager node.
Options
NONE
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
If you are a root user, the node on which the command is issued must be able to execute remote shell
commands on any other node in the cluster without the use of a password and without producing any
extraneous messages. See “Requirements for administering a GPFS file system” on page 1.
As root, a user can also issue the mmlsmgr on remote file systems.
Examples
1. To display the file system manager node information for all the file systems, issue this command:
mmlsmgr
See also
“mmchmgr Command” on page 128
Location
/usr/lpp/mmfs/bin
Synopsis
Description
The mmlsmount command reports if a file system is in use at the time the command is issued. A file
system is considered to be in use if it is explicitly mounted with the mount or mmmount command, or if
it is mounted internally for the purposes of running some other GPFS command. For example, when you
run the mmrestripefs command, the file system will be internally mounted for the duration of the
command. If mmlsmount is issued in the interim, the file system will be reported as being in use by the
mmlsmount command but, unless it is explicitly mounted, will not show up in the output of the mount
or df commands.
Parameters
Device | all | all_local | all_remote
Device Indicates the device name of the file system for which information is displayed. File
system names need not be fully-qualified. fs0 is just as acceptable as /dev/fs0.
all Indicates all file systems known to this cluster.
all_local
Indicates all file systems owned by this cluster.
all_remote
Indicates all file systems owned by another cluster.
This must be the first parameter.
Options
-C {all | all_remote | ClusterName[,ClusterName...] }
Specifies the clusters for which mount information is requested. If one or more ClusterName is
specified, only the names of nodes that belong to these clusters and have the file system mounted
are displayed. The dot character ('.') can be used in place of the cluster name to denote the local
cluster.
Option -C all_remote denotes all clusters other than the one from which the command was
issued.
Option -C all refers to all clusters, local and remote, that can have the file system mounted.
Option -C all is the default.
-L Specifies to list the nodes that have the file system mounted.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
If you are a root user, the node on which the command is issued must be able to execute remote shell
commands on any other node in the cluster without the use of a password and without producing any
extraneous messages. See “Requirements for administering a GPFS file system” on page 1.
If you are a non-root user, you may specify only file systems that belong to the same cluster as the node
on which the mmlsmount command was issued.
Examples
1. To see how many nodes have file system fs2 mounted, issue this command:
mmlsmount fs2
Output is similar to this:
File system fs2 is mounted on 3 nodes.
2. To display all mounted file systems:
mmlsmount all
Output is similar to this:
File system fs1 is mounted on 17 nodes.
| File system remotefs1 (remote.cluster:fs1)
is mounted on 17 nodes.
3. To display all remotely mounted file systems:
mmlsmount all_remote
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
Use the mmlsnsd command to display the current information for the NSDs belonging to the GPFS
cluster. The default is to display information for all NSDs defined to the cluster (-a). Otherwise, you may
choose to display the information for a particular file system (-f) or for all disks that do not belong to any
file system (-F).
Parameters
-a Display information for all of the NSDs belonging to the GPFS cluster. This is the default.
-f Device
The device name of the file system for which you want NSD information displayed. File system
names need not be fully-qualified. fs0 is as acceptable as /dev/fs0.
-F Display the NSDs that do not belong to any file system in the GPFS cluster.
-d DiskName[;DiskName...]
The name of the NSDs for which you want information displayed. When you enter multiple
DiskNames, separate them with semicolons and enclose the entire string of disk names in
quotation marks:
"gpfs3nsd;gpfs4nsd;gpfs5nsd"
Options
-L Displays the information in a long format that shows the NSD identifier.
-m Maps the NSD name to its disk device name on the local node and, if applicable, on the NSD
server nodes.
-M Maps the NSD names to its disk device name on all nodes.
This is a slow operation and its usage is suggested for problem determination only.
-v Specifies that the output should contain error information, where available.
-X Maps the NSD name to its disk device name on the local node and, if applicable, on the NSD
server nodes. The -X option also displays extended information for the NSD volume ID and
information such as NSD server status and Persistent Reserve (PR) enablement in the Remarks
field. Using the -X option is a slow operation and is recommended only for problem
determination.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
Examples
1. To display the default information for all of the NSDs belonging to the cluster, issue this command:
mmlsnsd
See also
“mmcrnsd Command” on page 151
Location
/usr/lpp/mmfs/bin
Synopsis
Description
The mmlspolicy command displays policy information for a given file system. The information displayed
includes:
v When the policy file was installed.
v The user who installed the policy file.
v The node on which the policy file was installed.
v The first line of the original policy file.
For information on GPFS policies, see the GPFS: Advanced Administration Guide.
Parameters
Device The device name of the file system for which policy information is to be displayed. File system
names need not be fully-qualified. fs0 is just as acceptable as /dev/fs0.
Options
-L Displays the entire original policy file. If this flag is not specified, only the first line of the
original policy file is displayed
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
1. This command displays basic information for the policy installed for file system fs2:
| mmlspolicy fs2
/* Exclude Rule */
RULE ’Exclude root users files’ EXCLUDE WHERE USER_ID = 0 AND
name like ’%org%’
/* Delete Rule */
RULE ’delete files’ DELETE WHERE PATH_NAME like ’%tmp%’
/* Migrate Rule */
RULE ’sp4.files’ MIGRATE FROM POOL ’sp4’ TO POOL ’sp5’ WHERE
name like ’%sp4%’
/* End of Policy */
See also
Location
/usr/lpp/mmfs/bin
Synopsis
| mmlsquota [-u User | -g Group | -j Fileset] [-v | -q] [-e] [-C ClusterName]
| [--block-size {BlockSize | auto}] [Device ...]
Or,
Description
For the specified User, Group, or Fileset the mmlsquota command displays information about quota limits
and current usage on each file system in the cluster. This information is displayed only if quota limits
have been established and the user has consumed some amount of storage. If you want quota
information for a User, Group, or Fileset that has no file system storage allocated at the present time, you
must specify the -v option.
If none of: -g, -u, or -j is specified, the default is to display only user quotas for the user who issues the
command.
For each file system in the cluster, the mmlsquota command displays:
1. Block limits:
v quota type (USR or GRP or FILESET)
v current usage in KB
v soft limit in KB
v hard limit in KB
v space in doubt
v grace period
2. File limits:
v current number of files
v soft limit
v hard limit
v files in doubt
v grace period
Because the sum of the in-doubt value and the current usage may not exceed the hard limit, the actual
block space and number of files available to the user, group, or fileset may be constrained by the
in-doubt value. If the in-doubt value approaches a significant percentage of the quota, run the
mmcheckquota command to account for the lost space and files.
GPFS quota management takes replication into account when reporting on and determining if quota
limits have been exceeded for both block and file usage. In a file system that has either type of replication
set to a value of two, the values reported on by both the mmlsquota command and the mmrepquota
command are double the value reported by the ls command.
When issuing the mmlsquota command on a mounted file system, negative in-doubt values may be
reported if the quota server processes a combination of up-to-date and back-level information. This is a
transient situation and may be ignored.
Parameters
-C ClusterName
Specify the name of the cluster from which the quota information is obtained (from the file
systems within that cluster). If this option is omitted, the local cluster is assumed.
Device
Specifies the device name of the file system for which quota information is to be displayed. File
system names need not be fully-qualified. fs0 is as acceptable as /dev/fs0.
Options
-d Display the default quota limits for user, group, or fileset quotas.
-e Specifies that mmlsquota is to collect updated quota usage data from all nodes before displaying
results. If this option is not specified, there is the potential to display negative usage values as the
quota server may process a combination of up-to-date and back-level information.
-g Group
Display quota information for the user group or group ID specified in the Group parameter.
-j Fileset
Display quota information for the named fileset.
-q Prints a terse message containing information only about file systems with usage over quota.
-u User
Display quota information for the user name or user ID specified in the User parameter.
-v Display quota information on file systems where the User, Group or Fileset limit has been set, but
the storage has not been allocated.
| –block-size {BlockSize | auto}
| Specifies the unit in which the number of blocks is displayed. The value must be of the form
| [n]K, [n]M, [n]G or [n]T, where n is an optional integer in the range 1 to 1023. The default is 1K.
| If auto is specified, the number of blocks is automatically scaled to an easy-to-read value.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
If you are a non-root user, you may view only fileset quota information, your own quota information,
and quota information for any groups to which you belong.
GPFS must be running on the node from which the mmlsquota command is issued.
This output shows the quotas for user paul in file system fsn set to a soft limit of 100096 KB, and a hard
limit of 200192 KB. 728 KB is currently allocated to paul. 4880 KB is also in doubt, meaning that the
quota system has not yet been updated as to whether this space has been used by the nodes, or whether
it is still available. No grace period appears because the user has not exceeded his quota. If the user had
exceeded the soft limit, the grace period would be set and the user would have that amount of time to
bring his usage below the quota values. If the user failed to do so, the user would not be allocated any
more space.
The soft limit for files (inodes) is set at 30 and the hard limit is 50. 35 files are currently allocated to this
user, and the quota system does not yet know whether the 10 in doubt have been used or are still
available. A grace period of six days appears because the user has exceeded his quota. The user would
have this amount of time to bring his usage below the quota values. If the user fails to do so, the user is
not allocated any more space.
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
| Use the mmlssnapshot command to display GPFS snapshot information for the specified file system. You
| may optionally display the amount of storage used by the snapshot.
Parameters
Device The device name of the file system for which snapshot information is to be displayed. File system
names need not be fully-qualified. fs0 is just as acceptable as /dev/fs0.
This must be the first parameter.
Options
| -d Display the amount of storage used by the snapshot.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
If you are a root user, the node on which the command is issued must be able to execute remote shell
commands on any other node in the cluster without the use of a password and without producing any
extraneous messages. See “Requirements for administering a GPFS file system” on page 1.
If you are a non-root user, you may only specify file systems that belong to the same cluster as the node
on which the mmlssnapshot command was issued.
Examples
To display the snapshot information for the file system fs1 additionally requesting storage information,
issue this command:
mmlssnapshot fs1 -d
See also
Location
/usr/lpp/mmfs/bin
| Synopsis
| Description
| Use the mmmigratefs command to enable features that require existing on-disk data structures to be
| converted to a new format.
| Before issuing the mmmigratefs command, see the topic about migration, coexistence, and compatibility
| in the GPFS: Concepts, Planning, and Installation Guide. You must ensure that all nodes in the cluster have
| been migrated to the latest level of GPFS code and that you have successfully run the mmchconfig
| release=LATEST command. You must also ensure that the new features have been enabled by running
| mmchfs -V full.
| The file system must be unmounted prior to issuing the mmmigratefs command.
| Parameters
| Device
| The device name of the file system to be migrated. File system names need not be fully qualified;
| for example, fs0 is just as acceptable as /dev/fs0. This must be the first parameter.
| Options
| --fastea
| Convert the existing extended attributes to the new format required for storing the attributes in
| the file's inode and thereby allowing for faster extended-attribute access.
| Exit status
| 0 Successful completion.
| nonzero
| A failure has occurred.
| Security
| The node on which the command is issued must be able to execute remote shell commands on any other
| node in the cluster without the use of a password and without producing any extraneous messages. See
| “Requirements for administering a GPFS file system” on page 1.
| Examples
| To enable fast extended attribute access for file system fs3, issue this command:
| mmmigratefs fs3 --fastea
| See also
| Location
| /usr/lpp/mmfs/bin
Synopsis
Or,
Description
The mmmount command mounts the specified GPFS file system on one or more nodes in the cluster. If
no nodes are specified, the file systems are mounted only on the node from which the command was
issued. A file system can be specified using its device name or its default mount point, as established by
the mmcrfs, mmchfs or mmremotefs commands.
When all is specified in place of a file system name, all GPFS file systems will be mounted. This also
includes remote GPFS file systems to which this cluster has access.
Parameters
Device | DefaultMountPoint | DefaultDriveLetter | all | all_local | all_remote
Indicates the file system or file systems to be mounted.
Device The device name of the file system to be mounted. File system names need not be
fully-qualified. fs0 is just as acceptable as /dev/fs0.
DefaultMountPoint
The mount point associated with the file system as a result of the mmcrfs, mmchfs, or
mmremotefs commands.
DefaultDriveLetter
The Windows drive letter associated with the file system as a result of the mmcrfs or
mmchfs command.
all Indicates all file systems known to this cluster.
all_local
Indicates all file systems owned by this cluster.
all_remote
Indicates all files systems owned by another cluster to which this cluster has access.
This must be the first parameter.
DriveLetter
The location where the file system is to be mounted. If not specified, the file system is mounted
at its default drive letter. This option can be used to mount a file system at a drive letter other
than its default one or to mount a file system that does not have an established default drive
letter.
MountPoint
The location where the file system is to be mounted. If not specified, the file system is mounted
at its default mount point. This option can be used to mount a file system at a mount point other
than its default mount point.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
1. To mount all GPFS file systems on all of the nodes in the cluster, issue this command:
mmmount all -a
2. To mount file system fs2 read-only on the local node, issue this command:
mmmount fs2 -o ro
3. To mount file system fs1 on all NSD server nodes, issue this command:
mmmount fs1 -N nsdsnodes
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
The mmnsddiscover command is used to rediscover paths for GPFS NSDs on one or more nodes. If you
do not specify a node, GPFS rediscovers NSD paths on the node from which you issued the command.
On server nodes, mmnsddiscover causes GPFS to rediscover access to disks, thus restoring paths which
may have been broken at an earlier time. On client nodes, mmnsddiscover causes GPFS to refresh its
choice of which NSD server to use when an I/O operation occurs.
In general, after the path to a disk is fixed, the mmnsddiscover command must be first run on the server
that lost the path to the NSD. After that, run the command on all client nodes that need to access the
NSD on that server. You can achieve the same effect with a single mmnsddiscover invocation if you
utilize the -N option to specify a node list that contains all the NSD servers and clients that need to
rediscover paths.
Parameters
-a Rediscovers paths for all NSDs. This is the default.
-d "DiskName[;DiskName]"
Specifies a list of NSDs whose paths are to be rediscovered.
-F DiskFile
Specifies a file that contains the names of the NSDs whose paths are to be rediscovered.
-C ClusterName
Specifies the name of the cluster to which the NSDs belong. This defaults to the local cluster if
not specified.
-N {Node[,Node...] | NodeFile | NodeClass}
Specifies the nodes on which the rediscovery is to be done.
For general information on how to specify node names, see “Specifying nodes as input to GPFS
commands” on page 3.
Options
None
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
Examples
1. To rediscover the paths for all of the NSDs in the local cluster on the local node, issue the command:
mmnsddiscover
mmnsddiscover: Finished.
2. To rediscover the paths for all of the NSDs in the local cluster on all nodes in the local cluster, issue
the command:
mmnsddiscover -a -N all
mmnsddiscover: Finished.
3. To rediscover the paths for a given list of the NSDs on a node in the local cluster, issue the command:
mmnsddiscover -d "gpfs1nsd;gpfs2nsd" -N c6f2c2vp5
See also
Location
/usr/lpp/mmfs/bin
Synopsis
mmpmon [-i CommandFile] [-d IntegerDelayValue] [-p] [-r IntegerRepeatValue] [-s] [-t IntegerTimeoutValue]
Description
Before attempting to use mmpmon, IBM suggests that you review this command entry, then read the
entire topic, Monitoring GPFS I/O performance with the mmpmon command in General Parallel File System:
Advanced Administration Guide.
Use the mmpmon command to manage GPFS performance monitoring functions and display
performance monitoring data. The mmpmon command reads requests from an input file or standard
input (stdin), and writes responses to standard output (stdout). Error messages go to standard error
(stderr). Prompts, if not suppressed, go to stderr.
When running mmpmon in such a way that it continually reads input from a pipe (the driving script or
application never intends to send an end-of-file to mmpmon), set the -r option value to 1 (or use the
default value of 1) to prevent mmpmon from caching the input records. This avoids unnecessary memory
consumption.
Results
The performance monitoring request is sent to the GPFS daemon running on the same node that is
running the mmpmon command.
For details on how to interpret the mmpmon command results, see the topic Monitoring GPFS I/O
performance with the mmpmon command in General Parallel File System: Advanced Administration Guide.
Parameters
-i CommandFile
The input file contains mmpmon command requests, one per line. Use of the -i flag implies use
of the -s flag. For interactive use, just omit the -i flag. In this case, the input is then read from
stdin, allowing mmpmon to take keyboard input or output piped from a user script or
application program.
Leading blanks in the input file are ignored. A line beginning with a pound sign (#) is treated as
a comment. Leading blanks in a line whose first non-blank character is a pound sign (#) are
ignored.
Options
-d IntegerDelayValue
Specifies a number of milliseconds to sleep after one invocation of all the requests in the input
file. The default value is 1000. This value must be an integer greater than or equal to 500 and less
than or equal to 8000000.
The input file is processed as follows: The first request is processed, it is sent to the GPFS
daemon, the responses for this request are received and processed, the results for this request are
displayed, and then the next request is processed and so forth. When all requests from the input
file have been processed once, the mmpmon command sleeps for the specified number of
milliseconds. When this time elapses, mmpmon wakes up and processes the input file again,
depending on the value of the -r flag.
-p Indicates to generate output that can be parsed by a script or program. If this option is not
specified, human readable output is produced.
-r IntegerRepeatValue
Specifies the number of times to run all the requests in the input file.
Exit status
0 Successful completion.
1 Various errors (insufficient memory, input file not found, incorrect option, and so forth).
3 Either no commands were entered interactively, or there were no mmpmon commands in the
input file. The input file was empty, or consisted of all blanks or comments.
4 mmpmon terminated due to a request that was not valid.
5 An internal error has occurred.
111 An internal error has occurred. A message will follow.
Restrictions
1. Up to five instances of mmpmon may be run on a given node concurrently. However, concurrent
users may interfere with each other. See Monitoring GPFS I/O performance with the mmpmon command in
General Parallel File System: Advanced Administration Guide.
2. Do not alter the input file while mmpmon is running.
3. The input file must contain valid input requests, one per line. When an incorrect request is detected
by mmpmon, it issues an error message and terminates. Input requests that appear in the input file
before the first incorrect request are processed by mmpmon.
Security
The mmpmon command must be run by a user with root authority, on the node for which statistics are
desired.
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
1. Assume that infile contains these requests:
ver
io_s
fs_io_s
rhist off
For several more examples, see Monitoring GPFS I/O performance with the mmpmon command in General
Parallel File System: Advanced Administration Guide.
Location
/usr/lpp/mmfs/bin
Synopsis
Description
If the -i option is not used, the command expects the input to be supplied through standard input, and
waits for your response to the prompt.
For information about NFS V4 ACLs, see Chapter 6, “Managing GPFS access control lists and NFS
export,” on page 51.
Any output from the mmgetacl command can be used as input to mmputacl. The command is extended
to support NFS V4 ACLs. In the case of NFS V4 ACLs, there is no concept of a default ACL. Instead,
there is a single ACL and the individual access control entries can be flagged as being inherited (either by
files, directories, both, or neither). Consequently, specifying the -d flag for an NFS V4 ACL is an error. By
its nature, storing an NFS V4 ACL implies changing the inheritable entries (the GPFS default ACL) as
well.
The following describes how mmputacl works for POSIX and NFS V4 ACLs:
Command POSIX ACL NFS V4 ACL
----------- ---------------------------- ------------------------
mmputacl Access ACL (Error if default Stores the ACL (implies
ACL is NFS V4 [1]) default as well)
mmputacl -d Default ACL (Error if access Error: NFS V4 ACL (has
ACL is NFS V4 [1] no default ACL)
---------------------------------------------------------------------
[1] The default and access ACLs are not permitted to be mixed types
because NFS V4 ACLs include inherited entries, which are the
equivalent of a default ACL. An mmdelacl of the NFS V4 ACL is
required before an ACL is converted back to POSIX.
---------------------------------------------------------------------
Depending on the file system's -k setting (posix, nfs4, or all), mmputacl may be restricted. The
mmputacl command is not allowed to store an NFS V4 ACL if -k posix is in effect. The mmputacl
command is not allowed to store a POSIX ACL if -k nfs4 is in effect. For more information, see the
description of the -k flag for the mmchfs, mmcrfs, and mmlsfs commands.
Note that the test to see if the given ACL is acceptable based on the file system's -k setting cannot be
done until after the ACL is provided. For example, if mmputacl file1 is issued (no -i flag specified) the
user then has to input the ACL before the command can verify that it is an appropriate ACL given the
file system settings. Likewise, the command mmputacl -d dir1 (again the ACL was not given with the -i
flag) requires that the ACL be entered before file system ACL settings can be tested. In this situation, the
-i flag may be preferable to manually entering a long ACL, only to find out it is not allowed by the file
system.
Options
-d Specifies that the default ACL of a directory is to be set. This flag cannot be used on an NFS V4
ACL.
-i InFilename
The path name of a source file from which the ACL is to be read.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
You may issue the mmputacl command only from a node in the GPFS cluster where the file system is
mounted.
You must be the file or directory owner, the root user, or someone with control permission in the ACL, to
run the mmputacl command.
Examples
To use the entries in a file named standard.acl to set the ACL for a file named project2.history, issue this
command:
mmputacl -i standard.acl project2.history
Location
/usr/lpp/mmfs/bin
Synopsis
Description
If none of: -u, -j or -g is specified, the mmquotaoff command deactivates quota limit checking for users,
groups, and filesets.
If the -a option is not specified, Device must be the last parameter entered.
Parameters
Device[ Device ... ]
The device name of the file system to have quota limit checking deactivated.
If more than one file system is listed, the names must be delimited by a space. File system names
need not be fully-qualified. fs0 is just as acceptable as /dev/fs0.
Options
-a Deactivates quota limit checking for all GPFS file systems in the cluster. When used in
combination with the -g option, only group quota limit checking is deactivated. When used in
combination with the -u or -j options, only user or fileset quota limit checking, respectively, is
deactivated.
-g Specifies that only group quota limit checking is to be deactivated.
-j Specifies that only quota checking for filesets is to be deactivated.
-u Specifies that only user quota limit checking is to be deactivated.
-v Prints a message for each file system in which quotas are deactivated.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
GPFS must be running on the node from which the mmquotaoff command is issued.
Examples
1. To deactivate user quota limit checking on file system fs0, issue this command:
mmquotaoff -u fs0
To confirm the change, individually for each file system, issue this command:
mmlsfs fs2 -Q
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
If none of: -u, -j or -g is specified, the mmquotaon command activates quota limit checking for users,
groups, and filesets.
If the -a option is not used, Device must be the last parameter specified.
After quota limit checking has been activated by issuing the mmquotaon command, issue the
mmcheckquota command to count inode and space usage.
Parameters
Device[ Device ... ]
The device name of the file system to have quota limit checking activated.
If more than one file system is listed, the names must be delimited by a space. File system names
need not be fully-qualified. fs0 is just as acceptable as /dev/fs0.
Options
-a Activates quota limit checking for all of the GPFS file systems in the cluster. When used in
combination with the -g option, only group quota limit checking is activated. When used in
combination with the -u or -j option, only user or fileset quota limit checking, respectively, is
activated.
-g Specifies that only group quota limit checking is to be activated.
-j Specifies that only fileset quota checking is to be activated.
-u Specifies that only user quota limit checking is to be activated.
-v Prints a message for each file system in which quota limit checking is activated.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
GPFS must be running on the node from which the mmquotaon command is issued.
To confirm the change, individually for each file system, issue this command:
mmlsfs fs1 -Q
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Or,
Or,
Or,
Description
The mmremotecluster command is used to make remote GPFS clusters known to the local cluster, and to
maintain the attributes associated with those remote clusters. The keyword appearing after
mmremotecluster determines which action is performed:
add Adds a remote GPFS cluster to the set of remote clusters known to the local cluster.
delete Deletes the information for a remote GPFS cluster.
show Displays information about a remote GPFS cluster.
update
Updates the attributes of a remote GPFS cluster.
To be able to mount file systems that belong to some other GPFS cluster, you must first make the nodes
in this cluster aware of the GPFS cluster that owns those file systems. This is accomplished with the
mmremotecluster add command. The information that the command requires must be provided to you
by the administrator of the remote GPFS cluster. You will need this information:
v The name of the remote cluster.
v The names or IP addresses of a few nodes that belong to the remote GPFS cluster.
v The public key file generated by the administrator of the remote cluster by running the mmauth
genkey command for the remote cluster.
Since each cluster is managed independently, there is no automatic coordination and propagation of
changes between clusters like there is between the nodes within a cluster. This means that once a remote
cluster is defined with the mmremotecluster command, the information about that cluster is
automatically propagated across all nodes that belong to this cluster. But if the administrator of the
remote cluster decides to rename it, or deletes some or all of the contact nodes, or change the public key
file, the information in this cluster becomes obsolete. It is the responsibility of the administrator of the
remote GPFS cluster to notify you of such changes so that you can update your information using the
appropriate options of the mmremotecluster update command.
Parameters
RemoteClusterName
Specifies the cluster name associated with the remote cluster that owns the remote GPFS file
where:
tcpPort=NNNN
Specifies the TCP port number to be used by the local GPFS daemon when contacting the remote
cluster. If not specified, GPFS will use the default TCP port number 1191.
node1[,node2...]
Specifies a list of nodes that belong to the remote cluster. The nodes can be identified through
their host names or IP addresses.
Options
None.
Exit status
0 Successful completion. After successful completion of the mmremotecluster command, the new
configuration information is propagated to all nodes in the cluster.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
1. This command adds remote cluster k164.kgn.ibm.com to the set of remote clusters known to the local
cluster, specifying k164n02 and k164n03 as remote contact nodes. File k164.id_rsa.pub is the name of
the public key file provided to you by the administrator of the remote cluster.
mmremotecluster add k164.kgn.ibm.com -n k164n02,k164n03\
-k k164.id_rsa.pub
See also
Accessing GPFS file systems from other GPFS clusters in General Parallel File System: Advanced Administration
Guide.
Location
/usr/lpp/mmfs/bin
Synopsis
mmremotefs add Device -f RemoteDevice -C RemoteClusterName [-T MountPoint] [-t DriveLetter] [-A {yes |
| no | automount}] [-o MountOptions] [--mount-priority Priority]
Or,
Or,
Or,
mmremotefs update Device [-f RemoteDevice] [-C RemoteClusterName] [-T MountPoint] [-t DriveLetter] [-A
| {yes | no | automount}] [-o MountOptions] [--mount-priority Priority]
Description
The mmremotefs command is used to make GPFS file systems that belong to other GPFS clusters known
to the nodes in this cluster, and to maintain the attributes associated with these file systems. The keyword
appearing after mmremotefs determines which action is performed:
add Define a new remote GPFS file system.
delete Delete the information for a remote GPFS file system.
show Display the information associated with a remote GPFS file system.
update
Update the information associated with a remote GPFS file system.
Use the mmremotefs command to make the nodes in this cluster aware of file systems that belong to
other GPFS clusters. The cluster that owns the given file system must have already been defined with the
mmremotecluster command. The mmremotefs command is used to assign a local name under which the
remote file system will be known in this cluster, the mount point where the file system is to be mounted
in this cluster, and any local mount options that you may want.
Once a remote file system has been successfully defined and a local device name associated with it, you
can issue normal commands using that local name, the same way you would issue them for file systems
that are owned by this cluster.
When running the mmremotefs command delete and update options, the file system must be unmounted
on the local cluster, however, it can be mounted elsewhere.
Parameters
Device Specifies the name by which the remote GPFS file system will be known in the cluster.
-C RemoteClusterName
Specifies the name of the GPFS cluster that owns the remote GPFS file system.
Options
-A {yes | no | automount}
Indicates when the file system is to be mounted:
yes When the GPFS daemon starts.
no Manual mount. This is the default.
automount
When the file system is first accessed.
-o MountOptions
Specifies the mount options to pass to the mount command when mounting the file system. For a
detailed description of the available mount options, see “GPFS-specific mount options” on page
18.
-T MountPoint
The local mount point directory of the remote GPFS file system. If it is not specified, the mount
point will be set to DefaultMountDir/Device. The default value for DefaultMountDir is /gpfs, but it
can be changed with the mmchconfig command.
-t DriveLetter
Specifies the drive letter to use when the file system is mounted on Windows.
| --mount-priority Priority
Controls the order in which the individual file systems are mounted at daemon startup or when
one of the all keywords is specified on the mmmount command.
File systems with higher Priority numbers are mounted after file systems with lower numbers.
File systems that do not have mount priorities are mounted last. A value of zero indicates no
| priority.
Exit status
0 Successful completion. After successful completion of the mmremotefs command, the new
configuration information is propagated to all nodes in the cluster.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
This command adds remote file system gpfsn, owned by remote cluster k164.kgn.ibm.com, to the local
cluster, assigning rgpfsn as the local name for the file system, and /gpfs/rgpfsn as the local mount point.
mmremotefs add rgpfsn -f gpfsn -C k164.kgn.ibm.com -T /gpfs/rgpfsn
See also
Accessing GPFS file systems from other GPFS clusters in General Parallel File System: Advanced Administration
Guide.
Location
/usr/lpp/mmfs/bin
Synopsis
Description
The mmrepquota command reports file system usage and quota information for a user, group, or fileset.
If none of -g, -j, or -u is specified, then user, group and fileset quotas are listed.
If the -a option is not specified, Device must be the last parameter entered.
For each file system in the cluster, the mmrepquota command displays:
1. Block limits:
v quota type (USR, GRP or FILESET)
v current usage in KB
v soft limit in KB
v hard limit in KB
v space in doubt
v grace period
2. File limits:
v current number of files
v soft limit
v hard limit
v files in doubt
v grace period
3. Entry Type
default on
Default quotas are enabled for this file system
default off
Default quotas are not enabled for this file system
e Explicit quotas – the quota limits have been explicitly set using the mmedquota command
d Default quotas – the quota limits are the default values set using the mmdefedquota
command
i Initial quotas – default quotas were not enabled when this initial entry was established. Initial
quota limits have a value of zero indicating no limit.
Because the sum of the in-doubt value and the current usage may not exceed the hard limit, the actual
block space and number of files available to the user, group, or fileset may be constrained by the in-doubt
value. If the in-doubt value approach a significant percentage of the quota, run the mmcheckquota
command to account for the lost space and files.
When issuing the mmrepquota command on a mounted file system, negative in-doubt values may be
reported if the quota server processes a combination of up-to-date and back-level information. This is a
transient situation and may be ignored.
Parameters
Device[ Device...]
The device name of the file system to be listed.
If more than one file system is listed, the names must be delimited by a space. File system names
need not be fully-qualified. fs0 is just as acceptable as /dev/fs0.
Options
-a Lists quotas for all file systems in the cluster. A header line is printed automatically with this
option.
-e Specifies that the mmrepquota command is to collect updated quota usage data from all nodes
before displaying results. If this option is not specified, there is the potential to display negative
usage values as the quota server may process a combination of up-to-date and back-level
information.
-g List only group quotas.
-j List only fileset quotas.
-n Displays a numerical user ID.
-q Show whether quota enforcement is active.
-u List only user quotas.
-v Print a header line.
| –block-size {BlockSize | auto}
| Specifies the unit in which the number of blocks is displayed. The value must be of the form
| [n]K, [n]M, [n]G or [n]T, where n is an optional integer in the range 1 to 1023. The default is 1K.
| If auto is specified, the number of blocks is automatically scaled to an easy-to-read value.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
GPFS must be running on the node from which the mmrepquota command is issued.
See also
Location
/usr/lpp/mmfs/bin
Synopsis
mmrestoreconfig Device -i InputFile [-I {test | yes | continue }] [-Q {yes | no}] [-W NewDeviceName] [-z
{yes | no}]
or
or
Description
| The mmrestoreconfig command allows you to either query or restore, or both query and restore, the
| output file of the mmbackupconfig command.
In the query phase, the mmrestoreconfig command uses the output file generated by the
mmbackupconfig command as an input parameter, and then creates a configuration file. Users can then
edit the configuration file to fit their current file system configuration. You can use the definitions in the
configuration file to create the appropriate network shared disks (NSDs) and file systems required for the
restore.
In the restore phase, the mmrestoreconfig command uses the input file (output from the
| mmbackupconfig command) to restore the backed up file system configuration in the newly created file
system.
Parameters
Device Specifies the name of the file system to be restored.
Options
-I {test | yes | continue}
Specifies the action to be taken during the restore phase:
| test Test all the configuration settings before the actual restore is performed.
yes Test and proceed on the restore process. This is the default action.
continue
Restart mmrestoreconfig from the last known successful configuration restore.
-i inputFile
Specifies the file generated by the mmbackupconfig command. The input file contains the file
system configuration information.
| -F QueryResultFile
Specifies the pathname of the configuration query result file generated by mmrestoreconfig. The
configuration query result file is a report file that you can edit and use as a guide to mmcrnsd or
mmcrfs.
Exit status
0 Successful completion.
1 Partially successful completion.
2 A failure has occurred that cannot be corrected. A new mmrestoreconfig run is required.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
| 1. Run mmrestoreconfig -F QueryResultFile to specify the pathname of the configuration query result file
| to be generated.
| mmrestoreconfig gpfs1 -i inputFile -F reportfile
2. To test the settings before running the restore:
| mmrestoreconfig fs1 -i nothing -I test
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
Use the mmrestorefs command to restore user data and attribute files to a file system using those of the
specified snapshot.
Prior to issuing the mmrestorefs command, unmount the file system from all nodes in the cluster. The
file system may not be remounted until the mmrestorefs command has successfully completed, unless
you have specified the -c option to force the restore to continue even in the event errors are encountered.
Automatic quota activation upon mounting the file system is not restored by the mmrestorefs command.
You must issue the mmchfs -Q yes command to restore automatic quota activation.
Snapshots are not affected by the mmrestorefs command. Consequently, a failure while restoring one
snapshot may possibly be recovered by restoring a different snapshot.
When the mmsnapdir -a (add a snapshots subdirectory to all subdirectories in the file system) option is
in effect, the snapshots subdirectories may no longer show the complete list of snapshots containing the
parent directory, if the file system was restored from a snapshot that was not the latest. Since the root
directory is contained in all snapshots, its snapshots subdirectory will always show the complete list of
snapshots.
For information on how GPFS policies and snapshots interact, see the GPFS: Advanced Administration
Guide.
Because snapshots are not copies of the entire file system, they should not be used as protection against
media failures. For protection against media failures, see GPFS: Concepts, Planning, and Installation Guide
and search on recoverability considerations.
Parameters
Device The device name of the file system for which the snapshot is to be created. File system names
need not be fully-qualified. fs0 is just as acceptable as /dev/fs0.
This must be the first parameter.
| SnapshotName
The snapshot with which to restore the file system.
Options
-c Continue to restore the file system in the event errors occur.
Upon completion of the mmrestorefs -c command, the file system is inconsistent, but can be
mounted to recover data from the snapshot. If necessary, the command may be issued to recover
as much data as possible. The mmfsck command may be run on an inconsistent file system.
After the mmrestorefs -c command has been issued, use the mmfsck command to clean up the
files or directories that could not be restored.
Exit status
0 Successful completion.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
/fs1/.snapshots/snap1/file1
/fs1/.snapshots/snap1/userA/file2
/fs1/.snapshots/snap1/userA/file3
/fs1/.snapshots/snap1/file1
/fs1/.snapshots/snap1/userA/file2
/fs1/.snapshots/snap1/userA/file3
The directory userB is then created using the inode originally assigned to userA. We take another
snapshot:
mmcrsnapshot fs1 snap2
After the command is issued, the directory structure would appear similar to:
/fs1/file1
/fs1/userB/file2b
/fs1/userB/file3b
/fs1/.snapshots/snap1/file1
/fs1/.snapshots/snap1/userA/file2
/fs1/.snapshots/snap1/userA/file3
/fs1/.snapshots/snap2/file1
/fs1/.snapshots/snap2/userB/file2b
/fs1/.snapshots/snap2/userB/file3b
After the command has been issued, the directory structure would appear similar to:
/fs1/.snapshots/snap1/file1
/fs1/.snapshots/snap1/userA/file2
/fs1/.snapshots/snap1/userA/file3
/fs1/.snapshots/snap2/file1
/fs1/.snapshots/snap2/userB/file2b
/fs1/.snapshots/snap2/userB/file3b
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
The mmrestripefile command performs a repair operation over the specified list of files. The -F flag
allows the user to specify a file containing the list of file names to be restriped, with one file name per
line.
The mmrestripefile command attempts to restore the metadata or data replication factor of the file.
You must specify one of the four options (-b, -m, -p, or -r) to indicate how much file data to move.
If you do not use replication, the -m and -r options are equivalent. Their behavior differs only on
replicated files. A migrate operation, using the -m option, leaves data on a suspended disk as long as at
least one other replica of the data remains on a disk that is not suspended. Restriping a file includes
replicating it. The -b option performs all the operations of the -m and -r options.
Parameters
-F FilenameFile
Specifies a file that contains a list of names of files to be restriped, one name per line.
Filename
Specifies the names of one or more files to be restriped.
Options
-b Rebalances a list of files across all disks that are not suspended, even if they are stopped.
Although blocks are allocated on a stopped disk, they are not written to a stopped disk, nor are
reads allowed from a stopped disk, until that disk is started and replicated data is copied onto it.
-m Migrates critical data off of any suspended disk for a list of specified files. Critical data is all data
that would be lost if currently suspended disks were removed.
-p Directs mmrestripefile to repair the file placement within the storage pool.
Files assigned to one storage pool, but with data in a different pool, will have their data migrated
to the correct pool. These files are called ill-placed. Utilities, such as the mmchattr command, may
change a file's storage pool assignment, but not move the data. The mmrestripefile command
may then be invoked to migrate all of the data at once, rather than migrating each file
individually. Note that the rebalance operation, specified by the -b option, also performs data
placement on all files, whereas the placement option, specified by -p, rebalances only the files
that it moves.
-r Migrates all data for a list of files off of suspended disks. It also restores all specified replicated
files in the file system to their designated degree of replication when a previous disk failure or
removal of a disk has made some replicated data inaccessible. Use this option either immediately
after a disk failure, to protect replicated data against a subsequent failure, or before taking a disk
offline for maintenance, to protect replicated data against failure of another disk during the
maintenance process.
Exit status
0 Successful completion.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
This example illustrates restriping a file named testfile0. This command confirms that testfile0 is
illplaced.
mmlsattr -L testfile0
See also
Location
/usr/lpp/mmfs/bin
Synopsis
mmrestripefs Device {-b | -m | -p | -r | -R} [-N {Node[,Node...] | NodeFile | NodeClass}] [-P PoolName]
Description
Use the mmrestripefs command to rebalance or restore the replication of all files in a file system. The
mmrestripefs command moves existing file system data between different disks in the file system based
on changes to the disk state made by the mmchdisk, mmadddisk, and mmdeldisk commands.
The mmrestripefs command attempts to restore the metadata or data replication of any file in the file
system.
You must specify one of the five options (-b, -m, -p, -r, or -R) to indicate how much file system data to
move. You can issue this command against a mounted or unmounted file system.
If you do not use replication, the -m and -r options are equivalent. Their behavior differs only on
replicated files. After a successful replicate (-r option), all suspended disks are empty. A migrate
operation, using the -m option, leaves data on a suspended disk as long as at least one other replica of
the data remains on a disk that is not suspended. Restriping a file system includes replicating it. The -b
option performs all the operations of the -m and -r options.
Consider the necessity of restriping and the current demands on the system. New data that is added to
the file system is correctly striped. Restriping a large file system requires a large number of insert and
delete operations and may affect system performance. Plan to perform this task when system demand is
low.
Parameters
Device The device name of the file system to be restriped. File system names need not be fully-qualified.
fs0 is as acceptable as /dev/fs0.
This must be the first parameter.
-N {Node[,Node...] | NodeFile | NodeClass}
Specify the nodes that participate in the restripe of the file system. This command supports all
| defined node classes. The default is all or the current value of the defaultHelperNodes parameter
| of the mmchconfig command.
For general information on how to specify node names, see “Specifying nodes as input to GPFS
commands” on page 3.
Options
-b Rebalances all files across all disks that are not suspended, even if they are stopped. Although
blocks are allocated on a stopped disk, they are not written to a stopped disk, nor are reads
allowed from a stopped disk, until that disk is started and replicated data is copied onto it. The
mmrestripefs command rebalances and restripes the file system. Use this option to rebalance the
file system after adding, changing, or deleting disks in a file system.
Note: Rebalancing of files is an I/O intensive and time consuming operation, and is important
only for file systems with large files that are mostly invariant. In many cases, normal file update
and creation will rebalance your file system over time, without the cost of the rebalancing.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
1. To move all critical data from any suspended disk in file system fs1, issue this command:
mmrestripefs fs1 -m
See also
Location
/usr/lpp/mmfs/bin
Synopsis
mmrpldisk Device DiskName {DiskDesc | -F DescFile} [-v {yes | no}] [-N {Node[,Node...] | NodeFile |
NodeClass}]
Description
Use the mmrpldisk command to replace an existing disk in the GPFS file system with a new one. All
data on the old disk is migrated to the new disk.
To replace a disk in a GPFS file system, you must first decide if you will:
1. Create a new disk using the mmcrnsd command.
In this case, use the rewritten disk descriptor file produced by the mmcrnsd command or create a
new disk descriptor. When using the rewritten file, the Disk Usage and Failure Group specifications
remain the same as specified on the mmcrnsd command.
2. Select a disk no longer in any file system. Issue the mmlsnsd -F command to display the available
disks.
The disk may then be used to replace a disk in the file system using the mmrpldisk command.
Notes:
1. You cannot replace a disk when it is the only remaining disk in the file system.
2. Under no circumstances should you replace a stopped disk. You need to start a stopped disk before
replacing it. If a disk cannot be started, delete it using the mmdeldisk command. See the General
Parallel File System: Problem Determination Guide and search for Disk media failure.
3. The file system may be mounted when running the mmrpldisk command.
Results
Upon successful completion of the mmrpldisk command, the disk is replaced in the file system and data
is copied to the new disk without restriping.
Parameters
Device The device name of the file system where the disk is to be replaced. File system names need not
be fully-qualified. fs0 is as acceptable as /dev/fs0.
This must be the first parameter.
Disk Name
The name of the disk to be replaced, which was previously passed to the mmcrfs, mmadddisk,
or mmrpldisk commands. You can display the entire list of disk names by issuing the mmlsdisk
command.
DiskDesc
A descriptor for the replacement disk.
-F DescFile
Specifies a file containing the disk descriptor for the replacement disk.
The disk descriptor must be specified in the form (second, third, and sixth fields reserved):
DiskName:::DiskUsage:FailureGroup:::
DiskName
Note: While it is not absolutely necessary to specify the same disk descriptor parameters for the
new disk as the old disk, it is suggested you do so. If the new disk is equivalent in size as the
old disk, and if the DiskUsage and FailureGroup parameters are the same, the data and metadata
can be completely migrated from the old disk to the new disk. A disk replacement in this manner
allows the file system to maintain its current data and metadata balance.
If the new disk has a different size, DiskUsage parameter, or FailureGroup parameter, the operation
may leave the file system unbalanced and require a restripe. Additionally, a change in size or the
DiskUsage parameter may cause the operation to fail since other disks in the file system may not
have sufficient space to absorb more data or metadata. In this case, first use the mmadddisk
command to add the new disk, the mmdeldisk command to delete the old disk, and finally the
mmrestripefs command to rebalance the file system.
-N {Node[,Node...] | NodeFile | NodeClass}
Specify the nodes that participate in the migration of data from the old to the new disk. This
| command supports all defined node classes. The default is all or the current value of the
| defaultHelperNodes parameter of the mmchconfig command.
For general information on how to specify node names, see “Specifying nodes as input to GPFS
commands” on page 3.
Options
-v {yes | no}
Verify the new disk does not belong to an existing file system. The default is -v yes. Specify -v no
only when you want to reuse a disk that is no longer needed for an existing file system. If the
command is interrupted for any reason, use the -v no option on the next invocation of the
command.
Important: Using -v no on a disk that already belongs to a file system will corrupt that file
system. This will not be noticed until the next time that file system is mounted.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
To replace disk hd27n01 in fs1 with a new disk, hd16vsdn10 allowing the disk usage and failure group
parameters to default to the corresponding values of hd27n01, and have only nodes c154n01, c154n02,
and c154n09 participate in the migration of the data, issue this command:
mmrpldisk fs1 hd27n01 hd16vsdn10 -N c154n01,c154n02,c154n09
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
Use the mmshutdown command to stop the GPFS daemons on one or more nodes. If no operand is
specified, GPFS is stopped only on the node from which the command was issued.
The mmshutdown command first attempts to unmount all GPFS file systems. If the unmount does not
complete within the specified timeout period, the GPFS daemons shut down anyway.
Results
Upon successful completion of the mmshutdown command, these tasks are completed:
v GPFS file systems are unmounted.
v GPFS daemons are stopped.
Parameters
-a Stop GPFS on all nodes in a GPFS cluster.
-N {Node[,Node...] | NodeFile | NodeClass}
Directs the mmshutdown command to process a set of nodes. For general information on how to
specify node names, see “Specifying nodes as input to GPFS commands” on page 3.
This command does not support a NodeClass of mount.
Options
-t UnmountTimeout
The maximum amount of time, in seconds, that the unmount command is given to complete. The
default timeout period is equal to:
60 + 3 × number of nodes
If the unmount does not complete within the specified amount of time, the command times out
and the GPFS daemons shut down.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Or,
Description
Use the mmsnapdir command to create or delete invisible directories that connect to the snapshots of a
GPFS file system, and change the name of the snapshots subdirectory.
Snapshots appear in a subdirectory in the root directory of the file system. If you prefer to access the
snapshots from each file system directory rather than traversing through the root directory, you can create
an invisible directory to make the connection by issuing the mmsnapdir command with the -a flag (see
Example 1 on page 305). The -a flag of the mmsnapdir command creates an invisible directory in each
normal directory in the active file system (they do not appear in directories in snapshots) that contains a
subdirectory for each existing snapshot of the file system. These subdirectories correspond to the copy of
that directory in the snapshot with the same name. For example if you enter, ls -a /fs1/userA it does not
show .snapshots , but if you enter ls /fs1/userA/.snapshots and cd /fs1/userA/.snapshots, it does show
.snapshots.
If the mmsnapdir command is issued while another snapshot command is running, the mmsnapdir
command waits for that command to complete.
For more information about GPFS snapshots, see Creating and maintaining snapshots of GPFS file systems in
General Parallel File System: Advanced Administration Guide.
Parameters
Device The device name of the file system. File system names need not be fully-qualified. fs0 is just as
acceptable as /dev/fs0.
This must be the first parameter.
Options
-a Adds a snapshots subdirectory to all subdirectories in the file system.
-q Displays current settings, if issued without any other flags.
-r Reverses the effect of the -a option. All invisible snapshot directories are removed. The snapshot
directory under the file system root directory is not affected.
-s SnapDirName
Changes the name of the snapshots subdirectory to SnapDirName. This affects both the directory
in the file system root as well as the invisible directory in the other file system directories if the
mmsnapdir -a command has been issued.
Exit status
0 Successful completion.
Security
If you are a root user, the node on which the command is issued must be able to execute remote shell
commands on any other node in the cluster without the use of a password and without producing any
extraneous messages. See “Requirements for administering a GPFS file system” on page 1.
If you are a non-root user, you may only specify file systems that belong to the same cluster as the node
on which the mmsnapdir command was issued.
Examples
1. To rename the .snapshots directory (the default snapshots directory name) to .link for file system fs1,
issue the command:
mmsnapdir fs1 -s .link
After the command has been issued, the directory structure would appear similar to:
/fs1/file1
/fs1/userA/file2
/fs1/userA/file3
/fs1/.link/snap1/file1
/fs1/.link/snap1/userA/file2
/fs1/.link/snap1/userA/file3
2. Issuing:
mmsnapdir fs1 -a
After the command has been issued, the directory structure would appear similar to:
/fs1/file1
/fs1/userA/file2
/fs1/userA/file3
/fs1/userA/.link/snap1/file2
/fs1/userA/.link/snap1/file3
/fs1/.link/snap1/file1
/fs1/.link/snap1/userA/file2
/fs1/.link/snap1/userA/file3
The .link subdirectory under the root directory and under each subdirectory of the tree provides two
different paths to each snapshot copy of a file. For example, /fs1/userA/.link/snap1/file2 and
/fs1/.link/snap1/userA/file2 are two different paths that access the same snapshot copy of
/fs1/userA/file2.
3. Issuing:
mmsnapdir fs1 -r
After the command has been issued, the directory structure would appear similar to:
/fs1/file1
/fs1/userA/file2
/fs1/userA/file3
/fs1/.link/snap1/file1
/fs1/.link/snap1/userA/file2
/fs1/.link/snap1/userA/file3
4. Issuing:
mmsnapdir fs1 -q
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
Use the mmstartup command to start the GPFS daemons on one or more nodes. If no operand is
specified, GPFS is started only on the node from which the command was issued.
Parameters
-a Start GPFS on all nodes in a GPFS cluster.
-N {Node[,Node...] | NodeFile | NodeClass}
Directs the mmstartup command to process a set of nodes. For general information on how to
specify node names, see “Specifying nodes as input to GPFS commands” on page 3.
This command does not support a NodeClass of mount.
Options
-E EnvVar=value
Specifies the name and value of an environment variable to be passed to the GPFS daemon. You
can specify multiple -E options.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
To start GPFS on all nodes in the GPFS cluster, issue this command:
mmstartup -a
See also
/usr/lpp/mmfs/bin
Synopsis
mmtracectl {--start | --stop | --off | --set} [--trace={io | all | def | "Class Level [Class Level ...]"]
[--trace-recycle={off | local | global | globalOnShutdown}] [--aix-trace-buffer-size=BufferSize]
[--tracedev-buffer-size=BufferSize] [--tracedev-compression-level=Level] [--tracedev-overwrite-buffer-
size=BufferSize] [--tracedev-write-mode={blocking | overwrite}] [--trace-file-size=FileSize]
[--trace-dispatch={yes | no}] [-N {Node [,Node...] | NodeFile | NodeClass}]
Description
Attention: Use this command only under the direction of your IBM service representative.
| Note: Tracing on Windows requires support programs provided by Microsoft®. For details about this
| prerequisite, see the section about configuring Windows and installing tracing support programs in the
| GPFS: Concepts, Planning, and Installation Guide.
Results
GPFS tracing can be started, stopped, or related configuration options can be set.
Parameters
--start | --stop | --off | --set
Specifies the actions that the mmtracectl command performs, where:
start Starts the trace.
stop Stops the trace.
off Clears all of the setting variables and stops the trace.
set Sets the trace variables.
--trace={io | all | def | "Class Level [Class Level ...}"]
Allows for predefined and user-specified trace levels, where:
io Indicates trace-level settings tailored for input and output (I/O).
all Sets trace levels to their highest setting (9).
def Indicates that the default trace settings will be used.
Options
--aix-trace-buffer-size=BufferSize
Controls the size of the trace buffer in memory for AIX.
--tracedev-buffer-size=BufferSize
Specifies the trace buffer size for Linux trace in blocking mode. If --tracedev-write-mode is set to
blocking, this parameter will be used. It should be no less than 4K and no more than 64M. The
default is 4M.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
To set trace levels to the defined group of def and have traces start on all nodes when GPFS comes up,
issue this command:
mmtracectl --set --trace=def --trace-recycle=global
See also
Location
/usr/lpp/mmfs/bin
Synopsis
mmumount {Device | MountPoint | DriveLetter | all | all_local | all_remote} [-f ] [-a | -N {Node[,Node...]
| NodeFile | NodeClass}]
Or,
Description
Another name for the mmumount command is the mmunmount command. Either name can be used.
The mmumount command unmounts a previously mounted GPFS file system on one or more nodes in
the cluster. If no nodes are specified, the file systems are unmounted only on the node from which the
command was issued. The file system can be specified using its device name or the mount point where it
is currently mounted.
Use the first form of the command to unmount file systems on nodes that belong to the local cluster.
Use the second form of the command with the -C option when it is necessary to force an unmount of file
systems that are owned by the local cluster, but are mounted on nodes that belong to another cluster.
When a file system is unmounted by force with the second form of the mmumount command, the
affected nodes may still show the file system as mounted, but the data will not be accessible. It is the
responsibility of the system administrator to clear the mount state by issuing the umount command.
When multiple nodes are affected and the unmount target is identified via a mount point or a Windows
drive letter, the mount point is resolved on each of the target nodes. Depending on how the file systems
were mounted, this may result in different file systems being unmounted on different nodes. When in
doubt, always identify the target file system with its device name.
Parameters
Device | MountPoint | DriveLetter | all | all_local | all_remote
Indicates the file system or file systems to be unmounted.
Device Is the device name of the file system to be unmounted. File system names do not need to
be fully qualified. fs0 is as acceptable as /dev/fs0.
MountPoint
Is the location where the GPFS file system to be unmounted is currently mounted.
DriveLetter
Identifies a file system by its Windows drive letter.
all Indicates all file systems that are known to this cluster.
all_local
Indicates all file systems that are owned by this cluster.
all_remote
Indicates all files systems that are owned by another cluster to which this cluster has
access.
This must be the first parameter.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
Examples
1. To unmount file system fs1 on all nodes in the cluster, issue this command:
mmumount fs1 -a
See also
Location
/usr/lpp/mmfs/bin
Synopsis
Description
The mmunlinkfileset command removes the junction to the fileset. The junction can be specified by path
or by naming the fileset that is its target. The unlink fails if there are files open in the fileset, unless the -f
flag is specified. The root fileset may not be unlinked.
Attention: If you are using the TSM Backup Archive client, use caution when you unlink filesets that
contain data backed up by TSM. TSM tracks files by pathname and does not track filesets. As a result,
when you unlink a fileset, it appears to TSM that you deleted the contents of the fileset. Therefore, the
TSM Backup Archive client inactivates the data on the TSM server which may result in the loss of backup
data during the expiration process.
For information on GPFS filesets, see the GPFS: Advanced Administration Guide.
Parameters
Device The device name of the file system that contains the fileset.
File system names need not be fully-qualified. fs0 is as acceptable as /dev/fs0.
FilesetName
Specifies the name of the fileset to be removed.
-J JunctionPath
Specifies the name of the junction to be removed.
A junction is a special directory entry that connects a name in a directory of one fileset to the root
directory of another fileset.
Options
-f Forces the unlink to take place even though there may be open files. This option forcibly closes
any open files, causing an errno of ESTALE on their next use of the file.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
The node on which the command is issued must be able to execute remote shell commands on any other
node in the cluster without the use of a password and without producing any extraneous messages. See
“Requirements for administering a GPFS file system” on page 1.
See also
Location
/usr/lpp/mmfs/bin
Synopsis
mmwinservctl set [--account AccountName [--password Password]] [--remote-shell {yes | no}] [-N
{Node[,Node...] | NodeFile | NodeClass}] [-v]
Or,
Description
mmwinserv is a GPFS for Windows service that is needed for the proper functioning of the GPFS
daemon on nodes running Windows. Optionally, the service can be configured to provide a remote
execution facility for GPFS administration commands.
Use the mmwinservctl command to manage the mmwinserv service. You can set the log on account and
password for the service, enable or disable the service, enable or disable the service's remote execution
facility, or query its current state.
The mmwinservctl command must be run on a Windows node and it has no effect on nodes running
other operating systems.
If the remote execution facility of mmwinserv is enabled, a Windows GPFS cluster can be configured to
use mmwinrsh and mmwinrcp as the remote shell and remote file copy commands:
v mmwinrsh (/usr/lpp/mmfs/bin/mmwinrsh) uses Windows Named Pipes to pass the command to the
target node.
v mmwinrcp (/usr/lpp/mmfs/bin/mmwinrcp) is a wrapper module that invokes the SUA cp command to
copy the files that are needed by the mm commands. The path names on remote hosts are translated
into path names based on the standard Windows ADMIN$ share.
An account must be given the right to log on as a service before it can be used to run mmwinserv. The
right to log on as a service is controlled by the Local Security Policy of each Windows node. You can use
the Domain Group Policy to set the Local Security Policy on all Windows nodes in a GPFS cluster.
For more information on the mmwinserv service, see the topic Configuring the GPFS Administration service
in the GPFS: Concepts, Planning, and Installation Guide.
Parameters
set Sets the service configuration options and restarts the service if it is running.
enable
Sets the service to automatic startup and starts the service.
disable
| Sets the service to disabled and stops the service.
| query Returns information about the service's configuration and current state.
--account AccountName
| Specifies the log on account name for the mmwinserv service. By default, mmwinserv is
| configured to run using the LocalSystem account.
Options
-v Displays progress and intermediate error messages.
Exit status
0 Successful completion.
nonzero
A failure has occurred.
Security
You must be a member of the Domain Admins group to run the mmwinservctl command.
Examples
1. To specify gpfs\root as the log on account name for the mmwinserv service and enable the remote
command execution facility on nodes ls21n19 and ls21n20, issue:
mmwinservctl set -N ls21n19,ls21n20 --account gpfs/root -password abcdefg –remote-shell yes
The system displays information similar to:
Node name Service state Remote shell Account name
----------------------------------------------------------
ls21n19 START_PENDING yes gpfs\root
ls21n20 START_PENDING yes gpfs\root
2. To display the current state of the mmwinserv service on all nodes in the cluster, issue:
mmwinservctl query -N all
Location
/usr/lpp/mmfs/bin
Library
Structure
/* The GPFS ACL */
typedef struct gpfs_acl
{
gpfs_aclLen_t acl_len; /* Total length of this ACL in bytes */
gpfs_aclLevel_t acl_level; /* Reserved (must be zero) */
gpfs_aclVersion_t acl_version; /* POSIX or NFS4 ACL */
gpfs_aclType_t acl_type; /* Access, Default, or NFS4 */
gpfs_aclCount_t acl_nace; /* Number of Entries that follow */
union
{
gpfs_ace_v1_t ace_v1[1]; /* when GPFS_ACL_VERSION_POSIX */
gpfs_ace_v4_t ace_v4[1]; /* when GPFS_ACL_VERSION_NFS4 */
};
} gpfs_acl_t;
Description
The gpfs_acl_t structure contains size, version, and ACL type information for the gpfs_getacl() and
gpfs_putacl() subroutines.
Members
acl_len
The total length (in bytes) of this gpfs_acl_t structure.
acl_level
Reserved for future use. Currently must be zero.
acl_version
This field contains the version of the GPFS ACL. GPFS supports two ACL versions:
GPFS_ACL_VERSION_POSIX and GPFS_ACL_VERSION_NFS4. On input to the gpfs_getacl()
subroutine, set this field to zero.
acl_type
On input to the gpfs_getacl() subroutine, set this field to:
v Either GPFS_ACL_TYPE_ACCESS or GPFS_ACL_TYPE_DEFAULT for POSIX ACLs
v GPFS_ACL_TYPE_NFS4 for NFS ACLs.
These constants are defined in the gpfs.h header file.
acl_nace
The number of ACL entries that are in the array (ace_v1 or ace_v4).
Location
Library
Synopsis
#include <gpfs.h>
void gpfs_close_inodescan(gpfs_iscan_t *iscan);
Description
The gpfs_close_inodescan() subroutine closes the scan of the inodes in a file system or snapshot that was
opened with the gpfs_open_inodescan() subroutine. The gpfs_close_inodescan() subroutine frees all
storage used for the inode scan and invalidates the iscan handle.
Note:
1. Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
iscan Pointer to the inode scan handle.
Exit status
Exceptions
None.
Error status
None.
Examples
Location
Library
Synopsis
#include <gpfs.h>
int gpfs_cmp_fssnapid (const gpfs_fssnap_id_t *fssnapId1,
const gpfs_fssnap_id_t *fssnapId2,
int *result);
Description
The gpfs_cmp_fssnapid() subroutine compares two snapshot IDs for the same file system to determine
the order in which the two snapshots were taken. The result parameter is set as follows:
v result less than zero indicates that snapshot 1 was taken before snapshot 2.
v result equal to zero indicates that snapshot 1 and 2 are the same.
v result greater than zero indicates that snapshot 1 was taken after snapshot 2.
Note:
1. Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
fssnapId1
File system snapshot ID of the first snapshot.
fssnapId2
File system snapshot ID of the second snapshot.
result Pointer to an integer indicating the outcome of the comparison.
Exit status
If the gpfs_cmp_fssnapid() subroutine is successful, it returns a value of 0 and the result parameter is set
as described above.
If the gpfs_cmp_fssnapid() subroutine is unsuccessful, it returns a value of -1 and the global error
variable errno is set to indicate the nature of the error.
Exceptions
None.
Error status
EDOM
The two snapshots cannot be compared because they were taken from two different file systems.
ENOSYS
The gpfs_cmp_fssnapid() subroutine is not available.
Location
Library
Structure
typedef struct gpfs_direntx
{
int d_version; /*this struct’s version*/
unsigned short d_reclen; /*actual size of this struct including
null-terminated variable-length d_name*/
unsigned short d_type; /*types are defined below*/
gpfs_ino_t d_ino; /*file inode number*/
gpfs_gen_t d_gen; /*generation number for the inode*/
char d_name[256]; /*null-terminated variable-length name*/
} gpfs_direntx_t;
Description
Members
d_version
The version number of this structure.
d_reclen
The actual size of this structure including the null-terminated variable-length d_name field.
To allow some degree of forward compatibility, careful callers should use the d_reclen field for
the size of the structure rather than the sizeof() function.
d_type
The type of directory.
d_ino The directory inode number.
d_gen The directory generation number.
d_name
Null-terminated variable-length name of the directory.
Examples
Location
Library
Structure
{
int d_version; /* this struct’s version */
unsigned short d_reclen; /* actual size of this struct including
null terminated variable length d_name */
unsigned short d_type; /* Types are defined below */
gpfs_ino64_t d_ino; /* File inode number */
gpfs_gen64_t d_gen; /* Generation number for the inode */
char d_name[256]; /* null terminated variable length name */
} gpfs_direntx64_t;
Description
Members
d_version
The version number of this structure.
d_reclen
The actual size of this structure including the null-terminated variable-length d_name field.
To allow some degree of forward compatibility, careful callers should use the d_reclen field for
the size of the structure rather than the sizeof() function.
d_type
The type of directory.
d_ino The directory inode number.
d_gen The directory generation number.
d_name
Null-terminated variable-length name of the directory.
Examples
Location
/usr/lpp/mmfs/lib/libgpfs.a for AIX, /usr/lpp/mmfs/lib/libgpfs.so for Linux
Library
Synopsis
#include <gpfs.h>
int gpfs_fcntl(int fileDesc, void* fcntlArgP)
Description
The gpfs_fcntl() subroutine is used to pass file access pattern information and to control certain file
attributes on behalf of an open file. More than one operation can be requested with a single invocation of
gpfs_fcntl(). The type and number of operations is determined by the second operand, fcntlArgP, which
is a pointer to a data structure built in memory by the application. This data structure consists of:
v A fixed length header, mapped by gpfsFcntlHeader_t.
v A variable list of individual file access hints, directives or other control structures:
– File access hints:
1. gpfsAccessRange_t
2. gpfsFreeRange_t
3. gpfsMultipleAccessRange_t
4. gpfsClearFileCache_t
– File access directives:
1. gpfsCancelHints_t
2. gpfsDataShipMap_t
3. gpfsDataShipStart_t
4. gpfsDataShipStop_t
– Other file attribute operations:
1. gpfsGetFilesetName_t
2. gpfsGetReplication_t
3. gpfsGetSnapshotName_t
4. gpfsGetStoragePool_t
5. gpfsRestripeData_t
6. gpfsSetReplication_t
7. gpfsSetStoragePool_t
The above hints, directives and other operations may be mixed within a single gpfs_fcntl() subroutine,
and are performed in the order that they appear. A subsequent hint or directive may cancel out a
preceding one. See Chapter 7, “Communicating file access patterns to GPFS,” on page 63.
Note:
1. Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Exit status
If the gpfs_fcntl() subroutine is unsuccessful, it returns a value of -1 and sets the global error variable
errno to indicate the nature of the error.
Exceptions
None.
Error status
EBADF
The file descriptor is not valid.
EINVAL
The file descriptor does not refer to a GPFS file or a regular file.
The system call is not valid.
ENOSYS
The gpfs_fcntl() subroutine is not supported under the current file system format.
Examples
1. This programming segment releases all cache data held by the file handle and tell GPFS that the
subroutine will write the portion of the file with file offsets between 2 GB and 3 GB minus one:
struct
{
gpfsFcntlHeader_t hdr;
gpfsClearFileCache_t rel;
gpfsAccessRange_t acc;
} arg;
arg.hdr.totalLength = sizeof(arg);
arg.hdr.fcntlVersion = GPFS_FCNTL_CURRENT_VERSION;
arg.hdr.fcntlReserved = 0;
arg.rel.structLen = sizeof(arg.rel);
arg.rel.structType = GPFS_CLEAR_FILE_CACHE;
arg.acc.structLen = sizeof(arg.acc);
arg.acc.structType = GPFS_ACCESS_RANGE;
arg.acc.start = 2LL * 1024LL * 1024LL * 1024LL;
arg.acc.length = 1024 * 1024 * 1024;
arg.acc.isWrite = 1;
rc = gpfs_fcntl(handle, &arg);
2. This programming segment gets the storage pool name and fileset name of a file from GPFS.
struct {
gpfsFcntlHeader_t hdr;
gpfsGetStoragePool_t pool;
gpfsGetFilesetName_t fileset;
} fcntlArg;
fcntlArg.hdr.totalLength = sizeof(fcntlArg.hdr) + sizeof(fcntlArg.pool) + sizeof(fcntlArg.fileset);
fcntlArg.hdr.fcntlVersion = GPFS_FCNTL_CURRENT_VERSION;
fcntlArg.hdr.fcntlReserved = 0;
fcntlArg.fileset.structLen = sizeof(fcntlArg.fileset);
fcntlArg.fileset.structType = GPFS_FCNTL_GET_FILESETNAME;
rc = gpfs_fcntl(fd, &fcntlArg);
Location
Library
Synopsis
#include <gpfs.h>
int gpfs_fgetattrs(int fileDesc, int flags, void *bufferP,
int bufferSize, int *attrSizeP)
Description
The gpfs_fgetattrs() subroutine, together with gpfs_fputattrs(), is intended for use by a backup program
to save (gpfs_fgetattrs()) and restore (gpfs_fputattrs()) extended file attributes such as ACLs, DMAPI
attributes, and other information for the file. If the file has no extended attributes, the gpfs_fgetattrs()
subroutine returns a value of 0, but sets attrSizeP to 0 and leaves the contents of the buffer unchanged.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
fileDesc
The file descriptor identifying the file whose extended attributes are being retrieved.
flags Must have one of these three values:
GPFS_ATTRFLAG_DEFAULT
v Saves the attributes for file placement
v Saves the currently assigned storage pool
GPFS_ATTRFLAG_NO_PLACEMENT
v Does not save attributes for file placement
v Does not save the currently assigned storage pool
GPFS_ATTRFLAG_IGNORE_POOL
v Saves attributes for file placement
v Does not save the currently assigned storage pool
bufferP
Pointer to a buffer to store the extended attribute information.
bufferSize
The size of the buffer that was passed in.
attrSizeP
If successful, returns the actual size of the attribute information that was stored in the buffer. If
the bufferSize was too small, returns the minimum buffer size.
Exit status
Exceptions
None.
Error status
EBADF
The file descriptor is not valid.
EFAULT
The address is not valid.
EINVAL
The file descriptor does not refer to a GPFS file.
ENOSPC
bufferSize is too small to return all of the attributes. On return, *attrSizeP is set to the required
size.
ENOSYS
The gpfs_fgetattrs() subroutine is not supported under the current file system format.
Location
Library
Synopsis
#include <gpfs.h>
int gpfs_fputattrs(int fileDesc, int flags, void *bufferP)
Description
The gpfs_fputattrs() subroutine, together with gpfs_fgetattrs(), is intended for use by a backup program
to save (gpfs_fgetattrs()) and restore (gpfs_fputattrs()) all of the extended attributes of a file. This
subroutine also sets the storage pool for the file and sets data replication to the values that are saved in
the extended attributes.
If the saved storage pool is not valid or if the IGNORE_POOL flag is set, GPFS will select the storage
pool by matching a PLACEMENT rule using the saved file attributes. If GPFS fails to match a placement
rule or if there are no placement rules installed, GPFS assigns the file to the system storage pool.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
fileDesc
The file descriptor identifying the file whose extended attributes are being set.
flags Non-placement attributes such as ACLs are always restored, regardless of value of the flag. Flags
must have one of these three values:
GPFS_ATTRFLAG_DEFAULT
Restores the previously assigned storage pool and previously assigned data replication.
GPFS_ATTRFLAG_NO_PLACEMENT
Does not change storage pool and data replication.
GPFS_ATTRFLAG_IGNORE_POOL
Selects storage pool and data replication by matching the saved attributes to a placement
rule instead of restoring the saved storage pool.
bufferP
A pointer to the buffer containing the extended attributes for the file.
If you specify a value of NULL, all extended ACLs for the file are deleted.
Exit status
If the gpfs_fputattrs() subroutine is unsuccessful, it returns a value of -1 and sets the global error variable
errno to indicate the nature of the error.
None.
Error status
EBADF
The file descriptor is not valid.
EINVAL
The buffer pointed to by bufferP does not contain valid attribute data, or the file descriptor does
not refer to a GPFS file.
ENOSYS
The gpfs_fputattrs() subroutine is not supported under the current file system format.
Examples
Location
Library
Synopsis
#include <gpfs.h>
int gpfs_fputattrswithpathname(int fileDesc, int flags, void *bufferP, const char *pathname)
Description
The gpfs_fputattrswithpathname() subroutine sets all of the extended attributes of a file. In addition,
gpfs_fputattrswithpathname() invokes the policy engine using the saved attributes to match a RESTORE
rule to set the storage pool and the data replication for the file. The caller should include the full path to
the file (including the file name) to allow rule selection based on file name or path. If the file fails to
match a RESTORE rule or if there are no RESTORE rules installed, GPFS selects the storage pool and
data replication as it does when calling gpfs_fputattrs().
Note: Compile any program that uses this subroutine with the -lgpfs flag from one the following
libraries:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
fileDesc
Is the file descriptor that identifies the file whose extended attributes are to be set.
flags Non-placement attributes such as ACLs are always restored, regardless of value of the flag. Flags
must have one of these three values:
GPFS_ATTRFLAG_DEFAULT
Uses the saved attributes to match a RESTORE rule to set the storage pool and the data
replication for the file.
GPFS_ATTRFLAG_NO_PLACEMENT
Does not change storage pool and data replication.
GPFS_ATTRFLAG_IGNORE_POOL
Checks the file to see if it matches a RESTORE rule. If the file fails to match a RESTORE
rule, GPFS ignores the saved storage pool and selects a pool by matching the saved
attributes to a PLACEMENT rule.
bufferP
A pointer to the buffer containing the extended attributes for the file.
If you specify a value of NULL, all extended ACLs for the file are deleted.
Exit status
None.
Error status
EBADF
The file descriptor is not valid.
EINVAL
The buffer to which bufferP points does not contain valid attribute data.
ENOENT
No such file or directory.
ENOSYS
The gpfs_fputattrswithpathname() subroutine is not supported under the current file system
format.
Examples
Refer to “gpfs_fputattrs() Subroutine” on page 335 for examples.
Location
Library
Synopsis
#include <gpfs.h>
void gpfs_free_fssnaphandle(gpfs_fssnap_handle_t *fssnapHandle);
Description
The gpfs_free_fssnaphandle() subroutine frees the snapshot handle that is passed. The return value is
always void.
Note:
1. Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
fssnapHandle
File system snapshot handle.
Exit status
Exceptions
None.
Error status
None.
Examples
Location
Library
Structure
typedef struct gpfs_fssnap_handle gpfs_fssnap_handle_t;
Description
A file system or snapshot is uniquely identified by an fssnapId of type gpfs_fssnap_id_t. While the
fssnapId is permanent and global, a shorter fssnapHandle is used by the backup application
programming interface to identify the file system and snapshot being accessed. The fssnapHandle, like a
POSIX file descriptor, is volatile and may be used only by the program that created it.
Additional subroutines are provided to obtain the permanent, global fssnapId from the fssnapHandle, or
to obtain the path or the names for the file system and snapshot, if they are still available in the file
system.
The file system must be mounted in order to use the backup programming application interface. If the
fssnapHandle is created by the path name, the path may be relative and may specify any file or directory
in the file system. Operations on a particular snapshot are indicated with a path to a file or directory
within that snapshot. If the fssnapHandle is created by name, the file system's unique name may be
specified (for example, fs1) or its device name may be provided (for example, /dev/fs1). To specify an
operation on the active file system, the pointer to the snapshot's name should be set to NULL or a
zero-length string provided.
The name of the directory under which all snapshots appear may be obtained by the
gpfs_get_snapdirname() subroutine. By default this is .snapshots, but it can be changed using the
mmsnapdir command. The gpfs_get_snapdirname() subroutine returns the currently set value, which is
the one that was last set by the mmsnapdir command, or the default, if it was never changed.
Members
gpfs_fssnap_handle
File system snapshot handle
Examples
Location
Library
Structure
typedef struct gpfs_fssnap_id
{
char opaque[48];
} gpfs_fssnap_id_t;
Description
A file system or snapshot is uniquely identified by an fssnapId of type gpfs_fssnap_id_t. The fssnapId is
a permanent and global identifier that uniquely identifies an active file system or a read-only snapshot of
a file system. Every snapshot of a file system has a unique identifier that is also different from the
identifier of the active file system itself.
The fssnapId is obtained from an open fssnapHandle. Once obtained, the fssnapId should be stored
along with the file system's data for each backup. The fssnapId is required to generate an incremental
backup. The fssnapId identifies the previously backed up file system or snapshot and allows the inode
scan to return only the files and data that have changed since that previous scan.
Members
opaque
A 48 byte area for containing the snapshot identifier.
Examples
Location
Library
Synopsis
#include <gpfs.h>
int gpfs_fstat(int fileDesc, struct stat64 *Buffer)
Description
The gpfs_fstat() subroutine is used to obtain exact information about the file associated with the FileDesc
parameter. This subroutine is provided as an alternative to the stat() subroutine, which may not provide
exact mtime and atime values. See “Exceptions to Open Group technical standards” on page 467.
read, write, or execute permission for the named file is not required, but all directories listed in the path
leading to the file must be searchable. The file information is written to the area specified by the Buffer
parameter.
Note:
1. Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
fileDesc
The file descriptor identifying the file for which exact status information is requested.
Buffer A pointer to the stat64 structure in which the information is returned. The stat64 structure is
described in the sys/stat.h file.
Exit status
If the gpfs_fstat() subroutine is unsuccessful, it returns a value of -1 and sets the global error variable
errno to indicate the nature of the error.
Exceptions
None.
Error status
EBADF
The file descriptor is not valid.
EINVAL
The file descriptor does not refer to a GPFS file or a regular file.
ENOSYS
The gpfs_fstat() subroutine is not supported under the current file system format.
Location
Library
Synopsis
#include <gpfs.h>
const char *gpfs_get_fsname_from_fssnaphandle
(gpfs_fssnap_handle_t *fssnapHandle);
Description
The gpfs_get_fsname_from_fssnaphandle() subroutine returns a pointer to the name of file system that is
uniquely identified by the file system snapshot handle.
Note:
1. Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
fssnapHandle
File system snapshot handle.
Exit status
Exceptions
None.
Error status
ENOSYS
The gpfs_get_fsname_from_fssnaphandle() subroutine is not available.
EPERM
The caller does not have superuser privileges.
GPFS_E_INVAL_FSSNAPHANDLE
The file system snapshot handle is not valid.
Location
Library
Synopsis
#include <gpfs.h>
gpfs_fssnap_handle_t *gpfs_get_fssnaphandle_by_fssnapid
(const gpfs_fssnap_id_t *fssnapId);
Description
The gpfs_get_fssnaphandle_by_fssnapid() subroutine creates a handle for the file system or snapshot
that is uniquely identified by the permanent, unique snapshot ID.
Note:
1. Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
fssnapId
File system snapshot ID
Exit status
Exceptions
None.
Error status
ENOMEM
Space could not be allocated for the file system snapshot handle.
ENOSYS
The gpfs_get_fssnaphandle_by_fssnapid() subroutine is not available.
EPERM
The caller does not have superuser privileges.
GPFS_E_INVAL_FSSNAPID
The file system snapshot ID is not valid.
Location
Library
Synopsis
#include <gpfs.h>
gpfs_fssnap_handle_t *gpfs_get_fssnaphandle_by_name
(const char *fsName, const char *snapName);
Description
The gpfs_get_fssnaphandle_by_name() subroutine creates a handle for the file system or snapshot that is
uniquely identified by the file system's name and the name of the snapshot.
Note:
1. Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
fsName
A pointer to the name of the file system whose snapshot handle is desired.
snapName
A pointer to the name of the snapshot whose snapshot handle is desired, or NULL to access the
active file system rather than a snapshot within the file system.
Exit status
If the gpfs_get_fssnaphandle_by_name() subroutine is unsuccessful, it returns NULL and sets the global
error variable errno to indicate the nature of the error.
Exceptions
None.
Error status
ENOENT
The file system name is not valid.
ENOMEM
Space could not be allocated for the file system snapshot handle.
ENOSYS
The gpfs_get_fssnaphandle_by_name() subroutine is not available.
EPERM
The caller does not have superuser privileges.
Examples
Location
Name
gpfs_get_fssnaphandle_by_path()
Library
Synopsis
#include <gpfs.h>
gpfs_fssnap_handle_t *gpfs_get_fssnaphandle_by_path
(const char *pathName);
Description
The gpfs_get_fssnaphandle_by_path() subroutine creates a handle for the file system or snapshot that is
uniquely identified by a path through the file system's mount point to a file or directory within the file
system or snapshot.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
pathName
A pointer to the path name to a file or directory within the desired file system or snapshot.
Exit status
If the gpfs_get_fssnaphandle_by_path() subroutine is unsuccessful, it returns NULL and sets the global
error variable errno to indicate the nature of the error.
Exceptions
None.
Error status
ENOENT
The path name is not valid.
ENOMEM
Space could not be allocated for the file system snapshot handle.
ENOSYS
The gpfs_get_fssnaphandle_by_path() subroutine is not available.
EPERM
The caller does not have superuser privileges.
Location
Library
Synopsis
#include <gpfs.h>
int gpfs_get_fssnapid_from_fssnaphandle
(gpfs_fssnap_handle_t *fssnapHandle,
gpfs_fssnap_id_t *fssnapId);
Description
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
fssnapHandle
File system snapshot handle.
fssnapId
File system snapshot ID.
Exit status
Exceptions
None.
Error status
EFAULT
Size mismatch for fssnapId.
EINVAL
NULL pointer given for returned fssnapId.
ENOSYS
The gpfs_get_fssnapid_from_fssnaphandle() subroutine is not available.
GPFS_E_INVAL_FSSNAPHANDLE
The file system snapshot handle is not valid.
Location
Library
Synopsis
#include <gpfs.h>
const char *gpfs_get_pathname_from_fssnaphandle
(gpfs_fssnap_handle_t *fssnapHandle);
Description
The gpfs_get_pathname_from_fssnaphandle() subroutine obtains the path name of the file system or
snapshot identified by the open file system snapshot handle.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
fssnapHandle
File system snapshot handle.
Exit status
Exceptions
None.
Error status
ENOSYS
The gpfs_get_pathname_from_fssnaphandle() subroutine is not available.
EPERM
The caller does not have superuser privileges.
GPFS_E_INVAL_FSSNAPHANDLE
The file system snapshot handle is not valid.
Examples
For an example using gpfs_get_pathname_from_fssnaphandle(), see /usr/lpp/mmfs/samples/util/
tsbackup.C.
Location
Library
Synopsis
#include <gpfs.h>
int gpfs_get_snapdirname(gpfs_fssnap_handle_t *fssnapHandle,
char *snapdirName, int bufLen);
Description
The gpfs_get_snapdirname() subroutine obtains the name of the directory that is used to contain
snapshots.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
fssnapHandle
File system snapshot handle.
snapdirName
Buffer into which the name of the snapshot directory will be copied.
bufLen
The size of the provided buffer.
Exit status
If the gpfs_get_snapdirname() subroutine is successful, it returns a value of 0 and the snapdirName and
bufLen parameters are set as described above.
If the gpfs_get_snapdirname() subroutine is unsuccessful, it returns a value of -1 and the global error
variable errno is set to indicate the nature of the error.
Exceptions
None.
Error status
ENOMEM
Unable to allocate memory for this request.
ENOSYS
The gpfs_get_snapdirname() subroutine is not available.
EPERM
The caller does not have superuser privileges.
ERANGE
The buffer is too small to return the snapshot directory name.
Examples
Location
Library
Synopsis
#include <gpfs.h>
const char *gpfs_get_snapname_from_fssnaphandle
(gpfs_fssnap_handle_t *fssnapHandle);
Description
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
fssnapHandle
File system snapshot handle.
Exit status
Exceptions
None.
Error status
ENOSYS
The gpfs_get_snapname_from_fssnaphandle() subroutine is not available.
EPERM
The caller does not have superuser privileges.
GPFS_E_INVAL_FSSNAPHANDLE
The file system snapshot handle is not valid.
GPFS_E_INVAL_SNAPNAME
The snapshot has been deleted.
Location
Library
Synopsis
#include <gpfs.h>
int gpfs_getacl(char *pathname, int flags, void *aclP);
Description
The gpfs_getacl() subroutine, together with the gpfs_putacl() subroutine, is intended for use by a backup
program to save (gpfs_getacl()) and restore (gpfs_putacl()) the ACL information for the file.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
pathname
The path identifying the file for which the ACLs are being obtained.
flags Consists of one of these values:
0 Indicates that the aclP parameter is to be mapped with the gpfs_opaque_acl_t struct.
The gpfs_opaque_acl_t struct should be used by backup and restore programs.
GPFS_GETACL_STRUCT
Indicates that the aclP parameter is to be mapped with the gpfs_acl_t struct.
The gpfs_acl_t struct is provided for applications that need to interpret the ACL.
aclP Pointer to a buffer mapped by the structure gpfs_opaque_acl_t or gpfs_acl_t, depending on the
value of flags.
The first four bytes of the buffer must contain its total size.
Exit status
If the gpfs_getacl() subroutine is unsuccessful, it returns a value of -1 and sets the global error variable
errno to indicate the nature of the error.
Exceptions
None.
Error status
EINVAL
The path name does not refer to a GPFS file or a regular file.
ENOMEM
Unable to allocate memory for request.
Location
Library
Structure
typedef struct gpfs_iattr
{
int ia_version; /* this struct version */
int ia_reclen; /* sizeof this structure */
int ia_checksum; /* validity check on iattr struct */
gpfs_mode_t ia_mode; /* access mode */
gpfs_uid_t ia_uid; /* owner uid */
gpfs_gid_t ia_gid; /* owner gid */
gpfs_ino_t ia_inode; /* file inode number */
gpfs_gen_t ia_gen; /* inode generation number */
short ia_nlink; /* number of links */
short ia_flags; /* Flags (defined below) */
int ia_blocksize; /* preferred block size for io */
gpfs_mask_t ia_mask; /* Initial attribute mask (not used) */
gpfs_off64_t ia_size; /* file size in bytes */
gpfs_off64_t ia_blocks; /* 512 byte blocks of disk held by file */
gpfs_timestruc_t ia_atime; /* time of last access */
gpfs_timestruc_t ia_mtime; /* time of last data modification */
gpfs_timestruc_t ia_ctime; /* time of last status change */
gpfs_dev_t ia_rdev; /* id of device */
unsigned int ia_xperm; /* extended attributes (defined below) */
unsigned int ia_modsnapid; /* snapshot id of last modification */
unsigned int ia_filesetid; /* fileset ID */
unsigned int ia_datapoolid; /* storage pool ID for data */
} gpfs_iattr_t;
Description
Members
ia_version
The version number of this structure.
ia_reclen
The size of this structure.
ia_checksum
The checksum for this gpfs_iattr structure.
Examples
Location
Library
Structure
typedef struct gpfs_iattr64
{
int ia_version; /* this struct version */
int ia_reclen; /* sizeof this structure */
int ia_checksum; /* validity check on iattr struct */
gpfs_mode_t ia_mode; /* access mode */
gpfs_uid64_t ia_uid; /* owner uid */
gpfs_gid64_t ia_gid; /* owner gid */
gpfs_ino64_t ia_inode; /* file inode number */
gpfs_gen64_t ia_gen; /* inode generation number */
long long ia_nlink; /* number of links */
gpfs_off64_t ia_size; /* file size in bytes */
gpfs_off64_t ia_blocks; /* 512 byte blocks of disk held by file */
gpfs_timestruc64_t ia_atime; /* time of last access */
unsigned int ia_winflags; /* window’s flags (defined below) */
gpfs_timestruc64_t ia_mtime; /* time of last data modification */
unsigned int ia_flags; /* flags (defined below) */
gpfs_timestruc64_t ia_ctime; /* time of last status change */
int ia_blocksize; /* preferred block size for io */
gpfs_timestruc64_t ia_createtime; /* creation time */
gpfs_mask_t ia_mask; /* initial attribute mask (not used) */
gpfs_timestruc64_t ia_etime; /* expiration time */
unsigned int ia_xperm; /* extended attributes (defined below) */
gpfs_dev_t ia_dev; /* id of device containing file */
gpfs_dev_t ia_rdev; /* device id (if special file) */
gpfs_snapid64_t ia_modsnapid; /* snapshot id of last modification */
unsigned int ia_filesetid; /* fileset ID */
unsigned int ia_datapoolid; /* storage pool ID for data */
unsigned int ia_pcacheflags; /* pcache inode bits */
unsigned int ia_unused[10]; /* reserved space */
} gpfs_iattr64_t;
#ifdef GPFS_64BIT_INODES
#undef GPFS_IA_VERSION
#define GPFS_IA_VERSION GPFS_IA_VERSION64
#define gpfs_iattr_t gpfs_iattr64_t
#endif
Description
Members
ia_version
The version number of this structure.
ia_reclen
The size of this structure.
ia_checksum
The checksum for this gpfs_iattr structure.
ia_mode
The access mode for this inode.
ia_uid The owner user ID for this inode.
ia_gid
The owner group ID for this inode.
Location
Library
Synopsis
#include <gpfs.h>
int gpfs_iclose(gpfs_ifile_t *ifile);
Description
For an overview of using gpfs_iclose() in a backup application, see “Using APIs to develop backup
applications” on page 31.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
ifile Pointer to gpfs_ifile_t from gpfs_iopen().
Exit status
If the gpfs_iclose() subroutine is unsuccessful, it returns a value of -1 and sets the global error variable
errno to indicate the nature of the error.
Exceptions
None.
Error status
ENOSYS
The gpfs_iclose() subroutine is not available.
EPERM
The caller does not have superuser privileges.
ESTALE
Cached file system information was not valid.
Examples
Location
Library
Structure
typedef struct gpfs_ifile gpfs_ifile_t;
Description
The gpfs_ifile_t structure contains a handle for the file of a GPFS inode.
Members
gpfs_ifile
The handle for the file of a GPFS inode.
Examples
Location
Library
Synopsis
#include <gpfs.h>
int gpfs_igetattrs(gpfs_ifile_t *ifile,
void *buffer,
int bufferSize,
int *attrSize);
Description
The gpfs_igetattrs() subroutine retrieves all extended file attributes in opaque format. This subroutine is
intended for use by a backup program to save all extended file attributes (ACLs, attributes, and so forth).
If the file does not have any extended attributes, the subroutine sets attrSize to zero.
Notes:
1. This call does not return extended attributes used for the Data Storage Management (XDSM) API (also
known as DMAPI).
2. Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
ifile Pointer to gpfs_ifile_t from gpfs_iopen().
buffer Pointer to buffer for returned attributes.
bufferSize
Size of the buffer.
attrSize
Pointer to returned size of attributes.
Exit status
If the gpfs_igetattrs() subroutine is unsuccessful, it returns a value of -1 and sets the global error variable
errno to indicate the nature of the error.
Exceptions
None.
Error status
ENOSPC
The buffer is too small to return all attributes. Field *attrSizeP will be set to the size necessary.
ENOSYS
The gpfs_igetattrs() subroutine is not available.
Location
| Library
| Synopsis
| #include <gpfs.h>
| int gpfs_igetattrsx(gpfs_ifile_t *ifile,
| int flags,
| void *buffer,
| int bufferSize,
| int *attrSize);
| Description
| The gpfs_igetattrsx() subroutine retrieves all extended file attributes in opaque format. It provides the
| same function as gpfs_igetattrs but includes a flags parameter that allows the caller to back up and
| restore DMAPI attributes.
| This function is intended for use by a backup program to save (and restore, using the related subroutine
| gpfs_iputattrsx) all extended file attributes (ACLs, user attributes, and so forth) in one call. If the file
| does not have any extended attributes, the subroutine sets attrSize to zero.
| Notes:
| 1. This call can optionally return extended attributes used for the Data Storage Management (XDSM)
| API (also known as DMAPI).
| 2. Compile any program that uses this subroutine with the -lgpfs flag from the following library:
| v libgpfs.a for AIX
| v libgpfs.so for Linux
| Parameters
| ifile Pointer to gpfs_ifile_t from gpfs_iopen().
| flags Flags must have one of these three values:
| GPFS_ATTRFLAG_NO_PLACEMENT
| File attributes for placement are not saved, and neither is the current storage pool.
| GPFS_ATTRFLAG_IGNORE_PLACEMENT
| File attributes for placement are saved, but the current storage pool is not.
| GPFS_ATTRFLAG_INCL_DMAPI
| File attributes for DMAPI are included in the returned buffer.
| buffer A pointer to the buffer for returned attributes.
| bufferSize
| Size of the buffer.
| attrSize
| Pointer to returned size of attributes.
| If the gpfs_igetattrsx() subroutine is unsuccessful, it returns a value of -1 and sets the global error
| variable errno to indicate the nature of the error.
| Exceptions
| None.
| Error status
| EINVAL
| Not a GPFS file, or the flags provided are not valid.
| ENOSPC
| The buffer is too small to return all attributes. Field *attrSizeP will be set to the size necessary.
| ENOSYS
| The gpfs_igetattrsx() subroutine is not available.
| Location
Library
Synopsis
#include <gpfs.h>
int gpfs_igetfilesetname(gpfs_iscan_t *iscan,
unsigned int filesetId,
void *buffer,
int bufferSize);
Description
The gpfs_igetfilesetname() subroutine is part of the backup by inode interface. The caller provides a
pointer to the scan descriptor used to obtain the fileset ID. This library routine will return the name of
the fileset defined by the fileset ID. The name is the null-terminated string provided by the administrator
when the fileset was defined. The maximum string length is defined by GPFS_MAXNAMLEN.
Note:
1. This routine is not thread safe. Only one thread at a time is allowed to invoke this routine for the
given scan descriptor.
2. Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
iscan Pointer to gpfs_iscan_t used to obtain the fileset ID.
filesetId
The fileset ID.
buffer Pointer to buffer for returned attributes.
bufferSize
Size of the buffer.
Exit status
If the gpfs_igetfilesetname() subroutine is unsuccessful, it returns a value of -1 and sets the global error
variable errno to indicate the nature of the error.
Exceptions
None.
Error status
E2BIG The buffer is too small to return the fileset name.
EINTR
The call was interrupted. This routine is not thread safe.
Examples
This programming segment gets the fileset name based on the given fileset ID. The returned fileset name
is stored in FileSetNameBuffer, which has a length of FileSetNameSize.
gpfs_iscan_t *fsInodeScanP;
gpfs_igetfilesetname(fsInodeScanP,FileSetId, &FileSetNameBuffer,FileSetNameSize);
Location
Library
Synopsis
#include <gpfs.h>
int gpfs_igetstoragepool(gpfs_iscan_t *iscan,
unsigned int dataPoolId,
void *buffer,
int bufferSize);
Description
The gpfs_igetstoragepool() subroutine is part of the backup by inode interface. The caller provides a
pointer to the scan descriptor used to obtain the storage pool ID. This routine returns the name of the
storage pool for the given storage pool ID. The name is the null-terminated string provided by the
administrator when the storage pool was defined. The maximum string length is defined by
GPFS_MAXNAMLEN.
Note:
1. This routine is not thread safe. Only one thread at a time is allowed to invoke this routine for the
given scan descriptor.
2. Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
iscan Pointer to gpfs_iscan_t used to obtain the storage pool ID.
dataPoolId
The storage pool ID.
buffer Pointer to buffer for returned attributes.
bufferSize
Size of the buffer.
Exit status
If the gpfs_igetstoragepool() subroutine is unsuccessful, it returns a value of -1 and sets the global error
variable errno to indicate the nature of the error.
Exceptions
None.
Error status
E2BIG The buffer is too small to return the storage pool name.
Examples
This programming segment gets the storage pool name based on the given storage pool ID. The returned
storage pool name is stored in StoragePoolNameBuffer which has the length of StoragePoolNameSize.
gpfs_iscan_t *fsInodeScanP;
gpfs_igetstoragepool(fsInodeScanP,StgpoolIdBuffer, &StgpoolNameBuffer,StgpoolNameSize);
Location
Library
Synopsis
#include <gpfs.h>
gpfs_ifile_t *gpfs_iopen(gpfs_fssnap_handle_t *fssnapHandle,
gpfs_ino_t ino,
int open_flags,
const gpfs_iattr_t *statxbuf,
const char *symlink);
Description
The gpfs_iopen() subroutine opens a user file or directory for backup. The file is identified by its inode
number ino within the file system or snapshot identified by the fssnapHandle. The fssnapHandle
parameter must be the same one that was used to create the inode scan that returned the inode number
ino.
To read the file or directory, the open_flags must be set to GPFS_O_BACKUP. The statxbuf and symlink
parameters are reserved for future use and must be set to NULL.
For an overview of using gpfs_iopen() in a backup application, see “Using APIs to develop backup
applications” on page 31.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
fssnapHandle
File system snapshot handle.
ino inode number.
open_flags
GPFS_O_BACKUP
Read files for backup.
O_RDONLY
For gpfs_iread().
statxbuf
This parameter is reserved for future use and should always be set to NULL.
symlink
This parameter is reserved for future use and should always be set to NULL.
Exit status
If the gpfs_iopen() subroutine is successful, it returns a pointer to the inode's file handle.
If the gpfs_iopen() subroutine is unsuccessful, it returns NULL and the global error variable errno is set
to indicate the nature of the error.
Chapter 9. GPFS programming interfaces 375
Exceptions
None.
Error status
EINVAL
Missing or incorrect parameter.
ENOMEM
Unable to allocate memory for request.
ENOSYS
The gpfs_iopen() subroutine is not available.
EPERM
The caller does not have superuser privileges.
ESTALE
Cached file system information was not valid.
GPFS_E_INVAL_FSSNAPHANDLE
The file system snapshot handle is not valid.
Examples
Location
Library
Synopsis
#include <gpfs.h>
gpfs_iopen64(gpfs_fssnap_handle_t *fssnapHandle,
gpfs_ino64_t ino,
int open_flags,
const gpfs_iattr64_t *statxbuf,
const char *symLink);
Description
The gpfs_iopen64() subroutine opens a user file or directory for backup. The file is identified by its inode
number ino within the file system or snapshot identified by the fssnapHandle. The fssnapHandle
parameter must be the same one that was used to create the inode scan that returned the inode number
ino.
To read the file or directory, the open_flags must be set to GPFS_O_BACKUP. The statxbuf and symlink
parameters are reserved for future use and must be set to NULL.
For an overview of using gpfs_iopen64() in a backup application, see “Using APIs to develop backup
applications” on page 31.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
fssnapHandle
The file system snapshot handle.
ino The inode number.
open_flags
GPFS_O_BACKUP
Read files for backup.
O_RDONLY
For gpfs_iread().
statxbuf
This parameter is reserved for future use and should always be set to NULL.
symlink
This parameter is reserved for future use and should always be set to NULL.
Exit status
If the gpfs_iopen64() subroutine is successful, it returns a pointer to the inode's file handle.
If the gpfs_iopen64() subroutine is unsuccessful, it returns NULL and the global error variable errno is
set to indicate the nature of the error.
Chapter 9. GPFS programming interfaces 377
Exceptions
None.
Error status
EFORMAT
The file system version number is not valid.
EINVAL
Missing or incorrect parameter.
ENOMEM
Unable to allocate memory for request.
ENOSYS
The gpfs_iopen64() subroutine is not available.
EPERM
The caller does not have superuser privileges.
ESTALE
Cached file system information was not valid.
GPFS_E_INVAL_FSSNAPHANDLE
The file system snapshot handle is not valid.
GPFS_E_INVAL_IATTR
The iattr structure was corrupted.
Note: gpfs_iopen64() calls the standard library subroutines dup(), open(), and malloc(); if one of these
called subroutines returns an error, gpfs_iopen64() also returns that error.
Examples
Location
| Library
| Synopsis
| #include <gpfs.h>
| int GPFS_API
| gpfs_iputattrsx(gpfs_ifile_t *ifile,
| int flags,
| void *buffer,
| const char *pathName);
| Description
| The gpfs_iputattrsx() subroutine, together with gpfs_igetattrsx(), is intended for use by a backup
| program to save (gpfs_igetattrsx()) and restore (gpfs_iputattrsx()) all of the extended attributes of a file.
| This subroutine also sets the storage pool for the file and sets data replication to the values that are saved
| in the extended attributes.
| This subroutine can optionally invoke the policy engine to match a RESTORE rule using the file's
| attributes saved in the extended attributes to set the file's storage pool and data replication as when
| calling gpfs_fputattrswithpathname. When used with the policy engine, the caller should include the full
| path to the file, including the file name, to allow rule selection based on file name or path.
| By default, the routine will not use RESTORE policy rules for data placement. The pathName
| paramenter will be ignored and may be set to NULL.
| If the call does not use RESTORE policy rules, or if the file fails to match a RESTORE rule, or if there
| are not RESTORE rules installed, then the storage pool and data replication are selected as when calling
| gpfs_fputattrs().
| The buffer passed in should contain extended attribute data that was obtained by a previous call to
| gpfs_fgetattrs.
| Note: This call will restore extended attributes used for the Data Storage Management (XDSM) API (also
| known as DMAPI) if they are present in the buffer.
| Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
| v libgpfs.a for AIX
| v libgpfs.so for Linux
| Parameters
| ifile A pointer to gpfs_ifile_t from gpfs_iopen.
| flags Flags must have one of these three values:
| GPFS_ATTRFLAG_NO_PLACEMENT
| File attributes are restored, but the storage pool and data replication are unchanged.
| GPFS_ATTRFLAG_IGNORE_POOL
| File attributes are restored, but the storage pool and data replication are selected by
| matching the saved attributes to a placement rule instead of restoring the saved storage
| pool.
| Note: pathName is a UTF-8 encoded string. On Windows, applications can convert UTF-16
| (Unicode) to UTF-8 using the platform's WideCharToMultiByte function.
| Exit status
| If the gpfs_iputattrsx() subroutine is unsuccessful, it returns a value of -1 and sets the global error
| variable errno to indicate the nature of the error.
| Exceptions
| None.
| Error status
| EINVAL
| The buffer pointed to by buffer does not contain valid attribute data, or invalid flags were
| provided.
| ENOSYS
| The gpfs_iputattrsx() subroutine is not supported under the current file system format.
| EPERM
| The caller of the subroutine must have superuser privilege.
| ESTALE
| The cached fs information was not valid.
| GPFS_E_INVAL_IFILE
| The ifile paramenters provided were not valid.
| Location
Library
Synopsis
#include <gpfs.h>
int gpfs_iread(gpfs_ifile_t *ifile,
void *buffer,
int bufferSize,
gpfs_off64_t *offset);
Description
The gpfs_iread() subroutine reads data from the file indicated by the ifile parameter returned from
gpfs_iopen(). This subroutine reads data beginning at parameter offset and continuing for bufferSize
bytes into the buffer specified by buffer. If successful, the subroutine returns a value that is the length of
the data read, and sets parameter offset to the offset of the next byte to be read. A return value of 0
indicates end-of-file.
For an overview of using gpfs_iread() in a backup application, see “Using APIs to develop backup
applications” on page 31.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
ifile Pointer to gpfs_ifile_t from gpfs_iopen().
buffer Buffer for the data to be read.
bufferSize
Size of the buffer (that is, the amount of data to be read).
offset Offset of where within the file to read. If gpfs_iread() is successful, offset is updated to the next
byte after the last one that was read.
Exit status
If the gpfs_iread() subroutine is unsuccessful, it returns a value of -1 and sets the global error variable
errno to indicate the nature of the error.
Exceptions
None.
Error status
EISDIR
The specified file is a directory.
Location
Library
Synopsis
#include <gpfs.h>
int gpfs_ireaddir(gpfs_ifile_t *idir,
const gpfs_direntx_t **dirent);
Description
The gpfs_ireaddir() subroutine returns the next directory entry in a file system. For an overview of using
gpfs_ireaddir() in a backup application, see “Using APIs to develop backup applications” on page 31.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
idir Pointer to gpfs_ifile_t from gpfs_iopen().
dirent Pointer to returned pointer to directory entry.
Exit status
If the gpfs_ireaddir() subroutine is successful, it returns a value of 0 and sets the dirent parameter to
point to the returned directory entry. If there are no more GPFS directory entries, gpfs_ireaddir() returns
a value of 0 and sets the dirent parameter to NULL.
If the gpfs_ireaddir() subroutine is unsuccessful, it returns a value of -1 and sets the global error variable
errno to indicate the nature of the error.
Exceptions
None.
Error status
ENOMEM
Unable to allocate memory for request.
ENOSYS
The gpfs_ireaddir() subroutine is not available.
ENOTDIR
File is not a directory.
EPERM
The caller does not have superuser privileges.
ESTALE
The cached file system information was not valid.
GPFS_E_INVAL_IFILE
Incorrect ifile parameter.
Chapter 9. GPFS programming interfaces 383
Examples
Location
Library
Synopsis
#include <gpfs.h>
gpfs_ireaddir64(gpfs_ifile_t *idir,
const gpfs_direntx64_t **dirent);
Description
The gpfs_ireaddir64() subroutine returns the next directory entry in a file system. For an overview of
using gpfs_ireaddir64() in a backup application, see “Using APIs to develop backup applications” on
page 31.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
idir A pointer to gpfs_ifile_t from gpfs_iopen64().
dirent A pointer to the returned pointer to the directory entry.
Exit status
If the gpfs_ireaddir64() subroutine is successful, it returns a value of 0 and sets the dirent parameter to
point to the returned directory entry. If there are no more GPFS directory entries, gpfs_ireaddir64()
returns a value of 0 and sets the dirent parameter to NULL.
If the gpfs_ireaddir64() subroutine is unsuccessful, it returns a value of -1 and sets the global error
variable errno to indicate the nature of the error.
Exceptions
None.
Error status
ENOMEM
Unable to allocate memory for request.
ENOSYS
The gpfs_ireaddir64() subroutine is not available.
ENOTDIR
File is not a directory.
EPERM
The caller does not have superuser privileges.
ESTALE
The cached file system information was not valid.
Examples
Location
Library
Synopsis
#include <gpfs.h>
int gpfs_ireadlink(gpfs_fssnap_handle_t *fssnapHandle,
gpfs_ino_t ino,
char *buffer,
int bufferSize);
Description
The gpfs_ireadlink() subroutine reads a symbolic link by inode number. Like gpfs_iopen(), it uses the
same fssnapHandle parameter that was used by the inode scan.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
fssnapHandle
File system snapshot handle.
ino inode number of the link file to read.
buffer Pointer to buffer for the returned link data.
bufferSize
Size of the buffer.
Exit status
If the gpfs_ireadlink() subroutine is unsuccessful, it returns a value of -1 and sets the global error
variable errno to indicate the nature of the error.
Exceptions
None.
Error status
EINVAL
Missing or incorrect parameter.
ENOENT
No such file or directory.
ENOMEM
Unable to allocate memory for request.
ENOSYS
The gpfs_ireadlink() subroutine is not available.
Location
Library
Synopsis
#include <gpfs.h>
gpfs_ireadlink64(gpfs_fssnap_handle_t *fssnapHandle,
gpfs_ino64_t ino,
char *buffer,
int bufferSize);
Description
The gpfs_ireadlink64() subroutine reads a symbolic link by inode number. Like gpfs_iopen64(), it uses
the same fssnapHandle parameter that was used by the inode scan.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
fssnapHandle
The file system snapshot handle.
ino The inode number of the link file to read.
buffer A pointer to buffer for the returned link data.
bufferSize
The size of the buffer.
Exit status
If the gpfs_ireadlink64() subroutine is unsuccessful, it returns a value of -1 and sets the global error
variable errno to indicate the nature of the error.
Exceptions
None.
Error status
EINVAL
Missing or incorrect parameter.
ENOENT
No such file or directory.
ENOMEM
Unable to allocate memory for the request.
ENOSYS
The gpfs_ireadlink64() subroutine is not available.
Note: gpfs_ireadlink64() calls the standard library subroutine readlink(); if this called subroutine returns
an error, gpfs_ireadlink64() also returns that error.
Location
Library
Synopsis
#include <gpfs.h>
gpfs_off64_t gpfs_ireadx(gpfs_ifile_t *ifile,
gpfs_iscan_t *iscan,
void *buffer,
int bufferSize,
gpfs_off64_t *offset,
gpfs_off64_t termOffset,
int *hole);
Description
The gpfs_ireadx() subroutine performs a block level incremental read on a file opened by gpfs_iopen()
within a given incremental scan opened using gpfs_open_inodescan().
For an overview of using gpfs_ireadx() in a backup application, see “Using APIs to develop backup
applications” on page 31.
The gpfs_ireadx() subroutines returns the data that has changed since the prev_fssnapId specified for the
inode scan. The file is scanned starting at offset and terminating at termOffset, looking for changed data.
Once changed data is located, the offset parameter is set to its location, the new data is returned in the
buffer provided, and the amount of data returned is the subroutine's value.
If the change to the data is that it has been deleted (that is, the file has been truncated), no data is
returned, but the hole parameter is returned with a value of 1, and the size of the hole is returned as the
subroutine's value. The returned size of the hole may exceed the bufferSize provided. If no changed data
was found before reaching the termOffset or the end-of-file, then the gpfs_ireadx() subroutine return
value is 0.
Block level incremental backups are not available on small files (a file size smaller than the file system
block size), directories, or if the file has been deleted. The gpfs_ireadx() subroutine can still be used, but
it returns all of the file's data, operating like the standard gpfs_iread() subroutine. However, the
gpfs_ireadx() subroutine will still identify sparse files and explicitly return information on holes in the
files, rather than returning the NULL data.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
ifile Pointer to gpfs_ifile_t returned from gpfs_iopen().
iscan Pointer to gpfs_iscan_t from gpfs_open_inodescan().
buffer Pointer to buffer for returned data, or NULL to query the next increment to be read.
bufferSize
Size of buffer for returned data.
Exit status
If the gpfs_ireadx() subroutine is successful, it returns the number of bytes read and returned in bufP, or
the size of the hole encountered in the file.
If the gpfs_ireadx() subroutine is unsuccessful, it returns a value of -1 and sets the global error variable
errno to indicate the nature of the error.
Exceptions
None.
Error status
EDOM
The file system stripe ID from the iscanId does not match the ifile's.
EINVAL
Missing or incorrect parameter.
EISDIR
The specified file is a directory.
ENOMEM
Unable to allocate memory for request.
ENOSYS
The gpfs_ireadx() subroutine is not available.
EPERM
The caller does not have superuser privileges.
ERANGE
The file system snapshot ID from the iscanId is more recent than the ifile's.
ESTALE
Cached file system information was not valid.
GPFS_E_INVAL_IFILE
Incorrect ifile parameter.
GPFS_E_INVAL_ISCAN
Incorrect iscan parameter.
Location
Library
Structure
typedef struct gpfs_iscan gpfs_iscan_t;
Description
The gpfs_iscan_t structure contains a handle for an inode scan of a GPFS file system or snapshot.
Members
gpfs_iscan
The handle for an inode scan for a GPFS file system or snapshot.
Examples
Location
Library
Synopsis
#include <gpfs.h>
int gpfs_next_inode(gpfs_iscan_t *iscan,
gpfs_ino_t termIno,
const gpfs_iattr_t **iattr);
Description
The gpfs_next_inode() subroutine obtains the next inode from the specified inode scan and sets the iattr
pointer to the inode's attributes. The termIno parameter can be used to terminate the inode scan before
the last inode in the file system or snapshot being scanned. A value of 0 may be provided to indicate the
last inode in the file system or snapshot. If there are no more inodes to be returned before the
termination inode, the gpfs_next_inode() subroutine returns a value of 0 and the inode's attribute pointer
is set to NULL.
For an overview of using gpfs_next_inode() in a backup application, see “Using APIs to develop backup
applications” on page 31.
To generate a full backup, invoke gpfs_open_inodescan() with NULL for the prev_fssnapId parameter.
Repeated invocations of gpfs_next_inode() then return inode information about all existing user files,
directories and links, in inode number order.
To generate an incremental backup, invoke gpfs_next_inode() with the fssnapId that was obtained from
a fssnapHandle at the time the previous backup was created. The snapshot that was used for the
previous backup does not need to exist at the time the incremental backup is generated. That is, the
backup application needs to remember only the fssnapId of the previous backup; the snapshot itself can
be deleted as soon as the backup is completed.
For an incremental backup, only inodes of files that have changed since the specified previous snapshot
will be returned. Any operation that changes the file's mtime or ctime is considered a change and will
cause the file to be included. Files with no changes to the file's data or file attributes, other than a change
to atime, are omitted from the scan.
Incremental backups return deleted files, but full backups do not. A deleted file is indicated by the field
ia_nlinks having a value of 0.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
iscan Pointer to the inode scan handle.
termIno
The inode scan terminates before this inode number. The caller may specify maxIno from
gpfs_open_inodescan() or zero to scan the entire inode file.
iattr Pointer to the returned pointer to the inode's iattr.
If the gpfs_next_inode() subroutine is successful, it returns a value of 0 and a pointer. The pointer points
to NULL if there are no more inodes. Otherwise, the pointer points to the returned inode's attributes.
If the gpfs_next_inode() subroutine is unsuccessful, it returns a value of -1 and sets the global error
variable errno to indicate the nature of the error.
Exceptions
None.
Error status
ENOMEM
Unable to allocate memory for request.
ENOSYS
The gpfs_next_inode() subroutine is not available.
EPERM
The caller does not have superuser privileges.
ESTALE
Cached file system information was not valid.
GPFS_E_INVAL_ISCAN
Incorrect parameters.
Examples
Location
Library
Synopsis
#include <gpfs.h>
gpfs_next_inode64(gpfs_iscan_t *iscan,
gpfs_ino64_t termIno,
const gpfs_iattr64_t **iattr);
Description
The gpfs_next_inode64() subroutine obtains the next inode from the specified inode scan and sets the
iattr pointer to the inode's attributes. The termIno parameter can be used to stop the inode scan before
the last inode in the file system or snapshot being scanned. A value of 0 can be provided to indicate the
last inode in the file system or snapshot. If there are no more inodes to be returned before the
termination inode, the gpfs_next_inode64() subroutine returns a value of 0 and the inode's attribute
pointer is set to NULL.
For an overview of using gpfs_next_inode64() in a backup application, see “Using APIs to develop
backup applications” on page 31.
To generate a full backup, invoke gpfs_open_inodescan64() with NULL for the prev_fssnapId parameter.
Repeated invocations of gpfs_next_inode64() returns inode information about all existing user files,
directories, and links in inode number order.
To generate an incremental backup, invoke gpfs_next_inode64() with the fssnapId that was obtained
from a fssnapHandle at the time the previous backup was created. The snapshot that was used for the
previous backup does not need to exist at the time the incremental backup is generated. That is, the
backup application needs to remember only the fssnapId of the previous backup; the snapshot itself can
be deleted as soon as the backup is completed.
For an incremental backup, only inodes of files that have changed since the specified previous snapshot
will be returned. Any operation that changes the file's mtime or ctime is considered a change and will
cause the file to be included. Files with no changes to the file's data or file attributes, other than a change
to atime, are omitted from the scan.
Incremental backups return deleted files, but full backups do not. A deleted file is indicated by the field
ia_nlinks having a value of 0.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
iscan A pointer to the inode scan handle.
termIno
The inode scan terminates before this inode number. The caller may specify maxIno from
gpfs_open_inodescan64() or zero to scan the entire inode file.
iattr A pointer to the returned pointer to the inode's iattr.
If the gpfs_next_inode64() subroutine is successful, it returns a value of 0 and a pointer. The pointer
points to NULL if there are no more inodes. Otherwise, the pointer points to the returned inode's
attributes.
If the gpfs_next_inode64() subroutine is unsuccessful, it returns a value of -1 and sets the global error
variable errno to indicate the nature of the error.
Exceptions
None.
Error status
ENOMEM
Unable to allocate memory for request.
ENOSYS
The gpfs_next_inode64() subroutine is not available.
EPERM
The caller does not have superuser privileges.
ESTALE
The cached file system information was not valid.
GPFS_E_INVAL_ISCAN
Incorrect parameters.
Examples
Location
Library
Synopsis
#include <gpfs.h>
gpfs_next_inode_with_xattrs(gpfs_iscan_t *iscan,
gpfs_ino_t termIno,
const gpfs_iattr_t **iattr,
const char **xattrBuf,
unsigned int *xattrBufLen);
Description
The gpfs_next_inode_with_xattrs() subroutine retrieves the next inode and its extended attributes from
the inode scan. The set of extended attributes returned are defined when the inode scan was opened. The
scan stops before the last inode that was specified or the last inode in the inode file being scanned.
The termIno parameter provides a way to partition an inode scan so it can be run on more than one
node.
The returned values for xattrBuf and xattrBufLen must be provided to gpfs_next_xattr() to obtain the
extended attribute names and values. The buffer used for the extended attributes is overwritten by
subsequent calls to gpfs_next_inode(), gpfs_seek_inode(), or gpfs_stat_inode().
The returned pointers to the extended attribute name and value will be aligned to a double-word
boundary.
Parameters
iscan A pointer to the inode scan descriptor.
termIno
The inode scan stops before this inode number. The caller can specify maxIno from
gpfs_open_inodescan() or zero to scan the entire inode file.
iattr A pointer to the returned pointer to the file's iattr.
xattrBuf
A pointer to the returned pointer to the xiattr buffer.
xattrBufLen
The returned length of the xiattr buffer.
Exit status
None.
Error status
EFAULT
The buffer data was overwritten.
ENOMEM
The buffer is too small, unable to allocate memory for request.
ENOSYS
The gpfs_next_inode_with_xattrs() subroutine is not available.
EPERM
The caller does not have superuser privileges.
ESTALE
The cached file system information was not valid.
GPFS_E_INVAL_ISCAN
Incorrect parameters.
GPFS_E_INVAL_XATTR
Incorrect parameters.
Examples
Location
Library
Synopsis
#include <gpfs.h>
gpfs_next_inode_with_xattrs64(gpfs_iscan_t *iscan,
gpfs_ino64_t termIno,
const gpfs_iattr64_t **iattr,
const char **xattrBuf,
unsigned int *xattrBufLen);
Description
The gpfs_next_inode_with_xattrs64() subroutine retrieves the next inode and its extended attributes from
the inode scan. The set of extended attributes returned are defined when the inode scan was opened. The
scan stops before the last inode that was specified or the last inode in the inode file being scanned.
The termIno parameter provides a way to partition an inode scan so it can be run on more than one
node.
The returned values for xattrBuf and xattrBufLen must be provided to gpfs_next_xattr() to obtain the
extended attribute names and values. The buffer used for the extended attributes is overwritten by
subsequent calls to gpfs_next_inode64(), gpfs_seek_inode64(), or gpfs_stat_inode64().
The returned pointers to the extended attribute name and value will be aligned to a double-word
boundary.
Parameters
iscan A pointer to the inode scan descriptor.
termIno
The inode scan stops before this inode number. The caller can specify maxIno from
gpfs_open_inodescan64() or zero to scan the entire inode file.
iattr A pointer to the returned pointer to the file's iattr.
xattrBuf
A pointer to the returned pointer to the xiattr buffer.
xattrBufLen
The returned length of the xiattr buffer.
Exit status
None.
Error status
EFAULT
The buffer data was overwritten.
ENOMEM
Unable to allocate memory for request.
ENOSYS
The gpfs_next_inode_with_xattrs64() subroutine is not available.
EPERM
The caller does not have superuser privileges.
ESTALE
The cached file system information was not valid.
GPFS_E_INVAL_ISCAN
Incorrect parameters.
GPFS_E_INVAL_XATTR
Incorrect parameters.
Examples
Location
Library
Synopsis
#include <gpfs.h>
gpfs_next_xattr(gpfs_iscan_t *iscan,
const char **xattrBuf,
unsigned int *xattrBufLen,
const char **name,
unsigned int *valueLen,
const char **value);
Description
The gpfs_next_xattr() subroutine iterates over the extended attributes buffer returned by the
gpfs_next_inode_with_xattrs() or gpfs_next_inode_with_xattrs64() subroutine to return the individual
attributes and their values. The attribute names are null-terminated strings, whereas the attribute value
contains binary data.
Note: The caller is not allowed to modify the returned attribute names or values. The data returned by
gpfs_next_xattr() might be overwritten by subsequent calls to gpfs_next_xattr() or other GPFS library
calls.
Parameters
iscan A pointer to the inode descriptor.
xattrBuf
A pointer to the pointer to the attribute buffer.
xattrBufLen
A pointer to the attribute buffer length.
name A pointer to the attribute name.
valueLen
A pointer to the length of the attribute value.
value A pointer to the attribute value.
Exit status
If the gpfs_next_xattr() subroutine is successful, it returns a value of 0 and a pointer to the attribute
name. It also sets:
v The valueLen parameter to the length of the attribute value
v The value parameter to point to the attribute value
v The xattrBufLen parameter to the remaining length of buffer
v The xattrBuf parameter to index the next attribute in buffer
If the gpfs_next_xattr() subroutine is successful, but there are no more attributes in the buffer, it returns a
value of 0 and the attribute name is set to NULL. It also sets:
v The valueLen parameter to 0
v The value parameter to NULL
If the gpfs_next_xattr() subroutine is unsuccessful, it returns a value of -1 and sets the global error
variable errno to indicate the nature of the error.
Exceptions
None.
Error status
EINVAL
Incorrect parameters.
ENOSYS
The gpfs_next_xattr() subroutine is not available.
Examples
Location
Library
Structure
typedef struct
{
int acl_buffer_len;
unsigned short acl_version;
unsigned char acl_type;
char acl_var_data[1];
} gpfs_opaque_acl_t;
Description
The gpfs_opaque_acl_t structure contains size, version, and ACL type information for the gpfs_getacl()
and gpfs_putacl() subroutines.
Members
acl_buffer_len
On input, this field must be set to the total length, in bytes, of the data structure being passed to
GPFS. On output, this field contains the actual size of the requested information. If the initial size
of the buffer is not large enough to contain all of the information, the gpfs_getacl() invocation
must be repeated with a larger buffer.
acl_version
This field contains the current version of the GPFS internal representation of the ACL. On input
to the gpfs_getacl() subroutine, set this field to zero.
acl_type
On input to the gpfs_getacl() subroutine, set this field to either GPFS_ACL_TYPE_ACCESS or
GPFS_ACL_TYPE_DEFAULT, depending on which ACL is requested. These constants are
defined in the gpfs.h header file.
acl_var_data
This field signifies the beginning of the remainder of the ACL information.
Location
Library
Synopsis
#include <gpfs.h>
gpfs_iscan_t *gpfs_open_inodescan
(gpfs_fssnap_handle_t *fssnapHandle,
const gpfs_fssnap_id_t *prev_fssnapId,
gpfs_ino_t *maxIno);
Description
The gpfs_open_inodescan() subroutine opens a scan of the inodes in the file system or snapshot
identified by the fssnapHandle parameter. The scan traverses all user files, directories and links in the
file system or snapshot. The scan begins with the user file with the lowest inode number and returns the
files in increasing order. The gpfs_seek_inode() subroutine may be used to set the scan position to an
arbitrary inode. System files, such as the block allocation maps, are omitted from the scan. The file
system must be mounted to open an inode scan.
For an overview of using gpfs_open_inodescan() in a backup application, see “Using APIs to develop
backup applications” on page 31.
To generate a full backup, invoke gpfs_open_inodescan() with NULL for the prev_fssnapId parameter.
Repeated invocations of gpfs_next_inode() then return inode information about all existing user files,
directories and links, in inode number order.
To generate an incremental backup, invoke gpfs_open_inodescan() with the fssnapId that was obtained
from a fssnapHandle at the time the previous backup was created. The snapshot that was used for the
previous backup does not need to exist at the time the incremental backup is generated. That is, the
backup application needs to remember only the fssnapId of the previous backup; the snapshot itself can
be deleted as soon as the backup is completed.
For the incremental backup, any operation that changes the file's mtime or ctime causes the file to be
included. Files with no changes to the file's data or file attributes, other than a change to atime, are
omitted from the scan.
A full inode scan (prev_fssnapId set to NULL) does not return any inodes of nonexistent or deleted files,
but an incremental inode scan (prev_fssnapId not NULL) does return inodes for files that have been
deleted since the previous snapshot. The inodes of deleted files have a link count of zero.
If the snapshot indicated by prev_fssnapId is available, the caller may benefit from the extended read
subroutine, gpfs_ireadx(), which returns only the changed blocks within the files. Without the previous
snapshot, all blocks within the changed files are returned.
Once a full or incremental backup completes, the new_fssnapId must be saved in order to reuse it on a
subsequent incremental backup. This fssnapId must be provided to the gpfs_open_inodescan()
subroutine, as the prev_fssnapId input parameter.
Note:
1. Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
Parameters
fssnapHandle
File system snapshot handle.
prev_fssnapId
Pointer to file system snapshot ID or NULL. If prev_fssnapId is provided, the inode scan returns
only the files that have changed since the previous backup. If the pointer is NULL, the inode scan
returns all user files.
maxIno
Pointer to inode number or NULL. If provided, gpfs_open_inodescan() returns the maximum
inode number in the file system or snapshot being scanned.
Exit status
If the gpfs_open_inodescan() subroutine is unsuccessful, it returns a NULL pointer and the global error
variable errno is set to indicate the nature of the error.
Exceptions
None.
Error status
EDOM
The file system snapshot ID passed for prev_fssnapId is from a different file system.
EINVAL
Incorrect parameters.
ENOMEM
Unable to allocate memory for request.
ENOSYS
The gpfs_open_inodescan() subroutine is not available.
EPERM
The caller does not have superuser privileges.
ERANGE
The prev_fssnapId parameter is the same as or more recent than snapId being scanned.
ESTALE
Cached file system information was not valid.
GPFS_E_INVAL_FSSNAPHANDLE
The file system snapshot handle is not valid.
GPFS_E_INVAL_FSSNAPID
The file system snapshot ID passed for prev_fssnapId is not valid.
Examples
Library
Synopsis
#include <gpfs.h>
gpfs_open_inodescan64(gpfs_fssnap_handle_t *fssnapHandle,
const gpfs_fssnap_id_t *prev_fssnapId,
gpfs_ino64_t *maxIno);
Description
The gpfs_open_inodescan64() subroutine opens a scan of the inodes in the file system or snapshot
identified by the fssnapHandle parameter. The scan traverses all user files, directories and links in the
file system or snapshot. The scan begins with the user file with the lowest inode number and returns the
files in increasing order. The gpfs_seek_inode64() subroutine may be used to set the scan position to an
arbitrary inode. System files, such as the block allocation maps, are omitted from the scan. The file
system must be mounted to open an inode scan.
For an overview of using gpfs_open_inodescan64() in a backup application, see “Using APIs to develop
backup applications” on page 31.
To generate a full backup, invoke gpfs_open_inodescan64() with NULL for the prev_fssnapId parameter.
Repeated invocations of gpfs_next_inode64() returns inode information about all existing user files,
directories, and links in inode number order.
To generate an incremental backup, invoke gpfs_open_inodescan64() with the fssnapId that was
obtained from a fssnapHandle at the time the previous backup was created. The snapshot that was used
for the previous backup does not need to exist at the time the incremental backup is generated. That is,
the backup application needs to remember only the fssnapId of the previous backup; the snapshot itself
can be deleted as soon as the backup is completed.
For the incremental backup, any operation that changes the file's mtime or ctime causes the file to be
included. Files with no changes to the file's data or file attributes, other than a change to atime, are
omitted from the scan.
A full inode scan (prev_fssnapId set to NULL) does not return any inodes of nonexistent or deleted files,
but an incremental inode scan (prev_fssnapId not NULL) does return inodes for files that have been
deleted since the previous snapshot. The inodes of deleted files have a link count of zero.
If the snapshot indicated by prev_fssnapId is available, the caller may benefit from the extended read
subroutine, gpfs_ireadx(), which returns only the changed blocks within the files. Without the previous
snapshot, all blocks within the changed files are returned.
Once a full or incremental backup completes, the new_fssnapId must be saved in order to reuse it on a
subsequent incremental backup. This fssnapId must be provided to the gpfs_open_inodescan64()
subroutine, as the prev_fssnapId input parameter.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Exit status
If the gpfs_open_inodescan64() subroutine is unsuccessful, it returns a NULL pointer and the global error
variable errno is set to indicate the nature of the error.
Exceptions
None.
Error status
EDOM
The file system snapshot ID passed for prev_fssnapId is from a different file system.
EINVAL
Incorrect parameters.
ENOMEM
Unable to allocate memory for request.
ENOSYS
The gpfs_open_inodescan64() subroutine is not available.
EPERM
The caller does not have superuser privileges.
ERANGE
The prev_fssnapId parameter is the same as or more recent than snapId being scanned.
ESTALE
The cached file system information was not valid.
GPFS_E_INVAL_FSSNAPHANDLE
The file system snapshot handle is not valid.
GPFS_E_INVAL_FSSNAPID
The file system snapshot ID passed for prev_fssnapId is not valid.
Note: gpfs_open_inodescan64() calls the standard library subroutines dup() and malloc(); if one of these
called subroutines returns an error, gpfs_open_inodescan64() also returns that error.
Examples
Library
Synopsis
#include <gpfs.h>
gpfs_open_inodescan_with_xattrs(gpfs_fssnap_handle_t *fssnapHandle,
const gpfs_fssnap_id_t *prev_fssnapId,
int nxAttrs,
const char *xattrsList[],
gpfs_ino_t *maxIno);
Description
The gpfs_open_inodescan_with_xattrs() subroutine opens an inode file and extended attributes for an
inode scan identified by the fssnapHandle parameter. The scan traverses all user files, directories and
links in the file system or snapshot. The scan begins with the user file with the lowest inode number and
returns the files in increasing order. The gpfs_seek_inode() subroutine can be used to set the scan
position to an arbitrary inode. System files, such as the block allocation maps, are omitted from the scan.
The file system must be mounted to open an inode scan.
To generate a full backup, invoke gpfs_open_inodescan_with_xattrs() with NULL for the prev_fssnapId
parameter. Repeated invocations of gpfs_next_inode() returns inode information about all existing user
files, directories, and links in inode number order.
For the incremental backup, any operation that changes the file's mtime or ctime causes the file to be
included. Files with no changes to the file's data or file attributes, other than a change to atime, are
omitted from the scan.
A full inode scan (prev_fssnapId set to NULL) returns all inodes of existing files. An incremental inode
scan (prev_fssnapId not NULL) returns inodes for files that have changed since the previous snapshot.
The inodes of deleted files have a link count of zero.
If the snapshot indicated by prev_fssnapId is available, the caller may benefit from the extended read
subroutine, gpfs_ireadx(), which returns only the changed blocks within the files. Without the previous
snapshot, all blocks within the changed files are returned.
Once a full or incremental backup completes, the new_fssnapId must be saved in order to reuse it on a
subsequent incremental backup. This fssnapId must be provided to the
gpfs_open_inodescan_with_xattrs() subroutine, as the prev_fssnapId input parameter.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
Parameters
fssnapHandle
The file system snapshot handle.
prev_fssnapId
A pointer to file system snapshot ID or NULL. If prev_fssnapId is provided, the inode scan
returns only the files that have changed since the previous backup. If the pointer is NULL, the
inode scan returns all user files.
nxAttrs
The count of extended attributes to be returned. If nxAttrs is set to 0, call returns no extended
attributes, like gpfs_open_inodescan(). If nxAttrs is set to -1, call returns all extended attributes
xattrsList
A pointer to an array of pointers to names of extended attributes to be returned. nxAttrsList may
be null if nxAttrs is set to 0 or -1.
maxIno
A pointer to inode number or NULL. If provided, gpfs_open_inodescan_with_xattrs() returns the
maximum inode number in the file system or snapshot being scanned.
Exit status
Exceptions
None.
Error status
EDOM
The file system snapshot ID passed for prev_fssnapId is from a different file system.
EINVAL
Incorrect parameters.
ENOMEM
Unable to allocate memory for request.
ENOSYS
The gpfs_open_inodescan_with_xattrs() subroutine is not available.
EPERM
The caller does not have superuser privileges.
ERANGE
The prev_fssnapId parameter is the same as or more recent than snapId being scanned.
ESTALE
The cached file system information was not valid.
GPFS_E_INVAL_FSSNAPHANDLE
The file system snapshot handle is not valid.
GPFS_E_INVAL_FSSNAPID
The file system snapshot ID passed for prev_fssnapId is not valid.
Examples
Location
Library
Synopsis
#include <gpfs.h>
gpfs_open_inodescan_with_xattrs64(gpfs_fssnap_handle_t *fssnapHandle,
const gpfs_fssnap_id_t *prev_fssnapId,
int nxAttrs,
const char *xattrList[],
gpfs_ino64_t *maxIno);
Description
The gpfs_open_inodescan_with_xattrs64() subroutine opens an inode file and extended attributes for an
inode scan identified by the fssnapHandle parameter. The scan traverses all user files, directories and
links in the file system or snapshot. The scan begins with the user file with the lowest inode number and
returns the files in increasing order. The gpfs_seek_inode64() subroutine may be used to set the scan
position to an arbitrary inode. System files, such as the block allocation maps, are omitted from the scan.
The file system must be mounted to open an inode scan.
For the incremental backup, any operation that changes the file's mtime or ctime causes the file to be
included. Files with no changes to the file's data or file attributes, other than a change to atime, are
omitted from the scan.
A full inode scan (prev_fssnapId set to NULL) returns all inodes of existing files. An incremental inode
scan (prev_fssnapId not NULL) returns inodes for files that have changed since the previous snapshot.
The inodes of deleted files have a link count of zero.
If the snapshot indicated by prev_fssnapId is available, the caller may benefit from the extended read
subroutine, gpfs_ireadx(), which returns only the changed blocks within the files. Without the previous
snapshot, all blocks within the changed files are returned.
Once a full or incremental backup completes, the new_fssnapId must be saved in order to reuse it on a
subsequent incremental backup. This fssnapId must be provided to the
gpfs_open_inodescan_with_xattrs64() subroutine, as the prev_fssnapId input parameter.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
Parameters
fssnapHandle
The file system snapshot handle.
prev_fssnapId
A pointer to file system snapshot ID or NULL. If prev_fssnapId is provided, the inode scan
returns only the files that have changed since the previous backup. If the pointer is NULL, the
inode scan returns all user files.
nxAttrs
The count of extended attributes to be returned. If nxAttrs is set to 0, call returns no extended
attributes, like gpfs_open_inodescan64(). If nxAttrs is set to -1, call returns all extended attributes
xattrsList
A pointer to an array of pointers to names of extended attributes to be returned. nxAttrsList may
be null if nxAttrs is set to 0 or -1.
maxIno
A pointer to inode number or NULL. If provided, gpfs_open_inodescan_with_xattrs64() returns
the maximum inode number in the file system or snapshot being scanned.
Exit status
Exceptions
None.
Error status
EDOM
The file system snapshot ID passed for prev_fssnapId is from a different file system.
EINVAL
Incorrect parameters.
ENOMEM
Unable to allocate memory for request.
ENOSYS
The gpfs_open_inodescan_with_xattrs64() subroutine is not available.
EPERM
The caller does not have superuser privileges.
ERANGE
The prev_fssnapId parameter is the same as or more recent than snapId being scanned.
ESTALE
The cached file system information was not valid.
GPFS_E_INVAL_FSSNAPHANDLE
The file system snapshot handle is not valid.
GPFS_E_INVAL_FSSNAPID
The file system snapshot ID passed for prev_fssnapId is not valid.
Examples
Location
Library
Synopsis
#include <gpfs.h>
int gpfs_prealloc(int fileDesc, offset_t StartOffset,
offset_t BytesToPrealloc)
Description
The gpfs_prealloc() subroutine is used to preallocate disk storage for a file that has already been opened,
prior to writing data to the file. The pre-allocated disk storage is started at the requested offset,
StartOffset, and covers at least the number of bytes requested, BytesToPrealloc. Allocations are rounded
to GPFS sub-block boundaries.
Pre-allocating disk space for a file provides an efficient method for allocating storage without having to
write any data. This can result in faster I/O compared to a file which gains disk space incrementally as it
grows.
Existing data in the file is not modified. Reading any of the pre-allocated blocks returns zeroes.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
fileDesc
An integer specifying the file descriptor returned by open().
The file designated for preallocation must be opened for writing.
StartOffset
The byte offset into the file at which to begin pre-allocation.
BytesToPrealloc
The number of bytes to be pre-allocated.
Exit status
If the gpfs_prealloc() subroutine is unsuccessful, it returns a value of -1 and sets the global error variable
errno to indicate the nature of the error. If errno is set to one of the following, some storage may have
been pre-allocated:
v EDQUOT
v ENOSPC
The only way to tell how much space was actually pre-allocated is to invoke the stat() subroutine and
compare the reported file size and number of blocks used with their values prior to preallocation.
None.
Error status
EACCES
The file is not opened for writing.
EBADF
The file descriptor is not valid.
EDQUOT
A disk quota has been exceeded
EINVAL
The file descriptor does not refer to a GPFS file or a regular file; a negative value was specified
for StartOffset or BytesToPrealloc.
ENOSPC
The file system has run out of disk space.
ENOSYS
The gpfs_prealloc() subroutine is not supported under the current file system format.
Examples
#include <stdio.h>
#include <fcntl.h>
#include <errno.h>
#include <gpfs.h>
int rc;
int fileHandle = -1;
char* fileNameP = "datafile";
offset_t startOffset = 0;
offset_t bytesToAllocate = 20*1024*1024; /* 20 MB */
Location
Library
Synopsis
#include <gpfs.h>
int gpfs_putacl(char *pathname, int flags, void *aclP)
Description
The gpfs_putacl() subroutine together with the gpfs_getacl() subroutine is intended for use by a backup
program to save (gpfs_getacl()) and restore (gpfs_putacl()) the ACL information for the file.
Note:
1. The use of gpfs_fgetattrs() and gpfs_fputattrs() is preferred.
2. You must have write access to the file.
3. Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
pathname
Path name of the file for which the ACLs is to be set.
flags Consists of one of these values:
0 Indicates that the aclP parameter is to be mapped with the gpfs_opaque_acl_t struct.
The gpfs_opaque_acl_t struct should be used by backup and restore programs.
GPFS_PUTACL_STRUCT
Indicates that the aclP parameter is to be mapped with the gpfs_acl_t struct.
The gpfs_acl_t struct is provided for applications that need to change the ACL.
aclP Pointer to a buffer mapped by the structure gpfs_opaque_acl_t or gpfs_acl_t, depending on the
value of flags.
This is where the ACL data is stored, and should be the result of a previous invocation of
gpfs_getacl().
Exit status
If the gpfs_putacl() subroutine is unsuccessful, it returns a value of -1 and sets the global error variable
errno to indicate the nature of the error.
Exceptions
None.
Location
/usr/lpp/mmfs/lib/libgpfs.a for AIX, /usr/lpp/mmfs/lib/libgpfs.so for Linux
Library
Synopsis
#include <gpfs.h>
int gpfs_quotactl(char *pathname, int cmd, int id, void *bufferP);
Description
The gpfs_quotactl() subroutine manipulates disk quotas. It enables, disables, and manipulates disk quotas
for file systems on which quotas have been enabled.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
pathname
Specifies the path name of any file within the mounted file system to which the quota control
command is to applied.
cmd Specifies the quota control command to be applied and whether it is applied to a user, group, or
fileset quota.
The cmd parameter can be constructed using GPFS_QCMD(qcmd, Type) contained in gpfs.h.
The qcmd parameter specifies the quota control command. The Type parameter specifies one of
the following quota types:
v user (GPFS_USRQUOTA)
v group (GPFS_GRPQUOTA)
v fileset (GPFS_FILESETQUOTA)
The valid values for the qcmd parameter specified in gpfs.h are:
Q_QUOTAON
Enables quotas.
Enables disk quotas for the file system specified by the pathname parameter and type
specified in Type. The id and bufferP parameters are unused. Root user authority is
required to enable quotas.
Q_QUOTAOFF
Disables quotas.
Disables disk quotas for the file system specified by the pathname parameter and type
specified in Type. The id and bufferP parameters are unused. Root user authority is
required to disable quotas.
Q_GETQUOTA
Gets quota limits and usage information.
Retrieves quota limits and current usage for a user, group, or fileset specified by the id
parameter. The bufferP parameter points to a gpfs_quotaInfo_t structure to hold the
returned information. The gpfs_quotaInfo_t structure is defined in gpfs.h.
Exit status
If the gpfs_quotactl() subroutine is unsuccessful, it returns a value of -1 and sets the global error variable
errno to indicate the nature of the error.
Exceptions
None.
Error status
EACCES
Search permission is denied for a component of a path prefix.
EFAULT
An invalid bufferP parameter is supplied. The associated structure could not be copied in or out
of the kernel.
EINVAL
One of the following errors:
v The file system is not mounted.
v Invalid command or quota type.
v Invalid input limits: negative limits or soft limits are greater than hard limits.
ENOENT
No such file or directory.
Location
Library
Structure
typedef struct gpfs_quotaInfo
{
gpfs_off64_t blockUsage; /* current block count */
gpfs_off64_t blockHardLimit; /* absolute limit on disk blks alloc */
gpfs_off64_t blockSoftLimit; /* preferred limit on disk blks */
gpfs_off64_t blockInDoubt; /* distributed shares + "lost" usage for blks */
int inodeUsage; /* current # allocated inodes */
int inodeHardLimit; /* absolute limit on allocated inodes */
int inodeSoftLimit; /* preferred inode limit */
int inodeInDoubt; /* distributed shares + "lost" usage for inodes */
gpfs_uid_t quoId; /* uid, gid or fileset id
int entryType; /* entry type, not used */
unsigned int blockGraceTime; /* time limit for excessive disk use */
unsigned int inodeGraceTime; /* time limit for excessive inode use */
} gpfs_quotaInfo_t;
Description
The gpfs_quotaInfo_t structure contains detailed information for the gpfs_quotactl() subroutine.
Members
blockUsage
The current block count in 1 KB units.
blockHardLimit
The absolute limit on disk block allocation.
blockSoftLimit
The preferred limit on disk block allocation.
blockInDoubt
The distributed shares and block usage that have not been not accounted for.
inodeUsage
The current number of allocated inodes.
inodeHardLimit
The absolute limit on allocated inodes.
inodeSoftLimit
The preferred inode limit.
inodeInDoubt
The distributed inode share and inode usage that have not been accounted for.
quoId The user ID, group ID, or fileset ID.
entryType
Not used
blockGraceTime
The time limit (in seconds since the Epoch) for excessive disk use.
Location
Library
Synopsis
#include <gpfs.h>
int gpfs_seek_inode(gpfs_iscan_t *iscan,
gpfs_ino_t ino);
Description
The gpfs_seek_inode() subroutine advances an inode scan to the specified inode number.
The gpfs_seek_inode() subroutine is used to start an inode scan at some place other than the beginning
of the inode file. This is useful to restart a partially completed backup or an interrupted dump transfer to
a mirror. It could also be used to do an inode scan in parallel from multiple nodes, by partitioning the
inode number space into separate ranges for each participating node. The maximum inode number is
returned when the scan was opened and each invocation to obtain the next inode specifies a termination
inode number to avoid returning the same inode more than once.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
iscan Pointer to the inode scan handle.
ino The next inode number to be scanned.
Exit status
If the gpfs_seek_inode() subroutine is unsuccessful, it returns a value of -1 and sets the global error
variable errno to indicate the nature of the error.
Exceptions
None.
Error status
ENOSYS
The gpfs_seek_inode() subroutine is not available.
GPFS_E_INVAL_ISCAN
Incorrect parameters.
Examples
Library
Synopsis
#include <gpfs.h>
gpfs_seek_inode64(gpfs_iscan_t *iscan,
gpfs_ino64_t ino);
Description
The gpfs_seek_inode64() subroutine advances an inode scan to the specified inode number.
The gpfs_seek_inode64() subroutine is used to start an inode scan at some place other than the beginning
of the inode file. This is useful to restart a partially completed backup or an interrupted dump transfer to
a mirror. It could also be used to do an inode scan in parallel from multiple nodes, by partitioning the
inode number space into separate ranges for each participating node. The maximum inode number is
returned when the scan was opened and each invocation to obtain the next inode specifies a termination
inode number to avoid returning the same inode more than once.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
iscan A pointer to the inode scan handle.
ino The next inode number to be scanned.
Exit status
If the gpfs_seek_inode64() subroutine is unsuccessful, it returns a value of -1 and sets the global error
variable errno to indicate the nature of the error.
Exceptions
None.
Error status
ENOSYS
The gpfs_seek_inode64() subroutine is not available.
GPFS_E_INVAL_ISCAN
Incorrect parameters.
Examples
Library
Synopsis
#include <gpfs.h>
int gpfs_stat(char *pathName, struct stat64 *Buffer)
Description
The gpfs_stat() subroutine is used to obtain exact information about the file named by the pathName
parameter. This subroutine is provided as an alternative to the stat() subroutine, which may not provide
exact mtime and atime values. See “Exceptions to Open Group technical standards” on page 467.
read, write, or execute permission for the named file is not required, but all directories listed in the path
leading to the file must be searchable. The file information is written to the area specified by the Buffer
parameter.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
pathName
The path identifying the file for which exact status information is requested.
Buffer A pointer to the stat64 structure in which the information is returned. The stat64 structure is
described in the sys/stat.h file.
Exit status
If the gpfs_stat() subroutine is unsuccessful, it returns a value of -1 and sets the global error variable
errno to indicate the nature of the error.
Exceptions
None.
Error status
EINVAL
The path name does not refer to a GPFS file or a regular file.
ENOENT
The file does not exist.
ENOSYS
The gpfs_stat() subroutine is not supported under the current file system format.
ESTALE
The cached file system information was not valid.
Library
Synopsis
#include <gpfs.h>
gpfs_stat_inode(gpfs_iscan_t *iscan,
gpfs_ino_t ino,
gpfs_ino_t termIno,
const gpfs_iattr_t **iattr);
Description
The gpfs_stat_inode() subroutine is used to seek the specified inode and to retrieve that inode and its
extended attributes from the inode scan. This subroutine combines gpfs_seek_inode and get_next_inode,
but will only return the specified inode.
The termIno parameter provides a way to partition an inode scan so it can be run on more than one
node. It is only used by this call to control prefetching.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
iscan A pointer to an inode scan descriptor.
ino The inode number to be returned.
termIno
Prefetches inodes up to this inode. The caller might specify maxIno from gpfs_open_inodescan()
or 0 to allow prefetching over the entire inode file.
iattr A pointer to the returned pointer to the file's iattr.
Exit status
If the gpfs_stat_inode() subroutine is successful, it returns a value of 0 and the iattr parameter is set to
point to gpfs_iattr_t. If the gpfs_stat_inode() subroutine is successful, but there are no more inodes
before the termIno parameter, or if the requested inode does not exist, it returns a value of 0 and the iattr
parameter is set to NULL.
If the gpfs_stat_inode() subroutine is unsuccessful, it returns a value of -1 and sets the global error
variable errno to indicate the nature of the error.
Exceptions
None.
Location
Library
Synopsis
#include <gpfs.h>
gpfs_stat_inode64(gpfs_iscan_t *iscan,
gpfs_ino64_t ino,
gpfs_ino64_t termIno,
const gpfs_iattr64_t **iattr);
Description
The gpfs_stat_inode64() subroutine is used to seek the specified inode and to retrieve that inode and its
extended attributes from the inode scan. This subroutine combines gpfs_seek_inode64() and
get_next_inode64(), but will only return the specified inode.
The termIno parameter provides a way to partition an inode scan so it can be run on more than one
node. It is only used by this call to control prefetching.
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
iscan A pointer to an inode scan descriptor.
ino The inode number to be returned.
termIno
Prefetches inodes up to this inode. The caller might specify maxIno from gpfs_open_inodescan()
or 0 to allow prefetching over the entire inode file.
iattr A pointer to the returned pointer to the file's iattr.
Exit status
If the gpfs_stat_inode64() subroutine is successful, it returns a value of 0 and the iattr parameter is set to
point to gpfs_iattr_t.
If the gpfs_stat_inode64() subroutine is unsuccessful, it returns a value of -1 and sets the global error
variable errno to indicate the nature of the error.
Exceptions
None.
Location
Library
Synopsis
#include <gpfs.h>
gpfs_stat_inode_with_xattrs(gpfs_iscan_t *iscan,
gpfs_ino_t ino,
gpfs_ino_t termIno,
const gpfs_iattr_t **iattr,
const char **xattrBuf,
unsigned int *xattrBufLen);
Description
The gpfs_stat_inode_with_xattrs() subroutine is used to seek the specified inode and to retrieve that
inode and its extended attributes from the inode scan. This subroutine combines gpfs_seek_inode and
get_next_inode, but will only return the specified inode.
The termIno parameter provides a way to partition an inode scan such that it can be run on more than
one node. It is only used by this call to control prefetching.
The returned values for xattrBuf and xattrBufLen must be provided to gpfs_next_xattr() to obtain the
extended attribute names and values. The buffer used for the extended attributes is overwritten by
subsequent calls to gpfs_next_inode(), gpfs_seek_inode(), or gpfs_stat_inode_with_xattrs().
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
iscan A pointer to an inode scan descriptor.
ino The inode number to be returned.
termIno
Prefetches inodes up to this inode. The caller might specify maxIno from gpfs_open_inodescan()
or 0 to allow prefetching over the entire inode file.
iattr A pointer to the returned pointer to the file's iattr.
xattrBuf
A pointer to the returned pointer to the xattr buffer.
xattrBufLen
The returned length of the xattr buffer.
If the gpfs_stat_inode_with_xattrs() subroutine is unsuccessful, it returns a value of -1 and sets the global
error variable errno to indicate the nature of the error.
Exceptions
None.
Error status
EPERM
The caller must have superuser privilege.
ENOSYS
The gpfs_stat_inode_with_xattrs() subroutine is not supported under the current file system
format.
ESTALE
The cached file system information was not valid.
ENOMEM
The buffer is too small.
GPFS_E_INVAL_ISCAN
Incorrect parameters.
Location
Library
Synopsis
#include <gpfs.h>
gpfs_stat_inode_with_xattrs64(gpfs_iscan_t *iscan,
gpfs_ino64_t ino,
gpfs_ino64_t termIno,
const gpfs_iattr64_t **iattr,
const char **xattrBuf,
unsigned int *xattrBufLen);
Description
The gpfs_stat_inode_with_xattrs64() subroutine is used to seek the specified inode and to retrieve that
inode and its extended attributes from the inode scan. This subroutine combines gpfs_seek_inode64()
and get_next_inode64(), but will only return the specified inode.
The termIno parameter provides a way to partition an inode scan so it can be run on more than one
node. It is only used by this call to control prefetching.
The returned values for xattrBuf and xattrBufLen must be provided to gpfs_next_xattr() to obtain the
extended attribute names and values. The buffer used for the extended attributes is overwritten by
subsequent calls to gpfs_next_inode64(), gpfs_seek_inode64(), or gpfs_stat_inode_with_xattrs64().
Note: Compile any program that uses this subroutine with the -lgpfs flag from the following library:
v libgpfs.a for AIX
v libgpfs.so for Linux
Parameters
iscan A pointer to an inode scan descriptor.
ino The inode number to be returned.
termIno
Prefetches inodes up to this inode. The caller might specify maxIno from
gpfs_open_inodescan64() or 0 to allow prefetching over the entire inode file.
iattr A pointer to the returned pointer to the file's iattr.
xattrBuf
A pointer to the returned pointer to the xattr buffer.
xattrBufLen
The returned length of the xattr buffer.
Exit status
Exceptions
None.
Error status
EPERM
The caller must have superuser privilege.
ENOSYS
The gpfs_stat_inode_with_xattrs64() subroutine is not supported under the current file system
format.
ESTALE
The cached file system information was not valid.
ENOMEM
The buffer is too small.
GPFS_E_INVAL_ISCAN
Incorrect parameters.
Location
Library
Structure
typedef struct
{
int structLen;
int structType;
offset_t start;
offset_t length;
int isWrite;
char padding[4];
} gpfsAccessRange_t;
Description
The gpfsAccessRange_t structure declares an access range within a file for an application.
The application accesses file offsets within the given range, and does not access offsets outside the range.
Violating this hint may produce worse performance than if no hint was specified.
This hint is useful in situations where a file is partitioned coarsely among several nodes. If the ranges do
not overlap, each node can specify which range of the file it accesses. This provides a performance
improvement in some cases, such as for sequential writing within a range.
Members
structLen
Length of the gpfsAccessRange_t structure.
structType
Structure identifier GPFS_ACCESS_RANGE.
start The start of the access range offset, in bytes, from beginning of file.
length Length of the access range.
0 indicates to end of file.
isWrite
0 indicates read access.
1 indicates write access.
padding[4]
Provided to make the length of the gpfsAccessRange_t structure a multiple of 8 bytes in length.
There is no need to initialize this field.
Location
Library
Structure
typedef struct
{
int structLen;
int structType;
} gpfsCancelHints_t;
Description
The gpfsCancelHints_t structure indicates to remove any hints against the open file handle.
GPFS removes any hints that may have been issued against this open file handle:
v The hint status of the file is restored to what it would have been immediately after being opened, but
does not affect the contents of the GPFS file cache. Cancelling an earlier hint that resulted in data being
removed from the GPFS file cache does not bring that data back into the cache. Data reenters the cache
only upon access by the application or by user-driven or automatic prefetching.
v Only the GPFS_MULTIPLE_ACCESS_RANGE hint has a state that might be removed by the
GPFS_CANCEL_HINTS directive.
Note: This directive cancels only the effect of other hints, not other directives.
Members
structLen
Length of the gpfsCancelHints_t structure.
structType
Structure identifier GPFS_CANCEL_HINTS.
Location
Library
Structure
typedef struct
{
int structLen;
int structType;
} gpfsClearFileCache_t;
Description
The gpfsClearFileCache_t structure indicates file access in the near future is not expected.
The application does not expect to make any further accesses to the file in the near future, so GPFS
removes any data or metadata pertaining to the file from its cache.
Multiple node applications that have finished one phase of their computation may want to use this hint
before the file is accessed in a conflicting mode from another node in a later phase. The potential
performance benefit is that GPFS can avoid later synchronous cache consistency operations.
Members
structLen
Length of the gpfsClearFileCache_t structure.
structType
Structure identifier GPFS_CLEAR_FILE_CACHE.
Location
Library
Structure
#define GPFS_MAX_DS_AGENT_NODES 2048
typedef struct
{
int structLen;
int structType;
int partitionSize;
int agentCount;
int agentNodeNumber[GPFS_MAX_DS_AGENT_NODES]
} gpfsDataShipMap_t;
Description
The gpfsDataShipMap_t structure indicates which agent nodes are to be used for data shipping.
The value for partitionSize must be a multiple of the number of bytes in a single file system block.
Members
structLen
Length of the gpfsDataShipMap_t structure.
structType
Structure identifier GPFS_DATA_SHIP_MAP.
Error status
EINVAL
Not all participating threads have provided the same agent mapping.
ENOMEM
The available data space in memory is not large enough to allocate the data structures necessary
to run in data shipping mode.
EPERM
An attempt to open a file in data shipping mode that is already open in write mode by some
thread that did not issue the GPFS_DATA_SHIP_START directive.
ESTALE
A node in the data shipping collective has gone down.
Location
Library
Structure
typedef struct
{
int structLen;
int structType;
int numInstances;
int reserved;
} gpfsDataShipStart_t;
Description
Once all participating threads have issued this directive for a file, GPFS enters a mode where it logically
partitions the blocks of the file among a group of agent nodes. The agents are those nodes on which one
or more threads have issued the GPFS_DATA_SHIP_START directive. Each thread that has issued a
GPFS_DATA_SHIP_START directive and the associated agent nodes are referred to as the data shipping
collective.
Members
structLen
Length of the gpfsDataShipStart_t structure.
structType
Structure identifier GPFS_DATA_SHIP_START
numInstances
The number of open file instances, on all nodes, collaborating to operate on the file.
reserved
This field is currently not used.
For compatibility with future versions of GPFS, set this field to zero.
Recovery
Since GPFS_DATA_SHIP_START directives block their invoking threads until all participants respond
accordingly, there needs to be a way to recover if the application program uses the wrong value for
numInstances or one of the participating nodes crashes before issuing its GPFS_DATA_SHIP_START
directive. While a gpfs_fcntl() subroutine is blocked waiting for other threads, the subroutine can be
interrupted by any signal. If a signal is delivered to any of the waiting subroutines, all waiting subroutine
on every node are interrupted and return EINTR. GPFS does not establish data shipping if such a signal
occurs.
It is the responsibility of the application to mask off any signals that might normally occur while waiting
for another node in the data shipping collective. Several libraries use SIGALRM; the thread that makes
the gpfs_fcntl() invocation should use sigthreadmask to mask off delivery of this signal while inside the
subroutine.
Error status
EINTR
A signal was delivered to a blocked gpfs_fcntl() subroutine. All waiting subroutines, on every
node, are interrupted.
Location
Library
Structure
typedef struct
{
int structLen;
int structType;
} gpfsDataShipStop_t;
Description
Members
structLen
Length of the gpfsDataShipStop_t structure.
structType
Structure identifier GPFS_DATA_SHIP_STOP.
Error status
EIO An error occurred while flushing dirty data.
EINTR
A signal was delivered to a blocked gpfs_fcntl() subroutine. All waiting subroutines, on every
node, are interrupted.
EINVAL
An attempt has been made to issue the GPFS_DATA_SHIP_STOP directive from a node or
thread that is not part of this data shipping collective.
An attempt has been made to issue the GPFS_DATA_SHIP_STOP directive on a file that is not
in data shipping mode.
ESTALE
A node in the data shipping collective has gone down.
Location
Library
Structure
typedef struct
{
int totalLength;
int fcntlVersion;
int errorOffset;
int fcntlReserved;
} gpfsFcntlHeader_t;
Description
The gpfsFcntlHeader_t structure contains size, version, and error information for the gpfs_fcntl()
subroutine.
Members
totalLength
This field must be set to the total length, in bytes, of the data structure being passed in this
subroutine. This includes the length of the header and all hints and directives that follow the
header.
The total size of the data structure cannot exceed the value of GPFS_MAX_FCNTL_LENGTH, as
defined in the header file gpfs_fcntl.h. The current value of GPFS_MAX_FCNTL_LENGTH is 64
KB.
fcntlVersion
This field must be set to the current version number of the gpfs_fcntl() subroutine, as defined by
GPFS_FCNTL_CURRENT_VERSION in the header file gpfs_fcntl.h. The current version number
is one.
errorOffset
If an error occurs processing a system call, GPFS sets this field to the offset within the parameter
area where the error was detected.
For example,
1. An incorrect version number in the header, would cause errorOffset to be set to zero.
2. An error in the first hint following the header would set errorOffset to sizeof(header).
If no errors are found, GPFS does not alter this field.
fcntlReserved
This field is currently unused.
For compatibility with future versions of GPFS, set this field to zero.
Location
Library
Structure
typedef struct
{
int structLen;
int structType;
offset_t start;
offset_t length;
} gpfsFreeRange_t;
Description
The gpfsFreeRange_t structure undeclares an access range within a file for an application.
The application no longer accesses file offsets within the given range. GPFS flushes the data at the file
offsets and removes it from the cache.
Multiple node applications that have finished one phase of their computation may want to use this hint
before the file is accessed in a conflicting mode from another node in a later phase. The potential
performance benefit is that GPFS can avoid later synchronous cache consistency operations.
Members
structLen
Length of the gpfsFreeRange_t structure.
structType
Structure identifier GPFS_FREE_RANGE.
start The start of the access range offset, in bytes, from beginning of file.
length Length of the access range.
Zero indicates to end of file.
Location
Library
Structure
typedef struct {
int structLen;
int structType;
char buffer[GPFS_FCNTL_MAX_NAME_BUFFER];
} gpfsGetFilesetName_t
Description
Members
structLen
Length of the gpfsGetFilesetName_t structure.
structType
Structure identifier GPFS_FCNTL_GET_FILESETNAME.
buffer The size of the buffer may vary, but must be a multiple of eight. Upon successful completion of
the call, the buffer contains a null-terminated character string for the name of the requested
object.
Location
Library
Structure
typedef struct {
int structLen;
int structType;
int metadataReplicas;
int maxMetadataReplicas;
int dataReplicas;
int maxDataReplicas;
int status;
int reserved;
} gpfsGetReplication_t
Description
The gpfsGetReplication_t structure is used to obtain a file's replication factors.
Members
structLen
Length of the gpfsGetReplication_t structure.
structType
Structure identifier GPFS_FCNTL_GET_REPLICATION.
metadataReplicas
Returns the current number of copies of indirect blocks for the file.
maxMetadataReplicas
Returns the maximum number of copies of indirect blocks for a file.
dataReplicas
Returns the current number of copies of the data blocks for a file.
maxDataReplicas
Returns the maximum number of copies of data blocks for a file.
status Returns the status of the file. Status values defined below.
reserved
Unused, but should be set to 0.
Error status
Location
Library
Structure
typedef struct {
int structLen;
int structType;
char buffer[GPFS_FCNTL_MAX_NAME_BUFFER];
} gpfsGetSnapshotName_t
Description
The gpfsGetSnapshotName_t structure is used to obtain a file's snapshot name. If the file is not part of a
snapshot, a zero length snapshot name will be returned.
Members
structLen
Length of the gpfsGetSnapshotName_t structure.
structType
Structure identifier GPFS_FCNTL_GET_SNAPSHOTNAME.
buffer The size of the buffer may vary, but must be a multiple of eight. Upon successful completion of
the call, the buffer contains a null-terminated character string for the name of the requested
object.
Location
Library
Structure
typedef struct {
int structLen;
int structType;
char buffer[GPFS_FCNTL_MAX_NAME_BUFFER];
} gpfsGetStoragePool_t
Description
Members
structLen
Length of the gpfsGetStoragePool_t structure.
structType
Structure identifier GPFS_FCNTL_GET_STORAGEPOOL.
buffer The size of the buffer may vary, but must be a multiple of eight. Upon successful completion of
the call, the buffer contains a null-terminated character string for the name of the requested
object.
Location
Library
Structure
typedef struct
{
offset_t blockNumber; /* data block number to access */
int start; /*start of range (from beginning of block)*/
int length; /* number of bytes in range */
int isWrite; /* 0 - READ access 1 - WRITE access */
char padding[4];
} gpfsRangeArray_t;
typedef struct
{
int structLen;
int structType;
int accRangeCnt;
int relRangeCnt;
gpfsRangeArray_t accRangeArray[GPFS_MAX_RANGE_COUNT];
gpfsRangeArray_t relRangeArray[GPFS_MAX_RANGE_COUNT];
} gpfsMultipleAccessRange_t;
Description
The gpfsMultipleAccessRange_t structure defines prefetching and write-behind access where the
application will soon access the portions of the blocks specified in accRangeArray and has finished
accessing the ranges listed in relRangeArray. The size of a block is returned in the st_blksize field of the
stat command, so the offset, OFF, of a file is in the block, OFF/st_blksize.
v Up to GPFS_MAX_RANGE_COUNT, as defined in the header file gpfs_fcntl.h, blocks may be given
in one multiple access range hint. The current value of GPFS_MAX_RANGE_COUNT is eight.
Depending on the current load, GPFS may initiate prefetching of some or all of the blocks.
v Each range named in accRangeArray that is accepted for prefetching, should eventually be released
with an identical entry in relRangeArray, or else GPFS will stop prefetching blocks for this file.
Note: Naming a subrange of a block in relRangeArray that does not exactly match a past entry in
accRangeArray has no effect, and does not produce an error.
v Applications that make random accesses or regular patterns not recognized by GPFS may benefit from
using this hint.
GPFS already recognizes sequential and strided file access patterns. Applications that use such patterns
should not need to use this hint, as GPFS automatically recognizes the pattern and performs
prefetching and write-behind accordingly. In fact, using the multiple access range hint in programs
having a sequential or strided access pattern may degrade performance due to the extra overhead to
process the hints.
Notice that the units of prefetch and release are file blocks, not file offsets. If the application intends to
make several accesses to the same block, it will generally get better performance by including the entire
range to be accessed in the GPFS_MULTIPLE_ACCESS_RANGE hint before actually doing a read or
write. A sample program gpfsperf, which demonstrates the use of the
GPFS_MULTIPLE_ACCESS_RANGE hint, is included in the GPFS product and installed in the
/usr/lpp/mmfs/samples/perf directory.
Location
Library
Structure
typedef struct {
int structLen;
int structType;
int options;
int errReason;
int errValue1;
int errValue2;
int reserved;
int reserved2;
} gpfsRestripeData_t;
Description
The gpfsRestripeData_t structure is used to restripe a file's data blocks to updates its replication and
migrate its data. The data movement is always done immediately.
Members
structLen
Length of the gpfsRestripeData_t structure.
structType
Structure identifier GPFS_FCNTL_RESTRIPE_DATA.
options
Options for restripe command. See mmrestripefs command for complete definitions.
GPFS_FCNTL_RESTRIPE_M
Migrate critical data off of suspended disks.
GPFS_FCNTL_RESTRIPE_R
Replicate data against subsequent failure.
GPFS_FCNTL_RESTRIPE_P
Place file data in assigned storage pool.
GPFS_FCNTL_RESTRIPE_B
Rebalance file data.
reserved
Must be set to 0.
Location
Library
Structure
typedef struct {
int structLen;
int structType;
int metadataReplicas;
int maxMetadataReplicas;
int dataReplicas;
int maxDataReplicas;
int errReason;
int errValue1;
int errValue2;
int reserved;
} gpfsSetReplication_t
Description
The gpfsGetReplication_t structure is used to set a file's replication factors. However, the directive does
not cause the file to be restriped immediately. Instead, the caller must append a gpfsRestripeData_t
directive or invoke an explicit restripe using the mmrestripefs or mmrestripefile command.
Members
structLen
Length of the gpfsSetReplication_t structure.
structType
Structure identifier GPFS_FCNTL_SET_REPLICATION.
metadataReplicas
Specifies how many copies of the file system's metadata to create. Enter a value of 1 or 2, but not
greater than the value of the maxMetadataReplicas attribute of the file. A value of 0 indicates not
to change the current value.
maxMetadataReplicas
The maximum number of copies of indirect blocks for a file. Space is reserved in the inode for all
possible copies of pointers to indirect blocks. Valid values are 1 and 2, but cannot be less than
DefaultMetadataReplicas. The default is 1. A value of 0 indicates not to change the current value.
dataReplicas
Specifies how many copies of the file data to create. Enter a value of 1 or 2, but not greater than
the value of the maxDataReplicas attribute of the file. A value of 0 indicates not to change the
currant value.
maxDataReplicas
The maximum number of copies of data blocks for a file. Space is reserved in the inode and
indirect blocks for all possible copies of pointers to data blocks. Valid values are 1 and 2, but
cannot be less than DefaultDataReplicas. The default is 1. A value of 0 indicates not to change
the current value.
errReason
Returned reason for request failure. Defined below.
errValue1
Returned value depending upon errReason.
Chapter 9. GPFS programming interfaces 459
errValue2
Returned value depending upon errReason.
reserved
Unused, but should be set to 0.
Error status
Location
Library
Structure
typedef struct {
int structLen;
int structType;
int errReason;
int errValue1;
int errValue2;
int reserved;
char buffer[GPFS_FCNTL_MAX_NAME_BUFFER];
} gpfsSetStoragePool_t
Description
The gpfsSetStoragePool_t structure is used to set a file's assigned storage pool. However, the directive
does not cause the file data to be migrated immediately. Instead, the caller must append a
gpfsRestripeData_t directive or invoke an explicit restripe with the mmrestripefs or mmrestripefile
command. The caller must have su or root privileges to change a storage pool assignment.
Members
structLen
Length of the gpfsSetStoragePool_t structure.
structType
Structure identifier GPFS_FCNTL_SET_STORAGEPOOL.
buffer The name of the storage pool for the file's data. Only user files may be reassigned to different
storage pool. System files, including all directories, must reside in the system pool and may not
be moved. The size of the buffer may vary, but must be a multiple of eight.
errReason
Returned reason for request failure. Defined below.
errValue1
Returned value depending upon errReason.
errValue2
Returned value depending upon errReason.
reserved
Unused, but should be set to 0.
Error status
Location
Description
The /var/mmfs/etc/mmsdrbackup user exit, when properly installed on the primary GPFS configuration
server, will be asynchronously invoked every time there is a change to the GPFS master configuration
file. This user exit can be used to create a backup of the GPFS configuration data.
Parameters
The generation number of the most recent version of the GPFS configuration data.
Exit status
The mmsdrbackup user exit should always return a value of zero.
Location
/var/mmfs/etc
Description
The /var/mmfs/etc/nsddevices user exit, when properly installed, is invoked synchronously by the GPFS
daemon during its disk discovery processing. The purpose of this procedure is to discover and verify the
physical devices on each node that correspond to the disks previously defined to GPFS with the
mmcrnsd command. The nsddevices user exit can be used to either replace or to supplement the disk
discovery procedure of the GPFS daemon.
Read the sample file /usr/lpp/mmfs/samples/nsddevices.sample for a detailed description on how to code
and install this user exit.
Parameters
None.
Exit status
When the nsddevices user exit returns a value of zero, the GPFS disk discovery procedure is bypassed.
When the nsddevices user exit returns a value of one, the GPFS disk discovery procedure is performed
and the results are concatenated with the results from the nsddevices user exit.
Location
/var/mmfs/etc
Description
The /var/mmfs/etc/syncfsconfig user exit, when properly installed, will be synchronously invoked after
each command that may change the configuration of a file system. Examples of such commands are:
mmadddisk, mmdeldisk, mmchfs, and so forth. The syncfsconfig user exit can be used to keep the file
system configuration data in replicated GPFS clusters automatically synchronized.
Parameters
None.
Exit status
Location
/var/mmfs/etc
Applications that depend on exact reporting of changes to the following fields returned by the stat() call
may not work as expected:
1. exact mtime
2. mtime
3. ctime
4. atime
Providing exact support for these fields would require significant performance degradation to all
applications executing on the system. These fields are guaranteed accurate when the file is closed.
These values will be accurate on a node right after it accesses or modifies a file, but may not be accurate
for a short while when a file is accessed or modified on some other node.
If 'exact mtime' is specified for a file system (using the mmcrfs or mmchfs commands with the -E yes
flag), the mtime and ctime values are always correct by the time the stat() call gives its answer. If 'exact
mtime' is not specified, these values will be accurate after a couple of minutes, to allow the
synchronization daemons to propagate the values to all nodes. Regardless of whether 'exact mtime' is
specified, the atime value will be accurate after a couple of minutes, to allow for all the synchronization
daemons to propagate changes.
Alternatively, you may use the GPFS calls, gpfs_stat() and gpfs_fstat() to return exact mtime and atime
values.
The delayed update of the information returned by the stat() call also impacts system commands which
display disk usage, such as du or df. The data reported by such commands may not reflect changes that
have occurred since the last sync of the file system. For a parallel file system, a sync does not occur until
all nodes have individually synchronized their data. On a system with no activity, the correct values will
be displayed after the sync daemon has run on all nodes.
For more information about GPFS ACLs and NFS export, see Managing GPFS access control lists and NFS
export in General Parallel File System: Administration and Programming Reference.
The file system version number is assigned when the file system is first created, and is updated to the
latest supported level after the file system is migrated using the mmchfs -V command.
The format version number for a file system can be displayed with the mmlsfs -V command. If a file
system was created with an older GPFS release, new functionality that requires different on-disk data
structures will not be enabled until you run the mmchfs -V command. In addition to mmchfs -V, certain
new features may require you to additionally run the mmigratefs command.
Note: The -V option cannot be used to make file systems created prior to GPFS 3.2.1.5 available to
Windows nodes. Windows nodes can mount only file systems that are created with GPFS 3.2.1.5 or later.
The mmchfs -V option requires the specification of one of two values - full or compat:
v Specifying mmchfs -V full enables all of the new functionality that requires different on-disk data
structures. After this command, nodes in remote clusters running an older GPFS version will no longer
be able to mount the file system.
v Specifying mmchfs -V compat enables only features that are backword compatible with nodes running
GPFS 3.2. After this command, nodes in remote clusters running GPFS 3.2 or later will still be able to
mount the file system, but nodes running GPFS versions 3.1 or older will not be able to mount the file
system.
| The current highest file system format version is 12.03. This is the version that is assigned to file systems
| created with GPFS 3.4. The same version number will be assigned to older file systems after you run the
| mmchfs -V full command.
| If your current file system is at format level 11.03 (GPFS 3.3), the set of enabled features depends on the
| value specified with the mmchfs -V option:
| v After running mmchfs -V full, the file system will be able to support the following:
| – more than 2,147,483,648 files
| – fast extended attributes (which requires mmmigratefs, which is an offline utility, to be run also)
| v There are no new features that can be enabled with mmchfs -V compat.
| If your current file system is at format level 10.00 (GPFS 3.2.0.0) or 10.01 (GPFS 3.2.1.5), after running
| mmchfs -V, the file system will be able to support all of the above, plus the following:
v New maximum number of filesets in a file system (10000)
v New maximum for the number of hard links per object (2**32)
v Improved quota performance for systems with large number of users
v Policy language support for new attributes, variable names, and functions: MODE, INODE, NLINK,
RDEVICE_ID, DEVICE_ID, BLOCKSIZE, GENERATION, XATTR(), ATTR_INTEGER(), and
XATTR_FLOAT()
If your current file system is at format level 9.03 (GPFS 3.1), after running mmchfs -V, the file system will
be able to support all of the above, plus:
v Fine grain directory locking
If your current file system is at format level 8.00 (GPFS 2.3), after running mmchfs -V, the file system will
be able to support all of the above, plus:
v Storage pools
v Filesets
v Fileset quotas
If your current file system is at format level 7.00 (GPFS 2.2), after running mmchfs -V, the file system will
be able to support all of the above, plus:
v NFS V4 access control lists
v New format for the internal allocation summary files
If your current file system is at format level 6.00 (GPFS 2.1), after running mmchfs -V, the file system will
be able to support all of the above, plus extended access control list entries (-rwxc access mode bits).
The functionality described in this topic is only a subset of the functional changes introduced with the
different GPFS releases. Functional changes that do not require changing the on-disk data structures are
not listed here. Such changes are either immediately available when the new level of code is installed, or
require running the mmchconfig release=LATEST command. For a complete list, see the Summary of
changes.
Accessibility features
The following list includes the major accessibility features in GPFS:
v Keyboard-only operation
v Interfaces that are commonly used by screen readers
v Keys that are discernible by touch but do not activate just by touching them
v Industry-standard devices for ports and connectors
v The attachment of alternative input and output devices
The IBM Cluster Information Center, and its related publications, are accessibility-enabled. The
accessibility features of the information center are described at Accessibility
Keyboard navigation
This product uses standard Microsoft Windows navigation keys.
IBM may not offer the products, services, or features discussed in this document in other countries.
Consult your local IBM representative for information on the products and services currently available in
your area. Any reference to an IBM product, program, or service is not intended to state or imply that
only IBM's product, program, or service may be used. Any functionally equivalent product, program, or
service that does not infringe any of IBM's intellectual property rights may be used instead. However, it
is the user's responsibility to evaluate and verify the operation of any non-IBM product, program, or
service.
IBM may have patents or pending patent applications covering subject matter described in this
document. The furnishing of this document does not give you any license to these patents. You can send
license inquiries, in writing, to:
For license inquiries regarding double-byte character set (DBCS) information, contact the IBM Intellectual
Property Department in your country or send inquiries, in writing, to:
The following paragraph does not apply to the United Kingdom or any other country where such
provisions are inconsistent with local law:
This information could include technical inaccuracies or typographical errors. Changes are periodically
made to the information herein; these changes will be incorporated in new editions of the publication.
IBM may make improvements and/or changes in the product(s) and/or the program(s) described in this
publication at any time without notice.
Any references in this information to non-IBM Web sites are provided for convenience only and do not in
any manner serve as an endorsement of those Web sites. The materials at those Web sites are not part of
the materials for this IBM product and use of those Web sites is at your own risk.
IBM may use or distribute any of the information you supply in any way it believes appropriate without
incurring any obligation to you.
IBM Corporation
Intellectual Property Law
Mail Station P300
2455 South Road,
Poughkeepsie, NY 12601-5400
USA
Such information may be available, subject to appropriate terms and conditions, including in some cases,
payment or a fee.
The licensed program described in this document and all licensed material available for it are provided
by IBM under terms of the IBM Customer Agreement, IBM International Program License Agreement or
any equivalent agreement between us.
This information contains examples of data and reports used in daily business operations. To illustrate
them as completely as possible, the examples include the names of individuals, companies, brands, and
products. All of these names are fictitious and any similarity to the names and addresses used by an
actual business enterprise is entirely coincidental.
COPYRIGHT LICENSE:
This information contains sample application programs in source language, which illustrates
programming techniques on various operating platforms. You may copy, modify, and distribute these
sample programs in any form without payment to IBM, for the purposes of developing, using, marketing
or distributing application programs conforming to the application programming interface for the
operating platform for which the sample programs are written. These examples have not been thoroughly
tested under all conditions. IBM, therefore, cannot guarantee or imply reliability, serviceability, or
function of these programs. You may copy, modify, and distribute these sample programs in any form
without payment to IBM for the purposes of developing, using, marketing, or distributing application
programs conforming to the application programming interfaces for the operating platform for which the
sample programs are written. These examples have not been thoroughly tested under all conditions. IBM,
therefore, cannot guarantee or imply reliability, serviceability, or function of these programs.
If you are viewing this information softcopy, the photographs and color illustrations may not appear.
Trademarks
IBM, the IBM logo, and ibm.com® are trademarks or registered trademarks of International Business
Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be
trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at
“Copyright and trademark information” at http://www.ibm.com/legal/copytrade.shtml.
| Intel® and Pentium® are trademarks or registered trademarks of Intel Corporation or its subsidiaries in
| the United States and other countries.
Java™ and all Java-based trademarks and logos are trademarks or registered trademarks of Sun
Microsystems, Inc. in the United States, other countries, or both.
Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both.
UNIX is a registered trademark of the Open Group in the United States and other countries.
Other company, product, or service names may be trademarks or service marks of others.
Notices 475
476 GPFS: Administration and Programming Reference
Glossary
This glossary defines technical terms and Application Environment (CAE) Specification
abbreviations used in GPFS documentation. If you C429, The Open Group ISBN
do not find the term you are looking for, refer to 1-85912-190-X.
the index of the appropriate book or view the
deadman switch timer
IBM Glossary of Computing Terms, located on the
A kernel timer that works on a node that
Internet at: http://www-306.ibm.com/software/
has lost its disk lease and has outstanding
globalization/terminology/index.jsp.
I/O requests. This timer ensures that the
node cannot complete the outstanding
B
I/O requests (which would risk causing
block utilization file system corruption), by causing a
The measurement of the percentage of panic in the kernel.
used subblocks per allocated blocks.
disk descriptor
A definition of the type of data that the
C
disk contains and the failure group to
cluster which this disk belongs. See also failure
A loosely-coupled collection of group.
independent systems (nodes) organized
disposition
into a network for the purpose of sharing
The session to which a data management
resources and communicating with each
event is delivered. An individual
other. See also GPFS cluster.
disposition is set for each type of event
cluster configuration data from each file system.
The configuration data that is stored on
disk leasing
the cluster configuration servers.
A method for controlling access to storage
cluster manager devices from multiple host systems. Any
The node that monitors node status using host that wants to access a storage device
disk leases, detects failures, drives configured to use disk leasing registers
recovery, and selects file system for a lease; in the event of a perceived
managers. The cluster manager is the failure, a host system can deny access,
node with the lowest node number preventing I/O operations with the
among the quorum nodes that are storage device until the preempted system
operating at a particular time. has reregistered.
control data structures domain
Data structures needed to manage file A logical grouping of resources in a
data and metadata cached in memory. network for the purpose of common
Control data structures include hash management and administration.
tables and link pointers for finding
cached data; lock states and tokens to F
implement distributed locking; and
failback
various flags and sequence numbers to
Cluster recovery from failover following
keep track of updates to the cached data.
repair. See also failover.
D | failover
| (1) The assumption of file system duties
Data Management Application Program
| by another node when a node fails. (2)
Interface (DMAPI)
| The process of transferring all control of
The interface defined by the Open
| the ESS to a single cluster in the ESS
Group's XDSM standard as described in
| when the other clusters in the ESS fails.
the publication System Management: Data
| See also cluster. (3) The routing of all
Storage Management (XDSM) API Common
G K
GPFS cluster kernel The part of an operating system that
A cluster of nodes defined as being contains programs for such tasks as
available for use by GPFS file systems. input/output, management and control of
hardware, and the scheduling of user
GPFS portability layer
tasks.
The interface module that each
M
Glossary 479
nodes writing to the file system, and management session due to the failure of
comparison of the allocated space to the daemon on the session node.
quota limits at regular intervals.
session node
The node on which a data management
R
session was created.
Redundant Array of Independent Disks (RAID)
Small Computer System Interface (SCSI)
A collection of two or more disk physical
An ANSI-standard electronic interface
drives that present to the host an image
that allows personal computers to
of one or more logical disk drives. In the
communicate with peripheral hardware,
event of a single physical device failure,
such as disk drives, tape drives, CD-ROM
the data can be read or regenerated from
drives, printers, and scanners faster and
the other disk drives in the array due to
more flexibly than previous interfaces.
data redundancy.
snapshot
recovery
A copy of changed data in the active files
The process of restoring access to file
and directories of a file system with the
system data when a failure has occurred.
exception of the inode number, which is
Recovery can involve reconstructing data
changed to allow application programs to
or providing alternative routing through a
distinguish between the snapshot and the
different server.
active files and directories.
replication
source node
The process of maintaining a defined set
The node on which a data management
of data in more than one location.
event is generated.
Replication involves copying designated
changes for one location (a source) to stand-alone client
another (a target), and synchronizing the The node in a one-node cluster.
data in both locations.
storage area network (SAN)
rule A list of conditions and actions that are A dedicated storage network tailored to a
triggered when certain conditions are met. specific environment, combining servers,
Conditions include attributes about an storage products, networking products,
object (file name, type or extension, dates, software, and services.
owner, and groups), the requesting client,
storage pool
and the container name associated with
A grouping of storage space consisting of
the object.
volumes, logical unit numbers (LUNs), or
addresses that share a common set of
S
administrative characteristics.
SAN-attached
stripe group
Disks that are physically attached to all
The set of disks comprising the storage
nodes in the cluster using Serial Storage
assigned to a file system.
Architecture (SSA) connections or using
fibre channel switches striping
A storage process in which information is
secondary GPFS cluster configuration server
split into blocks (a fixed amount of data)
In a GPFS cluster, the node chosen to
and the blocks are written to (or read
maintain the GPFS cluster configuration
from) a series of disks in parallel.
data in the event that the primary GPFS
cluster configuration server fails or subblock
becomes unavailable. The smallest unit of data accessible in an
I/O operation, equal to one thirty-second
Secure Hash Algorithm digest (SHA digest)
of a data block.
A character string used to identify a GPFS
security key. system storage pool
A storage pool containing file system
session failure
control structures, reserved files,
The loss of all resources of a data
directories, symbolic links, special devices,
T
token management
A system for controlling file access in
which each application performing a read
or write operation is granted some form
of access to a specific block of file data.
Token management provides data
consistency and controls conflicts. Token
management has two components: the
token management server, and the token
management function.
token management function
A component of token management that
requests tokens from the token
management server. The token
management function is located on each
cluster node.
token management server
A component of token management that
controls tokens relating to the operation
of the file system. The token management
server is located at the file system
manager node.
twin-tailed
A disk connected to two nodes.
U
user storage pool
A storage pool containing the blocks of
data that make up user files.
V
virtual file system (VFS)
A remote file system that has been
mounted so that it is accessible to the
local user.
virtual node (vnode)
The structure that contains information
about a file system object in an virtual file
system (VFS).
Glossary 481
482 GPFS: Administration and Programming Reference
Index
Special characters adminMode attribute
changing 102
--aix-trace-buffer-size adminMode configuration parameter 2
changing 310 agent node 443
--trace-dispatch appendOnly file attribute 96, 224
changing 310 atime 123, 405, 408, 411, 414, 430, 432, 434, 467
--tracedev-buffer-size attributes
changing 310 useNSDserver 41
--tracedev-compression-level automatic mount
changing 310 indicating 146
--tracedev-overwrite-buffer-size automatic quota activation 289
changing 310 automount 17
--tracedev-write-mode automounter attribute
changing 310 changing 103
--traceFileSize availability
changing 310 disk 38
A B
access ACL 52 backing up a file system 28
access control information 356 using the GPFS policy engine 29
restoring 419 backing up a temporary snapshot to the TSM server 29
access control lists backing up file system configuration information
administering 55 using the mmbackupconfig command 30
allow type 55 backup applications
applying 58 writing 31
changing 54, 58 backup server 92
creating 269 block level incremental backups 391
DELETE 56 block size
DELETE_CHILD 56 affect on maximum mounted file system size 105, 146
deleting 54, 58, 170 choosing 146
deny type 55
displaying 53, 58, 213
editing 196
exceptions 59 C
getting 216 cache 61
inheritance 55 changing
DirInherit 55 adminMode attribute 102
FileInherit 55 an administration or daemon interface for a node 130
Inherited 55 automounter attribute 103
InheritOnly 55 cipherList attribute 103
limitations 59 cluster configuration attributes 102
managing 51 cnfsMountdPort attribute 103
NFS V4 51, 55 cnfsNFSDprocs attribute 103
NFS V4 syntax 55 cnfsSharedRoot attribute 103
setting 52, 53, 57 cnfsVIP attribute 103
special names 55 configuration attributes on the mmchconfig command 9
traditional 51 dataStructureDump attribute 103
translation 57 defaultMountDir attribute 104
accessibility features for the GPFS product 471 disk states 39
ACL information 324, 404 maxblocksize attribute 105
restoring 419 maxFcntlRangesPerFile attribute 106
retrieving 356 maxFilesToCache attribute 106
activating quota limit checking 47 maxMBpS attribute 106
adding maxStatCache attribute 106
disks 34 mmapRangeLock attribute 106
adding nodes to a GPFS cluster 6 nsdServerWaitTimeForMount 106
administering nsdServerWaitTimeWindowOnMount 107
GPFS file system 1 pagepool attribute 107
administration tasks 1 prefetchThreads attribute 107
quotas 45
Index 485
file access file systems (continued)
application driven prefetching and write-behind 456 importing 219
canceling 441, 442 inconsistencies 204
clearing 442 links to snapshots 304
freeing a given range 450 listing mounted 246
gpfs_fcntl() 63 management 17
gpfsDataShipStart_t 445 migrating 121, 258
gpfsDataShipStop_t 448 mounted file system sizes 105, 146
file attribute mounting 260
extended 367, 369 mounting on multiple nodes 17
querying 224 moving to another cluster 121, 202
file attributes mtime value 122
appendOnly 96, 224 NFS export 61
file cache 442, 443 NFS V4 export 61
file descriptor querying space 189
closing 365 quotas 164
opening 375, 377 rebalancing 295
file permissions reducing fragmentation 27, 167
control 52 remote 88, 276, 279
GPFS extension 51 repairing 20, 204
file status information 342, 430, 432, 434 restoring with snapshots 289
gpfs_stat_inode_with_xattrs() 436 restripe 74
gpfs_stat_inode_with_xattrs64() 438 restriping 24, 295
file system configuration information, backing up space, querying 25
mmbackupconfig command 30 unmounting 302, 312
using the mmbackupconfig command 30 unmounting on multiple nodes 19
file system descriptor quorum 210 which nodes have mounted 19
file system manager FileInherit 55
changing nodes 13, 128 files
displaying current 244 dsm.sys 30
displaying node currently assigned 13 orphaned 204
file system name 344, 346 rebalancing 292
file system snapshot handle 340 restriping 292
file system space filesets
querying 189 changing 119
file systems creating 142
access control lists 51 deleting 177
adding disks 73 displaying 235
AIX export 61 ID 371
attributes linking 222
changing 22 name 371
displaying 21 quotas 43
backing up 28, 92 unlinking 315
block size 144 FlashCopy image 209
change manager node 128 full backup 394, 396, 405, 408, 411, 414
changing attributes 121
changing attributes for files 96
checking 20, 204, 219
control request 209
G
genkey 89
controlled by GPFS 468
GPFS
creating 144
stopping 302
creating snapshot 156
GPFS cache 61
deleting 19, 180
GPFS cluster
deleting disks 173
adding nodes 6
disk fragmentation 26
changing the GPFS cluster configuration servers 8
displaying attributes 238
creating 5, 138
displaying format version 238
deleting nodes 7
exporting 60, 202
displaying configuration information 5
exporting using NFS 59
managing 5
file system manager
GPFS cluster configuration data 202
displaying 244
GPFS cluster configuration information
format version 121
displaying 228
formatting 145
GPFS cluster configuration server
fragmentation
changing 99
querying 26
primary 100
GPFS control 468
secondary 100
handle 340
Index 487
grace period maxMBpS attribute
changing 199 changing 106
setting 199 maxStatCache attribute
group quota 159, 162, 164, 200, 253, 274, 282 changing 106
metadata 97
metadata replica 122
H metadata replication
changing 23
hints
mmaddcallback 68
description 63
mmadddisk 34, 73, 466
subroutine for passing 330
mmaddnode 6, 77
hole 391
mmapplypolicy 80
mmapRangeLock attribute
changing 106
I mmauth 88
I/O caching policy mmbackup 28, 92
changing 96 mmbackupconfig 94
in-doubt value 116, 253, 282 mmchattr 23, 24, 96
incremental backup 394, 396, 405, 408, 411, 414 mmchcluster 8, 99
inheritance of ACLs 55 mmchconfig 9, 60, 61, 102
DirInherit 55 mmchdisk 38, 39, 112
FileInherit 55 mmcheckquota 43, 45, 116
Inherited 55 mmchfileset 119
InheritOnly 55 mmchfs 22, 47, 61, 121
Inherited 55 mmchlicense 126
InheritOnly 55 mmchmgr 128
inode mmchnode 130
attributes 358, 361 mmchnsd 134
inode file handle 365, 366 mmchpolicy 136
inode number 375, 377, 387, 389, 394, 396, 398, 400, 402, 405, mmcrcluster 5, 138
408, 411, 414, 426, 428 mmcrfileset 142
inode scan 391, 394, 396, 398, 400, 402, 426, 428 mmcrfs 17, 47, 55, 61, 144
closing 325 mmcrnsd 34, 151, 298, 465, 466
opening 405, 408, 411, 414 mmcrsnapshot 31, 156
inode scan handle 325, 393 mmdefedquota 159
iscan handle 325 mmdefquotaoff 162
mmdefquotaon 164
mmdefragfs 26, 27, 167
K mmdelacl 54, 55, 58, 170
mmdelcallback 172
kernel memory 96
mmdeldisk 173, 466
mmdelfileset 177
mmdelfs 19, 180
L mmdelnode 7, 182
license inquiries 473 mmdelnsd 185
linking mmdelsnapshot 187
filesets 222 mmdf 25, 189
links to snapshots mmdiag command 192
creating 304 mmeditacl 54, 55, 57, 58, 196
deleting 304 mmedquota 45, 199
lost+found directory 20 mmexportfs 202
MMFS_FSSTRUCT 204
MMFS_SYSTEM_UNMOUNT 204
M mmfsck 204
mmfsctl 209
managing
mmgetacl 53, 57, 58, 213
a GPFS cluster 5
mmgetstate 216
GPFS quotas 43
mmimportfs 219
maxblocksize attribute
mmlinkfileset 222
changing 105
mmlsattr 23, 224
maxFcntlRangesPerFile attribute
mmlscallback 226
changing 106
mmlscluster 5, 228
maxFilesToCache attribute
mmlsconfig 230
changing 106
mmlsdisk 38, 232
maximum number of files
mmlsfileset 235
changing 121
mmlsfs 21, 61, 238
displaying 238
mmlslicense 242
N
Network File System (NFS)
cache usage 61
P
pagepool attribute
exporting a GPFS file system 59
changing 107
interoperability with GPFS 59
partitioning file blocks 445
synchronous writes 61
patent information 473
unmounting a file system 61
peer recovery cluster 209
Network Shared Disks (NSDs) 465
Peer-to-Peer Remote Copy (PPRC) 209
changing configuration attributes 40, 134
performance
creating 151
communicating file access patterns 63
deleting 185
monitoring 264
displaying 248
Persistent Reserve
NFS automount 62
disabling 42
NFS V4 51, 121, 146
enabling 42
NFS V4 ACL 122, 170, 196, 197, 213, 214, 269
policy
GPFS exceptions 468
applying 80
special names 468
PR 42
NFS V4 protocol
prefetchThreads attribute
GPFS exceptions 468
changing 107
node descriptor 77, 139
primary GPFS cluster configuration server 139
node designation 77, 139
principles
node quorum 13
common to GPFS commands 3
node quorum with tiebreaker 8, 13
problem determination information
nodes
placement of 103
adding to a cluster 77
public/private key pair 88
adding to a GPFS cluster 6
assigned as file system manager 13
Index 489
Q server node
restoring NSD path 262
querying server node, NSD
disk fragmentation 26 choosing 151
file system fragmentation 26 setting access control lists 52
replication 23 sidAutoMapRangeLength attribute
space 25 changing 107
quorum 210 sidAutoMapRangeStart attribute
quorum node 138, 182, 216 changing 108
quota files snapshot
replacing 49, 116 temporary 29
quota information 424 snapshot directory 353
quotas snapshot handle 340, 344, 345, 346, 348, 350, 352, 355
activating 274 free 339
activating limit checking 47 snapshot ID 340, 341, 345, 350
changing 45, 199, 421 comparing 326
checking 45, 116 internal 360, 363
creating reports 282 snapshot name 346, 355
deactivating 272 snapshots
deactivating quota limit checking 47 creating 156
default values 44 deleting 187
disabling 44 directory 156
displaying 46, 253 displaying 256
enabling 43 listing 256
establishing 45 maximum number 156
fileset 43 partial 156
group 43 restoring a file system 289
reports, creating 48 sparse file 391
setting 199 specifying
user 43 subnets attribute 108
standards, exceptions to 467
starting GPFS 15, 307
R before starting 14
RAID stripe size 144 status
read file permission 51 disk 38
rebalancing a file 292 stopping GPFS 15
rebalancing a file system 295 storage
refresh NSD server pre-allocating 417
mmnsddiscover 262 storage pools
release attribute ID 373
changing 107 name 373
remote copy command strict 122, 147
changing 99 structures
choosing 138 gpfs_acl_t 324
remote shell command gpfs_direntx_t 328
changing 99 gpfs_direntx64_t 329
choosing 138 gpfs_fssnap_handle_t 340
repairing a file system 20 gpfs_fssnap_id_t 341
replacing disks 37 gpfs_iattr_t 358
replicated cluster 466 gpfs_iattr64_t 361
replication 122, 147 gpfs_ifile_t 366, 367, 369
changing 23 gpfs_iscan_t 393
querying 23, 224 gpfs_opaque_acl_t 404
replication attributes gpfs_quotaInfo_t 424
changing 96 gpfsAccessRange_t 440
replication factor 96 gpfsCancelHints_t 441
requirements gpfsClearFileCache_t 442
administering GPFS 1 gpfsDataShipMap_t 443
for Tivoli Storage Manager 29 gpfsDataShipStart_t 445
restoring NSD path gpfsDataShipStop_t 448
mmnsddiscover 262 gpfsFcntlHeader_t 449
restriping a file 292 gpfsFreeRange_t 450
restriping a file system 24, 295 gpfsGetFilesetName_t 451
gpfsGetReplication_t 452
gpfsGetSnapshotName_t 454
S gpfsGetStoragePool_t 455
gpfsMultipleAccessRange_t 456
secondary GPFS cluster configuration server 140
Index 491
W
worker1Threads attribute
changing 109
write file permission 51
Printed in USA
SA23-2221-04