Chapter 18. Troubleshooting

Chapter 18. Troubleshooting
Prev		Next

Configuring and administering a CXFS cluster can be a complex task. In general, most problems can be solved by rebooting a node. However, the topics in this chapter may help you avoid rebooting:

You must perform administrative tasks with cmgr from a node that has the cluster_admin software package installed; you must connect the GUI to such a node. See the CXFS MultiOS Client-Only Guide for SGI InfiniteStorage for additional troubleshooting information.

Troubleshooting Strategy

To troubleshoot CXFS problems, do the following:

Know the Tools

This section provides an overview of the tools required to troubleshoot CXFS:

Caution: Many of the commands listed are beyond the scope of this book and are provided here for quick reference only. See the other guides and man pages referenced for complete information before using these commands.

Physical Storage Tools

Understand the following physical storage tools:

To display the hardware inventory:
- IRIX:
  irix# /sbin/hinv
- Linux 64-bit (assuming the sgi-misc RPM is installed):
  [root@linux64 root]# /usr/bin/hinv [root@linux64 root]# /usr/bin/topology
If the output is not what you expected, do a probe for devices and perform a SCSI bus reset, using the following commands:
- IRIX:
  irix# /usr/sbin/scsiha -pr bus_number
- Linux 64-bit:
  root@linux64 root]# /bin/xscsiha -pr /dev/xscsi/busnumber/bus
To configure I/O devices on an IRIX node, use the following command:
irix# /sbin/ioconfig -f /hw
To show the physical volumes, use the xvm command:
# /sbin/xvm show -v phys/
See the XVM Volume Manager Administrator's Guide.

Cluster Configuration Tools

Understand the following cluster configuration tools:

To configure XVM volumes, use the xvm command:
# /sbin/xvm
See the XVM Volume Manager Administrator's Guide.
To configure CXFS nodes and cluster, use either the GUI or the cmgr command:
- The GUI:
  # /usr/sbin/cxfsmgr
  See “GUI Features” in Chapter 10 and Chapter 10, “Reference to GUI Tasks for CXFS”.
- The cmgr command line with prompting:
  # /usr/cluster/bin/cmgr -p
  See “cmgr Overview” in Chapter 11, and Chapter 11, “Reference to cmgr Tasks for CXFS”.
To reinitialize the database, use the cdbreinit command:
# /usr/cluster/bin/cdbreinit
See “Recreating the Cluster Database”.

Cluster Control Tools

Understand the following cluster control tools:

To start and stop the cluster services daemons on administration nodes:
# /etc/init.d/cxfs_cluster start # /etc/init.d/cxfs start # /etc/init.d/cxfs # /etc/init.d/cxfs_cluster start
To start and stop the cluster services daemons on client-only nodes:
# /etc/init.d/cxfs_client start # /etc/init.d/cxfs_client stop
These commands are useful if you know that filesystems are available but are not indicated as such by the cluster status, or if cluster quorum is lost.

See the following:
To start and stop CXFS services, use the GUI or the following cmgr commands:
cmgr> start cx_services on node hostname for cluster clustername cmgr> stop cx_services on node hostname for cluster clustername
Running this command on the metadata server will cause its filesystems to be recovered by another potential metadata server. See “Cluster Services Tasks with cmgr” in Chapter 11, and “Cluster Services Tasks with the GUI” in Chapter 10.

Note: Relocation and recovery are supported only when using standby nodes. Relocation is disabled by default.
To allow and revoke CXFS kernel membership on the local node, forcing recovery of the metadata server for the local node, use the GUI or the following cmgr commands:
cmgr> admin cxfs_start cmgr> admin cxfs_stop
Wait until recovery is complete before issuing a subsequent admin cxfs_start. The local node cannot rejoin the CXFS kernel membership until its recovery is complete.

See the following:

Networking Tools

Understand the following networking tools:

To send packets to network hosts:

IRIX:
irix# /usr/etc/ping
Linux 64-bit:
[root@linux64 root]# /bin/ping

To show network status:

IRIX:
irix# /usr/etc/netstat
Linux 64-bit:
[root@linux64 root]# /bin/netstat

Cluster/Node Status Tools

Understand the following cluster/node status tools:

To show which cluster daemons are running:
# ps -ef | grep cluster
See “Verify that the Cluster Daemons are Running” in Chapter 9.
To see cluster and filesystem status, use one of the following:
- GUI:
  # /usr/sbin/cxfsmgr
  See “Display a Cluster with the GUI” in Chapter 10.
- cluster_status command:
  # /usr/cluster/cmgr-scripts/cluster_status
  See “Check Cluster Status with cluster_status” in Chapter 16.
- clconf_info command:
  # /usr/cluster/bin/clconf_info
- cxfs_info command on an IRIX or Linux 64-bit client-only node:
  # /usr/cluster/bin/cxfs_info
To see the mounted filesystems:
- IRIX:
  irix# /sbin/mount irix# /usr/sbin/df
- Linux 64-bit:
  [root@linux64 root]# /bin/mount [root@linux64 root]# /bin/df
You can also use the df command to report the number of free disk blocks
To show volumes:
# /sbin/xvm show vol/
See the XVM Volume Manager Administrator's Guide.

Performance Monitoring Tools

Understand the following performance monitoring tools:

To monitor system activity:
# /usr/bin/sar
To monitor file system buffer cache activity on IRIX nodes:
irix# /usr/sbin/bufview
Note: Do not use bufview interactively on a busy IRIX node; run it in batch mode.
To monitor operating system activity data on an IRIX node::
irix# /usr/sbin/osview
To monitor the statistics for an XVM volume, use the xvm command:
# /sbin/xvm change stat on {concatname|stripename|physname}
See the XVM Volume Manager Administrator's Guide.
To monitor system performance, use Performance Co-Pilot. See the Performance Co-Pilot for IA-64 Linux User's and Administrator's Guide, Performance Co-Pilot for IRIX Advanced User's and Administrator's Guide, the Performance Co-Pilot Programmer's Guide, and the pmie and pmieconf man pages.

Kernel Status Tools

Understand the following kernel status tools (this may require help from SGI service personnel):

To determine IRIX kernel status, use the icrash commands:
# /usr/bin/icrash
- cfs to list CXFS commands
- dcvn to list client vnodes
- dsvn to list server vnodes
- mesglist to trace messages to the receiver
- sinfo to show clients/servers and filesystems
- sthread | grep cmsd to determine the CXFS kernel membership state. You may see the following in the output:
  - cms_dead() indicates that the node is dead
  - cms_follower() indicates that the node is waiting for another node to create the CXFS kernel membership (the leader)
  - cms_leader() indicates that the node is leading the CXFS kernel membership creation
  - cms_declare_membership() indicates that the node is ready to declare the CXFS kernel membership but is waiting on resets
  - cms_nascent() indicates that the node has not joined the cluster since starting
  - cms_shutdown() indicates that the node is shutting down and is not in the CXFS kernel membership
  - cms_stable() indicates that the CXFS kernel membership is formed and stable
- tcp_channels to determine the status of the connection with other nodes
- -t -a -w filename to trace for CXFS
- -t cms_thread to trace one of the above threads
To determine Linux 64-bit kernel status, use the KDB built-in kernel debugger.

When kdb is enabled, a system panic will cause the debugger to be invoked and the keyboard LEDs will blink. The kdb prompt will display basic information. To obtain a stack trace, enter the bt command at the kdb prompt:
kdb> bt
To get a list of current processes, enter the following:
kdb> ps
To backtrace a particular process, enter the following, where PID is the process ID:
kdb> btp PID
To exit the debugger, enter the following:
kdb> go
If the system will be run in graphical mode with kdb enabled, SGI highly recommends that you use kdb on a serial console so that the kdb prompt can be seen.
To invoke internal kernel routines that provide useful debugging information, use the idbg command:
# /usr/sbin/idbg

Log Files

Understand the log files discussed in “Status in Log Files” in Chapter 16.

Gather Cluster Configuration with `cxfsdump`

Before reporting a problem to SGI, you should use the cxfsdump command to gather configuration information about the CXFS cluster, such as network interfaces, CXFS registry information, I/O, and cluster database contents. This will allow SGI support to solve the problem more quickly.

Note: The cxfsdump command requires access to AIX, IRIX, Linux 64-bit, Linux 32-bit, Mac OS X, and Solaris nodes in the cluster via the rcp and rsh commands. Because these commands are not provided on Windows nodes, the cxfsdump command must be run manually on each Windows node.

The cxfsdump /? command displays a help message on Windows nodes. The cxfsdump -help command displays a help message on other nodes.

You should run cxfsdump from a CXFS administration node in the cluster:

# /usr/cluster/bin/cxfsdump

The output will be placed in a file in the directory /var/cluster/cxfsdump-data directory on the CXFS administration node on which the cxfsdump command was run. The cxfsdump command will report the name and location of the file when it is finished.

If your cluster contains Windows nodes, you must run the command manually on each Windows node.

To gather information about just the local mode, use the cxfsdump -local command.

Following is an example of gathering information for the entire cluster from an IRIX node:

adminnode# cxfsdump

Detecting cluster configuration

 Executing CXFSDUMP on CLUSTER testcluster NODE o200a
Gathering cluster information...
Determining OS level......
Getting versions info....
Obtaining CXFS database...
Checking for tie-breakers etc...
Obtaining hardware inventory...
Grabbing /etc/hosts.....
Grabbing /etc/resolv.conf...
Grabbing /ets/nsswitch.conf...
Obtaining physvol information using XVM...
ioctl() to xvm api node failed: Invalid argument
Could not get xvm subsystem info:  xvmlib_execute_ioctl: system call failed. 
Obtaining Volume topology information using XVM...
ioctl() to xvm api node failed: Invalid argument
Could not get xvm subsystem info:  xvmlib_execute_ioctl: system call failed. 
Copying failover configuration and scsifo paths ...
Gathering network information...
Checking for any installed Patches..
Monitoring file system buffer cache for 3 minutes...
Running Systune ...
Obtaining modified system tunable parameters...
Creating ICRASH CMD file...
Executing ICRASH commands...
Copying CXFS logs...
Copying /var/cluster/ha/log/cad_log...
Copying /var/cluster/ha/log/clconfd_o200a...
Copying /var/cluster/ha/log/cli_o200a...
Copying /var/cluster/ha/log/cmond_log...
Copying /var/cluster/ha/log/crsd_o200a...
Copying /var/cluster/ha/log/fs2d_log...
Copying /var/cluster/ha/log/fs2d_log.old...
Copying SYSLOG...
Distributing /usr/cluster/bin/cxfsdump.pl to node o200c ...
Distributing /usr/cluster/bin/cxfsdump.pl to node o200b ...
Creating the output directory : /var/cluster/cxfsdump-data
Gathering node information for the cluster testcluster ...
Running RSH to node o200c...
Running RSH to node o200b...
Waiting for other cluster nodes to gather data...
FINAL CXFSDUMP OUTPUT IN /var/cluster/cxfsdump-data/testcluster_cxfsdump20020903.tar.gz

On Windows systems, cxfsdump creates a directory called cxfsdump-data in the same directory where the the passwd file is kept. The cxfsdump command will report the location where the data is stored when it is complete. For example:

FINAL CXFSDUMP output in output_filename

Avoid Problems

This section covers the following:

Proper Start Up

Ensure that you follow the instructions in “Preliminary Cluster Configuration Steps” in Chapter 9, before configuring the cluster.

Eliminate a Residual Cluster

Before you start configuring another new cluster, make sure no nodes are still in a CXFS membership from a previous cluster. Enter the following to check for a cmsd kernel thread:

IRIX:
irix# icrash -e 'sthread | grep cmsd'
Linux 64-bit:
[root@linux64 root]# ps -ef | grep cmsd

If the output shows a cmsd kernel thread, force a CXFS shutdown by entering the following:

# /usr/cluster/bin/cmgr -p
cmgr> admin cxfs_stop

Then check for a cmsd kernel thread again.

After waiting a few moments, if the cmsd kernel thread still exists, you must reboot the machine or leave it out of the new cluster definition. It will not be able to join a new cluster in this state and it may prevent the rest of the cluster from forming a new CXFS membership.

Cluster Database Membership Quorum Stability

The cluster database membership quorum must remain stable during the configuration process. If possible, use multiple windows to display the fs2d_log file for each CXFS administration node while performing configuration tasks. Enter the following:

# tail -f /var/cluster/ha/log/fs2d_log

Check the member count when it prints new quorums. Under normal circumstances, it should print a few messages when adding or deleting nodes, but it should stop within a few seconds after a new quorum is adopted.

If not enough machines respond, there will not be a quorum. In this case, the database will not be propagated.

If you detect cluster database membership quorum problems, fix them before making other changes to the database. Try restarting the cluster infrastructure daemons on the node that does not have the correct cluster database membership quorum, or on all nodes at the same time.

Enter the following on administration nodes:

# /etc/init.d/cxfs_cluster stop
# /etc/init.d/cxfs_cluster start

Enter the following on client-only nodes:

# /etc/init.d/cxfs_client stop
# /etc/init.d/cxfs_client start

Please provide the fs2d log files when reporting a cluster database membership quorum problem.

Consistency in Configuration

Be consistent in configuration files for nodes across the pool, and when configuring networks. Use the same names in the same order. See “Configuring System Files” in Chapter 8.

Define Node Function Appropriately

Use the appropriate node function definition:

Use an odd number of server-capable nodes and an odd number of CXFS administration nodes for stability.
Make unstable nodes CXFS client-only nodes.

GUI Use

The GUI provides a convenient display of a cluster and its components through the view area. You should use it to see your progress and to avoid adding or removing nodes too quickly. After defining a node, you should wait for it to appear in the view area before adding another node. After defining a cluster, you should wait for it to appear before you add nodes to it. If you make changes too quickly, errors can occur.

For more information, see “Starting the GUI” in Chapter 10.

When running the GUI on IRIX, do not move to another IRIX desktop while GUI action is taking place; this can cause the GUI to crash.

Log File Names and Sizes

You should not change the names of the log files. If you change the names of the log files, errors can occur.

Periodically, you should rotate log files to avoid filling your disk space; see “Log File Management” in Chapter 12. If you are having problems with disk space, you may want to choose a less verbose log level; see “Configure Log Groups with the GUI” in Chapter 10, or “Configure Log Groups with cmgr” in Chapter 11.

IRIX: Netscape and the Brocade Switch GUI

When accessing the Brocade Web Tools V2.0 through Netscape on an IRIX node, you must first enter one of the following before starting Netscape:

For sh or ksh shells:
$ NOJIT=1; export NOJIT
For csh shell:
% setenv NOJIT 1

If this is not done, Netscape will crash with a core dump.

Performance Problems with Unwritten Extent Tracking and Exclusive Write Tokens

This section discusses performance problems with unwritten extent tracking and exclusive write tokens.

Unwritten Extent Tracking

When you define a filesystem, you can specify whether unwritten extent tracking is on (unwritten=1) or off (unwritten=0); it is on by default.

In most cases, the use of unwritten extent tracking does not affect performance and you should use the default to provide better security.

However, unwritten extent tracking can affect performance when both of the following are true:

A file has been preallocated
These preallocated extents are written for the first time with records smaller than 4MB

For optimal performance with CXFS when both of these conditions are true, it may be necessary to build filesystems with unwritten=0 (off).

Note: There are security issues with using unwritten=0. For more information, see the IRIX Admin: Disks and Filesystems.

Exclusive Write Tokens

For proper performance, CXFS should not obtain exclusive write tokens. Therefore, use the following guidelines:

Preallocate the file.
Set the size of the file to the maximum size and do not allow it to be changed, such as through truncation.
Do not append to the file. (That is, O_APPEND is not true on the open.)
Do not mark an extent as written.
Do not allow the application to do continual preallocation calls.

If the guidelines are followed and there are still performance problems, you may find useful information by running the icrash stat command before, halfway through, and after running the MPI job. For more information, see the icrash man page.

Avoid Excessive Filesystem Activity Caused by the `crontab` File

The default root crontab file contains the following entries (line breaks inserted here for readability):

0 5 * * *  find / -local -type f '(' -name core -o -name dead.letter ')' -atime +7 
-mtime +7 -exec rm -f '{}' ';'

 0 3 * * 0 if test -x /usr/etc/fsr; then (cd /usr/tmp; /usr/etc/fsr) fi

The first entry executes a find command that looks for and removes all files with the name core or dead.letter that have not been accessed in the past seven days.

The second entry executes an fsr command that improves the organization of mounted filesystems.

The find command will be run nightly on all local filesystems. Because CXFS filesystems are considered as local on all nodes in the cluster, the nodes may generate excessive filesystem activity if they try to access the same filesystems simultaneously. Therefore, you may wish use the following sequence to disable or modify the find crontab entries on all the CXFS administration nodes except for one:

Log in as root.
Define your editor of choice, such as vi:
# setenv EDITOR vi
Edit the crontab file:
# crontab -e
Comment out or delete the find line.

The fsr command can only be run on the metadata server, so it is not harmful to leave it in the crontab file for CXFS clients, but it will not be executed.

Use System Capacity Wisely

To avoid a loss of connectivity between the metadata server and the CXFS clients, do not oversubscribe the metadata server or the private network connecting the nodes in the cluster. Avoid unnecessary metadata traffic.

If the amount of free memory is insufficient, a node may experience delays in heartbeating and as a result will be kicked out of the CXFS membership. To observe the amount of free memory in your system, use the osview tool.

See also “Out of Logical Swap Space”.

Reboot Before Changing Node ID or Cluster ID

If you want redefine a node ID or the cluster ID, you must first reboot. The problem is that the kernel still has the old values, which prohibits a CXFS membership from forming. However, if you perform a reboot first, it will clear the original values and you can then redefine the node or cluster ID.

Therefore, if you use cdbreinit on a node to recreate the cluster database, you must reboot it before changing the node IDs or the cluster ID. See “Recreating the Cluster Database”.

Remove Unused Nodes

If a node is going to be down for a while, remove it from the cluster and the pool to avoid cluster database membership and CXFS membership quorum problems. See the following sections:

Restart CXFS after a Forced Shutdown

If you perform a forced shutdown on a node, you must restart CXFS on that node before it can return to the cluster. If you do this while the database still shows that the node is in a cluster and is activated, the node will restart the CXFS membership daemon. Following a forced shutdown, the node can be prevented from restarting the CXFS membership daemon when CXFS is restarted by stopping CXFS services. (A CXFS forced shutdown alone does not stop CXFS services. A forced shutdown stops only the kernel membership daemon. Stopping CXFS services disables the node in the cluster database.)

For example, enter the following on the local node you wish to start:

# /usr/cluster/bin/cmgr -p
cmgr> stop cx_services on node localnode
cmgr> admin cxfs_start

Remove Reset Lines

When reset is enabled, CXFS requires a reset successful message before it moves the metadata server. Therefore, if you have the reset capability enabled and you must remove the reset lines for some reason, you must also disable the reset capability. See “Modify a Node Definition with the GUI” in Chapter 10, or “Modify a Node with cmgr” in Chapter 11.

Note: The reset capability or I/O fencing is mandatory to ensure data integrity for all nodes. Clusters should have an odd number of server-capable nodes. See “Cluster Environment” in Chapter 1.

Appropriate Use of `xfs_repair`

CXFS filesystems are really clustered XFS filesystems; therefore, in case of a file system corruption, you can use the xfs_check and xfs_repair commands. However, you must first ensure that you have an actual case of data corruption and retain valuable metadata information by replaying the XFS logs before running xfs_repair.

Caution: If you run xfs_repair without first replaying the XFS logs, you may introduce data corruption.

You should only run xfs_repair in case of an actual filesystem corruption; forced filesystem shutdown messages do not necessarily imply that xfs_repair should be run. Following is an example of a message that does indicate an XFS file corruption:

XFS read error in file system metadata block 106412416

When a filesystem is forcibly shut down, the log is not empty -- it contains valuable metadata. You must replay it by mounting the filesystem. The log is only empty if the filesystem is unmounted cleanly (that is, not a forced shutdown, not a crash). You can use the following command line to see an example of the transactions captured in the log file:

# xfs_logprint -t device

If you run xfs_repair before mounting the filesystem, xfs_repair will delete all of this valuable metadata.

You should run xfs_ncheck and capture the output to a file before running xfs_repair. If running xfs_repair results in files being placed in the lost+found directory, the saved output from xfs_ncheck may help you to identify the original names of the files.

If you think you have a filesystem with real corruption, do the following:

Mount the device in order to replay the log:
# mount device any_mount_point
Unmount the filesystem:
# unmount device
Check the filesystem:
# xfs_check device
View the repairs that could be made, using xfs_repair in no-modify mode:
# xfs_repair -n device
Capture filesystem file name and inode pairs:
# xfs_ncheck device > xfs_ncheck.out
If you are certain that the repairs are appropriate, complete them:
# xfs_repair device

For more information, see the IRIX Admin: Disks and Filesystems.

Identify the Cluster Status

When you encounter a problem, identify the cluster status by answering the following questions:

Are the cluster daemons running? See “Verify that the Cluster Daemons are Running” in Chapter 9.
Is the cluster state consistent on each node? Run the clconf_info command on each CXFS administration node and compare.
Which nodes are in the CXFS kernel membership? See “Check Cluster Status with cluster_status” in Chapter 16, “Check Cluster Status with cmgr” in Chapter 16, and the following files:
- IRIX: /var/adm/SYSLOG
- Linux 64-bit: /var/log/messages
Which nodes are in the cluster database (fs2d) membership? See the /var/cluster/ha/log/fs2d_log files on each CXFS administration node.
Is the database consistent on all CXFS administration nodes? Determine this logging in to each administration node and examining the /var/cluster/ha/log/fs2d_log file and database checksum.
Log onto the various CXFS client nodes or use the GUI view area display with details showing to answer the following:
- Are the devices available on all nodes? Use the following:
  - The xvm command to show the physical volumes:
    xvm:cluster> show -v phys/
  - Is the client-only node in the cluster? Use the cxfs_info command.
  - List the contents of the /dev/cxvm directory with the ls command:
    # ls /dev/cxvm
  - Use the hinv command to display the hardware inventory. See “Physical Storage Tools”.
- Are the filesystems mounted on all nodes? Use mount and clconf_info commands.
- Which node is the metadata server for each filesystem? Use the cluster_status or clconf_info commands.
On the metadata server, use the clconf_info command.
Is the metadata server in the process of recovery? Use the IRIX icrash command to search for messages and look at the following files:
- IRIX: /var/adm/SYSLOG
- Linux 64-bit: /var/log/messages
See “Kernel Status Tools”. Messages such as the following indicate that recovery status:
- In process:
  Mar 13 11:31:02 1A:p2 unix: ALERT: CXFS Recovery: Cell 1: Client Cell 0 Died, Recovering </scratch/p9/local>
- Completed:
  Mar 13 11:31:04 5A:p2 unix: NOTICE: Signaling end of recovery cell 1

Are there any long running (>20 seconds) kernel messages? Use the icrash mesglist command to examine the situation. For example:

>> mesglist
Cell:7
THREAD ADDR         MSG ID TYPE CELL MESSAGE                          Time(Secs)
================== ======= ==== ==== ================================ ==========
0xa8000000d60a4800  5db537  Rcv    0                   I_dcvn_recall          0
0xa8000000d60a4800  5db541  Snt    0                 I_dsvn_notfound          0
0xa80000188fc51800  3b9b4f  Snt    0             I_dsxvn_inode_update  17:48:58

If filesystems are not mounting, do they appear online in XVM? You can use the following xvm command:
xvm:cluster> show vol/*

Locate the Problem

To locate the problem, do the following:

Examine the log files (see “Log Files”):
- Search for errors in all log files. See “Status in Log Files” in Chapter 16. Examine all messages within the timeframe in question.
- Trace errors to the source. Try to find an event that triggered the error.
Use the IRIX icrash commands. See “Kernel Status Tools”.
Use detailed information from the view area in the GUI to drill down to specific configuration information.
Run the Test Connectivity task in the GUI. See “Test Node Connectivity with the GUI” in Chapter 10.

Determine how the nodes of the cluster see the current CXFS kernel membership by entering the following command on each CXFS administration node:

# /usr/cluster/bin/clconf_info

This command displays the following fields:

Node name
Node ID
Status (up or down)
Age (not useful; ignore this field)
Incarnation (not useful; ignore this field)
Cell ID, which is a number that is dynamically allocated by the CXFS software when you add a node to a cluster (the user does not define a cell ID number). To see the cell ID, use the clconf_info command.

For example:

# /usr/cluster/bin/clconf_info
Membership since Fri Sep 10 08:57:36 1999
Node         NodeId     Status    Age   Incarnation     CellId
cxfs6           1001        UP      1             7          0
cxfs7           1002        UP      0             0          1
cxfs8           1003        UP      0             0          2
2 CXFS FileSystems
/dev/xvm/test1 on /mnts/test1  disabled  server   0 clients 
/dev/xvm/test2 on /mnts/test2  disabled  server   0 clients

Check the following file on each CXFS administration node to make sure the CXFS filesystems have been successfully mounted or unmounted:
- IRIX: /var/adm/SYSLOG
- Linux 64-bit: /var/log/messages
If a mount/unmount fails, the error will be logged and the operation will be retried after a short delay.
Use the sar system activity reporter to show the disks that are active. For example, the following example for IRIX will show the disks that are active, put the disk name at the end of the line, and poll every second for 10 seconds:
irix# sar -DF 1 10
For more information, see the sar man page.
Use the IRIX bufview filesystem buffer cache activity monitor to view the buffers that are in use. Within bufview, you can use the help subcommand to learn about available subcommands, such as the f subcommand to limit the display to only those with the specified flag. For example, to display the in-use (busy) buffers:
# bufview f Buffer flags to display bsy
For more information, see the bufview man page.
Use the IRIX icrash command. For more information, see the icrash man page.
Get a dump of the cluster database. You can extract such a dump with the following command:
# /usr/cluster/bin/cdbutil -c 'gettree #' > dumpfile

Common Problems

The following are common problems and solutions.

Node is Permanently Fenced

If you are unable to raise the fence on a node, it may be that the switch ports are unable to determine the WWPN. See “Hardware Changes and I/O Fencing” in Chapter 12.

Cannot Access Filesystem

If you cannot access a filesystem, check the following:

Is the filesystem enabled? Check the GUI, clconf_info command, and cluster_status commands.
Were there mount errors?

GUI Will Not Run

If the GUI will not run, check the following:

Is the license properly installed? See the following:
Are the cluster daemons running? See “Verify that the Cluster Daemons are Running” in Chapter 9.
Are the tcpmux and tcpmux/sgi_sysadm services enabled in the following files?
- IRIX: /etc/inetd.conf
- Linux 64-bit: /etc/xinetd.d/tcpmux and /etc/tcpmux.conf
Are the inetd or tcp wrappers interfering? This may be indicated by connection refused or login failed messages.
Are you connecting to a CXFS administration node? The cxfsmgr command can only be executed on a CXFS administration node. The GUI may be run from another system via the Web if you connect the GUI to a CXFS administration node.

Log Files Consume Too Much Disk Space

If the log files are consuming too much disk space, you should rotate them; see “Log File Management” in Chapter 12. You may also want to consider choosing a less-verbose log level; see the following:

Unable to Define a Node

If you are unable to define a node, it may be that there are hostname resolution problems. See “Hostname Resolution and Network Configuration Rules” in Chapter 5.

System is Hung

The following may cause the system to hang:

Overrun disk drives.
Heartbeat was lost. In this case, you will see a message that mentions withdrawl of node.
As a last resort, do a non-maskable interrupt (NMI) of the system and contact SGI. (The NMI tells the kernel to panic the node so that an image of memory is saved and can be analyzed later.) For more information, see the owner's guide for the node.

Make the following files available:
- System log file:
  - IRIX: /var/adm/SYSLOG
  - Linux 64-bit: /var/log/messages
- IRIX vmcore.#.comp
- IRIX unix.#

Node is Detected but Never Joins Membership

If a node is detected in the system log file but it never receives a Membership delivered message, it is likely that there is a network problem.

See “Configuring System Files” in Chapter 8.

Cell ID Count and “Membership Delivered” Messages

The Membership delivered messages in the system log file file include a list of cell IDs for nodes that are members in the new CXFS membership.

Following each cell ID is a number, the membership version , that indicates the number of times the membership has changed since the node joined the membership.

If the Membership delivered messages are appearing frequently in the system log file, it may indicate a network problem:

Nodes that are stable and remain in the membership will have a large membership version number.
Nodes that are having problems will be missing from the messages or have a small membership version number.

See “Configuring System Files” in Chapter 8.

You Cannot Log In

If you cannot log in to a CXFS administration node, you can use one of the following commands, assuming the node you are on is listed in the other nodes' .rhosts files:

# rsh hostname ksh -i
# rsh hostname csh -i

I/O Error in Filesystem

The following message indicates a problem (output lines wrapped here for readability):

ALERT: I/O error in filesystem ("/mnt") metadata dev 0xbd block 0x41df03 ("xlog_iodone")
ALERT:     b_error 0 b_bcount 32768 b_resid 0
NOTICE: xfs_force_shutdown(/mnt,0x2) called from line 966 of file ../fs/xfs/xfs_log.c. 
  Return address = 0xc0000000008626e8
ALERT: I/O Error Detected.  Shutting down filesystem: /mnt
ALERT: Please umount the filesystem, and rectify the problem(s)

You can fix this problem using xfs_repair only if there is no metadata in the XFS log. See “Appropriate Use of xfs_repair”, for the appropriate procedure.

I/O errors can also appear if the node is unable to access the storage. This can happen for several reasons:

The node has been physically disconnected from the SAN
A filesystem shutdown due to loss of membership
A filesystem shutdown due to lost of the metadata server
The node has been fenced out of the SAN

Cannot Mount Filesystems

If you are unable to raise the fence on a node, it may be that the switch ports are unable to determine the WWPN. See “Hardware Changes and I/O Fencing” in Chapter 12.

If you have defined filesystems and then rename your cluster (by deleting the old cluster and defining a new cluster), CXFS will not be able to mount the existing filesystems. This happens because the clustered XVM volume on which your CXFS filesystem resides is not accessible to the new cluster, and the volumes are therefore considered as foreign.

In order to mount the filesystem on the new cluster, you must use the XVM steal command to bring the clustered XVM volume into the domain of the new cluster. For more information, see the XVM Volume Manager Administrator's Guide .

GUI Displays Invalid Filesystems

If you create new slices on a previously sliced disk that have the same starting blocks as slices already existing on the disk, and if the old slices had filesystems, then the GUI will display those old filesystems even though they may not be valid.

Multiple `client_timeout` Values

A client_timeout value is set by the clconfd and cxfs_client daemons. The value depends on the order in which filesystems are mounted on the various nodes. The value adapts to help ensure that all filesystems get mounted in a timely manner. The value has no effect on the filesystem operation after it is mounted.

The value for client_timeout may differ among nodes, and therefore having multiple values is not really a problem.

The retry value is forced to be 0 and you cannot change it.

Caution: You should not attempt to change the client_timeout value. Improperly setting the values for client_timeout and retry could cause the mount command to keep waiting for a server and could delay the availability of the CXFS filesystems.

No HBA WWPNs are Detected

On most platforms, the cxfs_client software automatically detects the world wide port names (WWPNs) of any supported host bus adapters (HBAs) in the system that are connected to a switch that is configured in the cluster database. These HBAs will then be available for fencing.

However, if no WWPNs are detected, there will be messages logged to the following file:

IRIX: /var/adm/cxfs_client
Linux 64-bit: /var/log/cxfs_client

If no WWPNs are detected, you can manually specify the WWPNs in the /etc/fencing.conf fencing file for the Linux 64-bit platform. This method does not work if the WWPNs are partially discovered.

The fencing file is not used on the IRIX platform.

The fencing file enumerates the worldwide port name for all of the HBAs that will be used to mount a CXFS filesystem. There must be a line for the HBA WWPN as a 64-bit hexadecimal number.

Note: The WWPN is that of the HBA itself, not any of the devices that are visible to that HBA in the fabric.

If used, the fencing file must contain a simple list of WWPNs, one per line.

If you use the fencing file, you must update it whenever the HBA configuration changes, including the replacement of an HBA.

Do the following:

Set up the Brocade Fibre Channel switch and HBA.
Follow the Fibre Channel cable on the back of the node to determine the port to which it is connected in the Brocade Fibre Channel switch. Ports are numbered beginning with 0. (For example, if there are 8 ports, they will be numbered 0 through 7.)
Use the telnet command to connect to the Brocade Fibre Channel switch and log in as user admin (the password is password by default).

Execute the switchshow command to display the switches and their WWPN numbers.

For example:

brocade04:admin> switchshow
switchName:     brocade04
switchType:     2.4
switchState:    Online   
switchRole:     Principal
switchDomain:   6
switchId:       fffc06
switchWwn:      10:00:00:60:69:12:11:9e
switchBeacon:   OFF
port  0: sw  Online        F-Port  20:00:00:01:73:00:2c:0b
port  1: cu  Online        F-Port  21:00:00:e0:8b:02:36:49
port  2: cu  Online        F-Port  21:00:00:e0:8b:02:12:49
port  3: sw  Online        F-Port  20:00:00:01:73:00:2d:3e
port  4: cu  Online        F-Port  21:00:00:e0:8b:02:18:96
port  5: cu  Online        F-Port  21:00:00:e0:8b:00:90:8e
port  6: sw  Online        F-Port  20:00:00:01:73:00:3b:5f
port  7: sw  Online        F-Port  20:00:00:01:73:00:33:76
port  8: sw  Online        F-Port  21:00:00:e0:8b:01:d2:57
port  9: sw  Online        F-Port  21:00:00:e0:8b:01:0c:57
port 10: sw  Online        F-Port  20:08:00:a0:b8:0c:13:c9
port 11: sw  Online        F-Port  20:0a:00:a0:b8:0c:04:5a
port 12: sw  Online        F-Port  20:0c:00:a0:b8:0c:24:76
port 13: sw  Online        L-Port  1 public
port 14: sw  No_Light      
port 15: cu  Online        F-Port  21:00:00:e0:8b:00:42:d8

The WWPN is the hexadecimal string to the right of the port number. For example, the WWPN for port 0 is 2000000173002c0b (you must remove the colons from the WWPN reported in the switchshow output to produce the string to be used in the fencing file).

Create the /etc/fencing.conf fencing file and add the WWPN for the port determined in step 2. (Comment lines begin with #.)

For dual-ported HBAs, you must include the WWPNs of any ports that are used to access cluster disks. This may result in multiple WWPNs per HBA in the file; the numbers will probably differ by a single digit.

For example, if you determined that port 0 is the port connected to the Brocade Fibre Channel switch, you fencing file should contain the following:
# WWPN of the HBA installed on this system # 2000000173002c0b
After the node is added to the cluster, enable the fencing feature by using the CXFS GUI or cmgr command on a CXFS administration node.

XFS Internal Errors in System Log File

After a filesystem has been defined in CXFS, running mkfs on it (or using “Make Filesystems with the GUI” in Chapter 10) will cause XFS internal errors to appear in the system log file. For example (line breaks added for readability):

Aug 17 09:25:52 1A:yokohama-mds1 unix: ALERT: Filesystem "(NULL)": XFS internal error
xfs_mount_validate_sb(4) at line 237 of file ../fs/xfs/xfs_mount.c.
Caller 0xc000000000326ef4

Aug 17 09:14:52 6X:yokohama-mds1 clconfd[360]: < E clconf 11> CI_FAILURE, fsinfo_update(/dev/cxvm/work)
kernel returned 1010 (Filesystem is corrupted)

To avoid these errors, run mkfs before defining the filesystem in CXFS, or delete the CXFS filesystem before running mkfs. See “Delete a CXFS Filesystem with the GUI” in Chapter 10, and “Delete a CXFS Filesystem with cmgr” in Chapter 11.

Understanding Error Messages

This section describes some of the error messages you may see. In general, the example messages are listed first by type and then in alphabetical order, starting with the message identifier or text.

Sections are as follows:

Normal Messages

You can expect to see the following messages. They are normal and do not indicate a problem.

`NOTICE: Error reading mesg header 4 channel 1 cell 2`
	Error number 4 (`EINTR`) on `MEMBERSHIP` message channel (channel 1; channel 0 is the main channel for CXFS and XVM data) for connection with node 2. The `EINTR` indicates that this message channel is purposely being torn down and does not indicate an error in itself. (Any other error number is a real error that will cause the local node to declare the other node failed.) This is an informative message; no corrective action is required.
`NOTICE: Membership delivered. Membership contains 0(21) 1(12) cells`
	Node 0 and node 1 are in the CXFS membership; node 0 has been in the last 21 CXFS memberships, node 1 has been in the last 12. A membership is formed each time a node is added or deleted from the quorum; this number is also incremented when a membership is verified during the quorum change process, therefore the numbers are not sequential. This is an informative message; no corrective action is required.
`NOTICE: Resetting cells 0x4`
	The number here is a bitmask of node numbers on which a reset is being requested. In this case, `0x4` equates to node 2. This is an informative message; no corrective action is required.
`CI_FAILURE, Cell 1 Machine cxfs1: server has no information about a machine that has reset capabilities for this machine`
	A reset mechanism was not provided for this node. The node will not be automatically reset if it fails. If you do not have reset capability, this message can be ignored. Reset lines or I/O fencing is mandatory all nodes; clusters should have an odd number of server-capable nodes.
`NOTICE: Error reading mesg header 4 channel 1 cell 2`
	The `mesg header 4` text indicates that this is just an informative message.
`clconfd[16574]: <<CI> E config 2> CI_ERR_NOTFOUND, Error reading CMS status for machine tango, assuming machine is FailSafe-disabled in cluster twango.`
	This indicates that the cluster is CXFS only and that you are not using FailSafe.
`CI_CLCONFERR_INIT in ep_name() not binding socket`
	This message appears before the daemons start.
`clconfd[16574]: <<CI> E clconf 0> CI_CLCONFERR_INIT, in ep_name(): not binding socket`
	This `clconfd` message appears when daemons are starting up.
`date <I0 clconfd clconf 610:0 clconfd_client.c:84> client registration: clconfinfo, id 9119`, `date<I0 clconfd clconf 610:0 clconfd_service.c:781> sending reply configuration and membership msg to client: clconfinfo, id 9119`, `date <I0 clconfd clconf 610:0 clconfd_client.c:96> client un-registration: clconfinfo, id 9119`
	These messages are issued if you run the `clcon_info` command. The `clconf_info` command first registers as a CXFS client with `clconfd`; it then gets a reply message to its request for configuration and membership status; finally, it unregisters when it is done.
`date <I0 clconfd clconf 610:0 clconfd_service.c:781 sending reply configuration and membership msg to client: cad, id 602`
	This message indicates that the `cad` daemon is polling `clconfd` for status regularly. `cad` does not register and unregister each time like `clconf_info` because it is a daemon and it does not exit after each request. You will see register/unregister messages for `cad` only when `cad` or `clconfd` restarts.
`dcvn_import_force: error 1502 from invk_dsvn_obtain_exist`
	This is a normal message sent during the recovery process.

`clconfd` Daemon Death

If the clconfd daemon exits immediately after it starts up, it means that the CXFS license has not been properly installed. For information about the associated error message, see “License Error”.

You must install the license on each node before you can use CXFS. If you increase the number of CPUs in your system, you may need a new license. See Chapter 6, “IRIX CXFS Installation”.

Out of Logical Swap Space

The following example system log file message indicates an oversubscribed system:

ALERT: inetd [164] - out of logical swap space during fork while
allocating uarea - see swap(1M)
Availsmem 8207 availrmem 427 rlx freemem 10, real freemem 9

See “Use System Capacity Wisely”.

The cluster daemons could also be leaking memory in this case. You may need to restart them:

On administration nodes:
# /etc/init.d/cxfs_cluster restart

On client-only nodes:

# /etc/init.d/cxfs_client stop
# /etc/init.d/cxfs_client start

No Cluster Name ID Error

For example:

Mar  1 15:06:18 5A:nt-test-07 unix: NOTICE: Physvol (name cip4) has no 
CLUSTER name id: set to ""

This message means the following:

The disk labeled as an XVM physvol was probably labeled under IRIX 6.5.6f and the system was subsequently upgraded to a newer version that uses a new version of XVM label format. This does not indicate a problem.
The cluster name had not yet been set when XVM encountered these disks with an XVM cluster physvol label on them. This is normal output when XVM performs the initial scan of the disk inventory, before node/cluster initialization has completed on this host.

The message indicates that XVM sees a disk with an XVM cluster physvol label, but that this node has not yet joined a CXFS membership; therefore, the cluster name is empty ("").

When a node or cluster initializes, XVM rescans the disk inventory, searching for XVM cluster physvol labels. At that point, the cluster name should be set for this host. An empty cluster name after node/cluster initialization indicates a problem with cluster initialization.

The first time any configuration change is made to any XVM element on this disk, the label will be updated and converted to the new label format, and these notices will go away.

For more information about XVM, see the XVM Volume Manager Administrator's Guide.

Lost CXFS Membership

The following message in the system log file indicates a kernel-triggered revocation of CXFS membership:

Membership lost - withdrawing from cluster

You must actively allow CXFS membership for the local node in this situation. See “Allow Membership of the Local Node with the GUI” in Chapter 10, or “Allow Membership of the Local Node with cmgr” in Chapter 11.

License Error

If you see the following message in the /var/cluster/ha/log/clconf_hostname logfile, it means that the CXFS license was not properly installed:

CXFS not properly licensed for this host.  Run
               '/usr/cluster/bin/cxfslicense -d'
       for detailed failure information.

If you do not have the CXFS license properly installed, you will see an error on the console when trying to run CXFS. For example, on an administration node:

Cluster services:CXFS not properly licensed for this host.  Run
        '/usr/cluster/bin/cxfslicense -d'
for detailed failure information.  After fixing the
license, please run '/etc/init.d/cxfs_cluster restart'.

An error such as the following example will appear in the system log file:

Mar  4 12:58:05 6X:typhoon-q32 crsd[533]: <<CI> N crs 0> Crsd restarted.
Mar  4 12:58:05 6X:typhoon-q32 clconfd[537]: <<CI> N clconf 0> 
Mar  4 12:58:05 5B:typhoon-q32 CLCONFD failed the CXFS license check.Use the 
Mar  4 12:58:05 5B:typhoon-q32    '/usr/cluster/bin/cxfslicense -d'
Mar  4 12:58:05 5B:typhoon-q32 command to diagnose the license problem.

If the clconfd daemon dies right after it starts up, this error is present.

You must install the license on each node before you can use CXFS. See Chapter 6, “IRIX CXFS Installation”.

IP Address Error

If you have conflicting cluster ID numbers at your site, you will see errors such as the following:

WARNING: mtcp ignoring  alive message from 1 with wrong ip addr 128.162.89.34
WARNING: mtcp ignoring  alive message from 0 with wrong ip addr 128.162.89.33

A cluster ID number must be unique. To solve this problem, make the cluster ID numbers unique.

This error can occur if you redefine the cluster configuration and start CXFS services while some nodes have stale information from a previous configuration.

To solve the problem, first try the steps in “Eliminate a Residual Cluster”. If that does not work, reboot the nodes that have stale information. You can determine which nodes have stale information as follows: stale nodes will complain about all of the nodes, but the up-to-date nodes will complain only about the stale nodes. The /var/cluster/ha/log/clconfd_ log file on the stale nodes will also show error messages about SGI_CMS_CONFIG_ID failures.

If there are too many error messages to recognize the stale nodes, reboot every node.

System Log File Errors

CXFS logs both normal operations and critical errors to the system log file, as well as to individual log files for each log group.

The system log files are:

IRIX: /var/adm/SYSLOG
Linux 64-bit: /var/log/messages

In general, errors in the system log file file take the following form:

timestamp priority_&_facility : hostname process[ID]: <internal_info> CODE message_text

For example:

Sep  7 11:12:59 6X:cxfs0 cli[5830]: < E clconf 0> CI_IPCERR_NOSERVER, clconf
ipc: ipcclnt_connect() failed, file /var/cluster/ha/comm/clconfd-ipc_cxfs0

Table 18-1 shows the parts of the preceding message.

Table 18-1. System Log File Error Message Format

Content	Part	Meaning
`Sep 7 11:12:59`	Time Stamp	September 7 at 11:12 AM.
`6X`	Facility and level	6X indicates an informational message. See `syslogd` and the file `/usr/include/sys/syslog.h`.
`cxfs0`	Node name	The node whose logical name is `cxfs0` is the node on which the process is running.
`cli[5830]`	Process[ID]	The process sending the message is `cli` and its process ID number is `5830`.
`<CI>E clconf 0`	Internal information: message source, logging subsystem, and thread ID	The message is from the cluster infrastructure (CI). `E` indicates that it is an error. The `clconf` command is the logging subsystem. `0` indicates that it is not multithreaded.
`CI_IPCERR_NOSERVER, clconf ipc`	Internal error code	Information about the type of message; in this case, a message indicating that the server is missing. No error code is printed if it is a normal message.
`ipcclnt_connect() failed, file /var/cluster/ha/comm/clconfd-ipc_cxfs0`	Message text	A connection failed for the `clconfd-ipc_cxfs0` file.

The following sections present only the message identifiers and text.

`cli` Error Messages

For all cli messages, only the last message from the command (which begins with CLI private command failed) is meaningful. You can ignore all other cli messages.

The following are example errors from the cli daemon.

`CI_ERR_INVAL, CLI private command: failed (Machine (cxfs0) exists.)`
	You tried to create a new node definition with logical name `cxfs0`; however, that node name already exists in the cluster database. Choose a different name.
`CI_ERR_INVAL, CLI private command: failed (IP address (128.162.89.33) specified for control network is cxfs0 is assigned to control network of machine (cxfs0).)`
	You specified the same IP address for two different control networks of node `cxfs0`. Use a different IP address.
`CI_FAILURE, CLI private command: failed (Unable to validate hostname of machine (cxfs0) being modified.)`
	The DNS resolution of the `cxfs0` name failed. To solve this problem, add an entry for `cxfs0` in `/etc/hosts` on all nodes.
`CI_IPCERR_NOPULSE, CLI private command: failed (Cluster state is UNKNOWN.)`
	The cluster state is `UNKNOWN` and the command could not complete. This is a transient error. However, if it persists, stop and restart the cluster daemons; see “Stopping and Restarting Cluster Infrastructure Daemons”.

`clconfd` Error Messages

The following errors are sent by the clconfd daemon.

`CI_CONFERR_NOTFOUND, Could not access root node.`
	The cluster database is either non-existent or corrupted, or the database daemons are not responding. Check that the database does exist. If you get an error or the dump is empty, re-create the database; for more information, see “Clearing the Cluster Database”. If the database exists, restart the cluster daemons; see “Stopping and Restarting Cluster Infrastructure Daemons”.
`CI_ERR_NOTFOUND, Could not get Cellular status for local machine (cxfs1)`
	The database is corrupted or cannot be accessed. Same actions as above.
`CI_FAILURE, Call to open cdb for logging configuration when it is already open.`
	This indicates a software problem requiring you to restart the daemons; see “Stopping and Restarting Cluster Infrastructure Daemons”.
`CI_FAILURE, Cell 1 Machine cxfs1: server has no information about a machine that has reset capabilities for this machine`
	A reset mechanism was not provided for this node. The node will not be automatically reset if it fails. To ensure proper failure handling, use the GUI or the `cmgr` command to modify the node's definition and add reset information. See “Define a Node with the GUI” in Chapter 10, or “Modify a Node with cmgr” in Chapter 11.
`CI_FAILURE, CMD(/sbin/umount -k /dev/xvm/bob1): exited with status 1 (0x1)`
	An error occurred when trying to unmount the `/dev/xvm/bob1` filesystem. Messages from the `umount` command are usually issued just before this message and provide more information about the reason for the failure.
`CI_FAILURE, CMD(/sbin/clmount -o 'server_list=(cxfs0,cxfs1)' /dev/xvm/bob2 /bob2): exited with status 1 (0x1)`
	An error occurred when trying to mount the `/dev/xvm/bob2` filesystem. Messages from the `mount` command are usually issued just before this message and provide more information about the reason of the failure.
`CI_FAILURE, CMD(/sbin/clmount -o 'server_list=(cxfs2,cxfs0)' /dev/xvm/stripe4 /xvm/stripe4): exited with status 1 (0x1)`
	You have tried to mount a filesystem without first running `mkfs`. You must use `mkfs` to construct the filesystem before mounting it. For more information, see the `mkfs` man page.
`CI_FAILURE, Could not write newincarnation number to CDB, error = 9.`
	There was a problem accessing the cluster database. Retry the operation. If the error persists, stop and restart the cluster daemons; see “Stopping and Restarting Cluster Infrastructure Daemons”. If the problem persists, clear the database, reboot, and re-create the database. See “Clearing the Cluster Database”.
`CI_FAILURE, Exiting, monitoring agent should revive me.`
	The daemon requires fresh data. It will be automatically restarted.
`CI_FAILURE, No node for client (3) of filesystem (/dev/xvm/bob1) on (/bob1).`
	(There may be many repetitions of this message.) The filesystem appears to still be mounted on a CXFS client node that is no longer in the cluster database. If you can identify the CXFS client node that used to be in the cluster and still has the filesystem mounted, reboot that node. Otherwise, reboot the entire cluster.
`CI_FAILURE, No node for server (-1) of filesystem (/dev/xvm/bob1) on (/bob1).`
	(There may be many repetitions of this message.) The filesystem appears to still be mounted on a server node that is no longer in the cluster database. If you can identify the server node that used to be in the cluster and still has the filesystem mounted, reboot that node. Otherwise, reboot the entire cluster.
`CI_ FAILURE, Node cxfs0: SGI_CMS_HOST_ID(tcp,128.162.8 >9.33) error 149 (Operation already in progress)`
	The kernel already had this information; you can ignore this message.
`CI_FAILURE, Unregistered from crs.`
	The `clconfd` daemon is no longer connected to the reset daemon and will not be able to handle resets of failed nodes. There is no corrective action.
`CI_IPCERR_NOSERVER, Crs_register failed,will retry later. Resetting not possible yet.`
	The `clconfd` daemon cannot connect to the reset daemon. It will not be able to handle resets of failed nodes. Check the reset daemon's log file (`/var/cluster/ha/log/crsd_`) for more error messages.
`Clconfd is out of membership, will restart after notifying clients.`
	The `clconfd` daemon does not have enough information about the current state of the cluster. It will exit and be automatically restarted with fresh data.
`CMD(/sbin/clmount -o 'server_list=(cxfs2,cxfs0)' /dev/xvm/stripe4 /xvm/stripe4): /dev/xvm/stripe4: Invalid argument`
	You have tried to mount a filesystem without first running `mkfs`. You must use `mkfs` to construct the filesystem before mounting it. For more information, see the `mkfs` man page.
`CMD(/sbin/clmount -o 'server_list=(cxfs0,cxfs1)' /dev/xvm/bob2 /bob2): /dev/xvm/bob2: Invalid argumentSep 9 14:12:43 6X:cxfs0 clconfd[345]: < E clconf 3> CI_FAILURE, CMD(/sbin/clmount -o 'server_list=(cxfs0,cxfs1)' /dev/xvm/bob2 /bob2): exited with status 1 (0x1)`
	The first message comes from the `clmount` command (the internal CXFS mount command) and explains the error (an invalid argument was issued). The second message says that the mount failed.

`crsd` Error Messages

The following errors are sent by the crsd daemon.

`CI_ERR_NOTFOUND, No logging entries found for group crsd, no logging will take place - Database entry #global#logging#crsd not found.`
	No `crsd` logging definition was found in the cluster database. This can happen if you start cluster processes without creating the database. See “Recreating the Cluster Database”.
`CI_ERR_RETRY, Could not find machine listing.`
	The `crsd` daemon could not find the local node in the cluster database. You can ignore this message if the local node definition has not yet been created.
`CI_ERR_SYS:125, bind() failed.`
	The `sgi-crsd` port number in the `/etc/services` file is not unique, or there is no `sgi-crsd` entry in the file. For information about adding this entry, see “/etc/services on CXFS Administration Nodes ” in Chapter 8.
`CI_FAILURE, Entry for sgi-crsd is missing in /etc/services.`
	The `sgi-crsd` entry is missing from the `/etc/services` file. For information about adding this entry, see “/etc/services on CXFS Administration Nodes ” in Chapter 8.
`CI_FAILURE, Initialization failed, exiting.`
	A sequence of messages will be ended with this message; see the messages prior to this one in order to determine the cause of the failure.

`cmond` Error Messages

The following errors are sent by the cmond daemon.

Could not register for notification.cdb_error = 7

An error number of 7 indicates that the cluster database was not initialized when the cluster process was started.

This may be caused if you execute the cdbreinit on one CXFS administration node while some other CXFS administration nodes in the pool are still running fs2d and already have the node listed in the database.

Do the following:

Execute the following command on the nodes that show the error:
# /usr/cluster/bin/cdb-init-std-nodes
This command will recreate the missing nodes without disrupting the rest of the database.
If the error persists, force the daemons to restart by executing the following command:
# /etc/init.d/cxfs_cluster restart
Verify that cmond is restarted.
If the error persists, reinitialize the database on just the node that is having problems.
If the error still persists, reinitialize all nodes in the cluster.

See “Recreating the Cluster Database”.

Process clconfd:343 of group cluster_cx exited, status = 3.

The clconfd process exited with status 3, meaning that the process will not be restarted by cmond. No corrective action is needed.

Process crsd:1790 of group cluster_control exited, status = 127

The crsd process exited with an error (nonzero) status. Look at the corresponding daemon logs for error messages.

`cxfs_client` Error Messages

The following errors are sent by the cxfs_client daemon.

`cxfs_client: cis_get_hba_wwns warning: fencing configuration file "fencing.conf" not found`
	The fencing file was not found, therefore the fencing configuration will not be updated on the server.
`cxfs_client:op_failed ERROR: Mount failed for concat0`
	A filesystem mount has failed and will be retried.

`fs2d` Error Messages

The following errors are sent by the fs2d daemon.

`Error 9 writing CDB info attribute for node #cluster#elaine#machines#cxfs2#Cellular#status`
	An internal error occurred when writing to the cluster database. Retry the operation. If the error persists, stop and restart the cluster daemons; see “Stopping and Restarting Cluster Infrastructure Daemons”. If the problem persists, clear the database, reboot, and re-create the database. See “Clearing the Cluster Database”.
`Error 9 writing CDB string value for node #cluster#elaine#machines#cxfs2#Cellular#status`
	An internal error occurred when writing to the cluster database. Retry the operation. If the error persists, stop and restart the cluster daemons; see “Stopping and Restarting Cluster Infrastructure Daemons”. If the problem persists, clear the database, reboot, and re-create the database. See “Clearing the Cluster Database”.
`Failed to update CDB for node #cluster#elaine#Cellular#FileSystems#fs1#FSStatus`
	An internal error occurred when writing to the cluster database. Retry the operation. If the error persists, stop and restart the cluster daemons; see “Stopping and Restarting Cluster Infrastructure Daemons”. If the problem persists, clear the database, reboot, and re-create the database. See “Clearing the Cluster Database”.
`Failed to update CDB for node #cluster#elaine#machines#cxfs2#Cellular#status`
	An internal error occurred when writing to the cluster database. Retry the operation. If the error persists, stop and restart the cluster daemons; see “Stopping and Restarting Cluster Infrastructure Daemons”. If the problem persists, clear the database, reboot, and re-create the database. See “Clearing the Cluster Database”.
`Machine 101 machine_sync failed with lock_timeout error`
	The `fs2d` daemon was not able to synchronize the cluster database and the `sync` process timed out. This operation will be retried automatically by `fs2d`.
`ALERT: CXFS Recovery: Cell 0: Server Cell 2 Died, Recovering`
	The server (cell 2) died and the system is now recovering a filesystem.

General Messages

`CI_CONFERR_NOTFOUND, Logging configuration error: could not read cluster database /var/cluster/cdb/cdb.db, cdb error = 3.`
	The cluster database has not been initialized. See “Recreating the Cluster Database”.
`WARNING: Error receiving messages from cell 2 tcpchannel 1`
	There has been an error on the CXFS membership channel (channel 1; channel 0 is the main message channel for CXFS and XVM data). This may be a result of tearing down the channel or may be an error of the node (node with an ID of `2` in this case). There is no corrective action.

Log File Error Messages

CXFS maintains logs for each of the CXFS daemons. For information about customizing these logs, see “Set Log Configuration with the GUI” in Chapter 10.

Log file messages take the following form:

daemon_log timestamp internal_process: message_text

For example:

cad_log:Thu Sep  2 17:25:06.092  cclconf_poll_clconfd: clconf_poll failed with error CI_IPCERR_NOPULSE

Table 18-2, shows the parts in the preceding message.

Table 18-2. Log File Error Message Format

Content	Part	Meaning
`cad_log`	Daemon identifier	The message pertains to the `cad` daemon
`Sep 2 17:25:06.092`	Time stamp and process ID	September 2 at 5:25 PM, process ID 92.
`cclconf_poll_clconfd`	Internal process information	Internal process information
`clconf_poll failed with error CI_IPCERR_NOPULSE`	Message text	The `clconfd` daemon could not be contacted to get an update on the cluster's status.

`cad` Messages

The following are examples of messages from /var/cluster/ha/log/cad_log:

ccacdb_cam_open: failed to open connection to CAM server error 4

Internal message that can be ignored because the cad operation is automatically retried.

ccamail_cam_open: failed to open connection to CAM server error 4

Internal message that can be ignored because the cad operation is automatically retried.

ccicdb_cam_open: failed to open connection to CAM server error 4

Internal message that can be ignored because the cad operation is automatically retried.

cclconf_cam_open: failed to open connection to CAM server error 4

Internal message that can be ignored because the cad operation is automatically retried.

cclconf_poll_clconfd: clconf_poll failed with error CI_IPCERR_NOCONN

The clconfd daemon is not running or is not responding to external requests. If the error persists, stop and restart the cluster daemons; see “Stopping and Restarting Cluster Infrastructure Daemons”.

cclconf_poll_clconfd: clconf_poll failed with error CI_IPCERR_NOPULSE

The clconfd daemon could not be contacted to get an update on the cluster's status. If the error persists, stop and restart the cluster daemons; see “Stopping and Restarting Cluster Infrastructure Daemons”.

cclconf_poll_clconfd: clconf_poll failed with error CI_CLCONFERR_LONELY

The clconfd daemon does not have enough information to provide an accurate status of the cluster. It will automatically restart with fresh data and resume its service.

csrm_cam_open: failed to open connection to CAM server error 4

Internal message that can be ignored because the cad operation is automatically retried.

Could not execute notification cmd. system() failed. Error: No child processes

No mail message was sent because cad could not fork processes. Stop and restart the cluster daemons; see “Stopping and Restarting Cluster Infrastructure Daemons”.

error 3 sending event notification to client 0x000000021010f078

GUI process exited without cleaning up.

error 8 sending event notification to client 0x000000031010f138

GUI process exited without cleaning up.

`cli` Messages

The following are examples of messages from /var/cluster/ha/log/cli_ hostname:

`CI_CONFERR_NOTFOUND, No machines found in the CDB.`
	The local node is not defined in the cluster database.
`CI_ERR_INVAL, Cluster (bob) not defined`
	The cluster called `bob` is not present in the cluster database.
`CI_ERR_INVAL, CLI private command: failed (Cluster (bob) not defined)`
	The cluster called `bob` is not present in the cluster database.
`CI_IPCERR_AGAIN, ipcclnt_connect(): file /var/cluster/ha/comm/clconfd-ipc_cxfs0 lock failed - Permission denied`
	The underlying command line interface (CLI) was invoked by a login other than `root`. You should only use `cmgr` when you are logged in as `root`.
`CI_IPCERR_NOPULSE, CLI private command: failed (Cluster state is UNKNOWN.)`
	The cluster state could not be determined. Check if the `clconfd` daemon is running.
`CI_IPCERR_NOPULSE, ipcclnt_pulse_internal(): server failed to pulse`
	The cluster state could not be determined. Check if the `clconfd` daemon is running.
`CI_IPCERR_NOSERVER, clconf ipc: ipcclnt_connect() failed, file /var/cluster/ha/comm/clconfd-ipc_cxfs0`
	The local node (`cxfs0`) is not defined in the cluster database.
`CI_IPCERR_NOSERVER, Connection file /var/cluster/ha/comm/clconfd-ipc_cxfs0 not present.`
	The local node (`cxfs0`) is not defined in the cluster database.

`crsd` Errors

The following are examples of messages from /var/cluster/ha/log/crsd_hostname:

`CI_CONFERR_INVAL, Nodeid -1 is invalid.`, `I_CONFERR_INVAL, Error from ci_security_init().`, `CI_ERR_SYS:125, bind() failed.`, `CI_ERR_SYS:125, Initialization failed, exiting.`, `CI_ERR_NOTFOUND, Nodeid does not have a value.`, `CI_CONFERR_INVAL, Nodeid -1 is invalid.`
	For each of these messages, either the node ID was not provided in the node definition or the cluster processes were not running in that node when node definition was created in the cluster database. This is a warning that optional information is not available when expected.
`CI_ERR_NOTFOUND, SystemController information for node cxfs2 not found, requests will be ignored.`
	System controller information (optional information) was not provided for node `cxfs2`. Provide system controller information for node `cxfs2` by modifying node definition. This is a warning that optional information is not available when expected. Without this information, the node will not be reset if it fails, which might prevent the cluster from properly recovering from the failure.
`CI_ERR_NOTFOUND, SystemController information for node cxfs0 not found, requests will be ignored.`
	The owner node specified in the node definition for the node with a node ID of 101 has not been defined. You must define the owner node.
`CI_CRSERR_NOTFOUND, Reset request 0x10087d48 received for node 101, but its owner node does not exist.`
	The owner node specified in the node definition for the node with a node ID of 101 has not been defined. You must define the owner node.

`fs2d` Errors

The following are examples of messages from /var/cluster/ha/log/fs2d_hostname:

`Failed to copy global CDB to node cxfs1 (1), error 4`
	There are communication problems between the local node and node `cxfs2`. Check the control networks of the two nodes.
`Communication failure send new quorum to machine cxfs2 (102) (error 6003)`
	There are communication problems between the local node and node `cxfs2`. Check the control networks of the two nodes.
`Failed to copy CDB transaction to node cxfs2 (1)`
	There are communication problems between the local node and node `cxfs2`. Check the control networks of the two nodes.
`Outgoing RPC to hostname : NULL`
	If you see this message, check your Remote Procedure Call (RPC) setup. For more information, see the `rpcinfo`, `rpcinfo`, and `portmap` man pages.

Corrective Actions

This section covers the following corrective actions:

Restarting CXFS Services

If CXFS services to do not restart after a reboot, it may be that the start flag was turned off by using the stop function of the GUI or the cmgr command. In this case, issuing a /etc/init.d/cxfs_cluster start will not restart the services. You must start CXFS services. If you use the GUI or cmgr to restart the services, the configuration will be set so that future reboots will also restart CXFS services.

For information, see “Start CXFS Services with the GUI” in Chapter 10, or “Start CXFS Services with cmgr” in Chapter 11.

Clearing the Cluster Database

To clear the cluster database on all of the administration nodes of the cluster, do the following, completing each step on each administration node before moving to the next step:

Caution: This procedure deletes all configuration information.

Enter the following on all administration nodes:
# /usr/cluster/bin/cmgr -c 'admin cxfs_stop'
Enter the following on all administration nodes:
# /etc/init.d/cxfs_cluster stop
Caution: Complete steps 1 and 2 on each node before moving to step 3 for any node.
Enter the following on all administration nodes:
# /usr/cluster/bin/cdbreinit
See also “Reboot Before Changing Node ID or Cluster ID”.
Enter the following on all administration nodes:
# /etc/init.d/cxfs_cluster start

Enter the following on all administration nodes:

# /usr/cluster/bin/cmgr -c 'admin cxfs_start'

See “Eliminate a Residual Cluster”, to get rid of possible stale cluster configuration in the kernel. If needed, reboot the nodes.

Rebooting

Enter the following individually on every node to reboot the cluster:

# reboot

For information about nodes running operating systems other than IRIX or Linux 64-bit, see the CXFS MultiOS Client-Only Guide for SGI InfiniteStorage.

If you want CXFS services to restart whenever the node is rebooted, use the GUI or cmgr to start CXFS services. For information, see “Start CXFS Services with the GUI” in Chapter 10, and “Start CXFS Services with cmgr” in Chapter 11.

The following are situations that may require a rebooting:

If some CXFS clients are unable to unmount a filesystem because of a busy vnode and a reset of the node does not fix the problem, you may need to reboot every node in the cluster
If there is no recovery activity within 10 minutes, you may need to reboot the node

Recovering a Two-Node Cluster

Suppose the following:

You have cluster named clusterA that has two server-capable nodes and there is no CXFS tiebreaker:
- node1
- node2
node1 goes down and will remain down for a while.

node2 recovers and clusterA remains up.

Note: An existing cluster can drop down to 50% of the remaining server-capable nodes after the initial CXFS kernel membership is formed. For more information, see “CXFS Kernel Membership, Quorum, and Tiebreaker” in Appendix B.

node2 goes down and therefore clusterA fails.
node2 comes back up. However, clusterA cannot form because the initialization of a cluster requires either:
- More than 50% of the server-capable nodes
- 50% of the server-capable nodes, one of which is the CXFS tiebreaker

To allow node2 to form a cluster by itself, you must do the following:

Set node2 to be the CXFS tiebreaker node, using either the GUI or cmgr:
- See “Set Tiebreaker Node with the GUI” in Chapter 10.
- See “Set the Tiebreaker Node with cmgr” in Chapter 11.
Revoke the CXFS kernel membership of node2:
- See “Revoke Membership of the Local Node with the GUI” in Chapter 10.
- In cmgr, enter:
  cmgr> admin cxfs_stop
  See “Revoke Membership of the Local Node with cmgr” in Chapter 11.
Allow CXFS kernel membership of node2:
- See “Allow Membership of the Local Node with the GUI” in Chapter 10.
- In cmgr, enter:
  cmgr> admin cxfs_start
  See “Allow Membership of the Local Node with cmgr” in Chapter 11.

Unset the CXFS tiebreaker node capability.

Caution: If the CXFS tiebreaker node in a cluster with two server-capable nodes fails or if the administrator stops CXFS services, the other node will do a forced shutdown, which unmounts all CXFS filesystems. The reset capability or I/O fencing is mandatory to ensure data integrity for all nodes. Clusters should have an odd number of server-capable nodes.

Use either the GUI or cmgr:

The cluster will attempt to communicate with the node1 because it is still configured in the cluster, even though it is down. Therefore, it may take some time for the CXFS kernel membership to form and for filesystems to mount.

Rebooting without Rejoining the Cluster

The cluster flag to chkconfig controls the other cluster administration daemons and the replicated cluster database. If it is turned off, the database daemons will not be started at the next reboot and the local copy of the database will not be updated if you make changes to the cluster configuration on the other nodes. This could cause problems later, especially if a majority of nodes are not running the database daemons.

If the cluster daemons are causing serious trouble and prevent the machine from booting, you can recover the node by booting in single-user mode, turning off the cluster flag, and booting in multiuser mode:

IRIX:

irix# init 1
irix# /etc/chkconfig cluster off
irix# init 2

Linux 64-bit:

[root@linux64 root]# init 1
[root@linux64 root]# /bin/chkconfig cluster off
[root@linux64 root]# init 3

Stopping and Restarting Cluster Infrastructure Daemons

To stop and restart cluster infrastructure daemons, enter the following:

On administration nodes:

# /etc/init.d/cxfs_cluster stop 
# /etc/init.d/cxfs_cluster start

On client-only nodes:

# /etc/init.d/cxfs_client stop 
# /etc/init.d/cxfs_client start

These commands affect the cluster infrastructure daemons only.

Caution: When the cluster infrastructure daemons are stopped, the node will not receive database updates and will not update the kernel configuration. This can have very unpleasant side effects. Under most circumstances, the infrastructure daemons should remain running at all times. Use these commands only as directed.

See also “Restarting CXFS Services”. For general information about the daemons, see “Daemons” in Appendix A.

Recreating the Cluster Database

To recreate the initial cluster database, do the following:

Ensure that the database membership quorum is held by nodes with a good database, in order to avoid propagating a bad database.
Enter the following:
# /usr/cluster/bin/cdbreinit

Verifying Connectivity in a Multicast Environment

To verify general connectivity in a multicast environment, you can execute a ping command on the 224.0.0.1 IP address.

To verify the CXFS heartbeat, use the 224.0.0.250 IP address, which is the default CXFS heartbeat multicast address (because it is the default, this address does not have to appear in the /etc/hosts file).

Note: A node is capable of responding only when the administration daemons (fs2d, cmond, cad, and crsd) or the cxfs_client daemon is running.

For example, to see the response for two packets sent from IRIX IP address 163.154.17.49 to the multicast address for CXFS heartbeat and ignore loopback, enter the following:

irixnodeA# ping -c 2 -I 163.154.17.49 -L 224.0.0.250
PING 224.0.0.250 (224.0.0.250): 56 data bytes
64 bytes from 163.154.17.140: icmp_seq=0 ttl=64 time=1.146 ms
64 bytes from 163.154.17.55: icmp_seq=0 DUP! ttl=255 time=1.460 ms
64 bytes from 163.154.17.52: icmp_seq=0 DUP! ttl=255 time=4.607 ms
64 bytes from 163.154.17.50: icmp_seq=0 DUP! ttl=255 time=4.942 ms
64 bytes from 163.154.17.140: icmp_seq=1 ttl=64 time=2.692 ms

----224.0.0.250 PING Statistics----
2 packets transmitted, 2 packets received, +3 duplicates, 0.0% packet
loss
round-trip min/avg/max = 1.146/2.969/4.942 ms

The above output indicates that there is a response from the following addresses:

163.154.17.140
163.154.17.55
163.154.17.52
163.154.17.50

To override the default address, you can use the -c and -m options or make the name cluster_mcast resolvable on all nodes (such as in the /etc/hosts file). For more information, see the cxfs_client man page.

Reporting Problems to SGI

When reporting a problem about a CXFS node to SGI, you should retain the information discussed in this section, depending upon the circumstances you experience.

Reporting IRIX Problems

Retain the following information for IRIX nodes:

If a panic has occurred on an IRIX node, retain the system core files in /var/adm/crash, including the following:
analysis.number unix.number vmcore.number.comp

For any type of problem, run the /usr/cluster/bin/cxfsdump utility on an IRIX node and retain the output. You can run this utility immediately after noticing a problem. The cxfsdump utility attempts to collect information from all nodes in the cluster by using the rsh command, including the following:

Information from the following files:

/var/adm/SYSLOG
/var/adm/cxfs_client         (for client-only nodes)
/var/cluster/ha/log/*        (for administration nodes)
/etc/failover.conf
/var/sysgen/stune
/etc/hosts

Output from the following commands:

/usr/cluster/bin/cdbutil gettree '#'
/usr/sbin/versions -n
/usr/sbin/systune
/sbin/hinv -vm
/sbin/xvm show -v phys
/sbin/xvm show -top -v vol
/usr/sbin/scsifo -d
/usr/etc/netstat -ia

Reporting Linux 64-bit Problems

Retain the following information for Linux 64-bit nodes:

The kernel you are running:
[root@linux64 root]# uname -a

The CXFS packages you are running:

[root@linux64 root]# rpm -q cxfs_client cxfs-modules cxfs_utils xvm-cmds

The number and types of processors in your machine:
[root@linux64 root]# cat /proc/cpuinfo
The hardware installed on your machine:
[root@linux64 root]# /sbin/lspci
Modules that are loaded on your machine:
[root@linux64 root]# /sbin/lsmod
The /var/log/cxfs_client log file
Any messages that appeared in the system logs immediately before the system exhibited the problem.
Output about the cluster obtained from the cxfsdump utility run on an administration node.
After a system kernel panic, the debugger information from the KDB built-in kernel debugger. See “Kernel Status Tools”

Output from the following commands:

Information from the following files:

/var/log/messages
/var/adm/cxfs_client         (for client-only nodes)
/var/cluster/ha/log/*        (for administration nodes)
/etc/failover.conf
/etc/hosts

Output from the following commands:

/usr/cluster/bin/cdbutil gettree '#'
/usr/bin/hinv
/usr/bin/topology
/sbin/xvm show -v phys
/sbin/xvm show -top -v vol
/bin/netstat -ia

Prev	Table of Contents	Next
Chapter 17. Migration from an IRIX Cluster to a Linux 64-bit Cluster		Appendix A. CXFS Software Architecture