Chapter 9


Inode Duplication Facility

9.1 PURPOSE

As dependency on computer systems increases, the effects of job stops caused by hardware errors are having a greater impact. To cope with this, the file system management information is duplicated to enable a reduction in the number of job stops resulting from errors and in the recovery work.

9.2 OUTLINE

If an I/O error occurs in the management area due to the occurrence of a track fault, all other I/O operations can be performed to continue the operation by creating (duplicating) the contents of the file system management area in the other virtual disk.

The virtual disk containing the file system is called the master VD (virtual disk), while the other virtual disk is called the copy VD.

9.2.1 Devices to Be Duplicated

Devices that can be duplicated include magnetic disks and striping disks. The file systems to be duplicated are SFS and SFS/H. The management areas of these file systems are duplicated. A management area to be duplicated includes the following information.

However, the following file systems cannot be duplicated.

9.2.2 Registering the Duplication

Duplication can be registered by specifying the master and copy VDs in dupconf(1M). The registration information is stored in VVL. Since closing information is also stored or updated in VVL during operation, the recovery for duplication is required if a VVL error occurs (described in Section 9.6.7).

Home


9.2.3 Duplication Control

Writing to duplicated areas is performed for both the master and copy VDs. In this case, if an error occurs, the subsequent I/O for the error system stops (is closed) and writing is performed to the other, normal system.

Reading is performed from the master VD. However, if an I/O error occurs in the master VD, the master VD is closed and reading is performed from the copy VD. If the master VD has already been closed, reading is performed from the copy VD. While either the master or copy VD is closed, and an error occurs on the other system, an I/O error occurs without closing both systems.

9.3 COMMANDS

The following new commands are provided to enable use of the duplication facility.

devinfo(1M) can be used to reference the duplication definition information.

9.4 USING THE DUPLICATION FACILITY

This section explains the setting procedure for the duplication facility. The command examples in the explanations have the following configuration.

Home


9.4.1 Setting the Duplication

Home


Home


9.4.2 Canceling the Duplication

The procedure for canceling a duplication follows.

  1. Demount the file system.

  2. Perform recovery if the master or copy VD has been closed. (The procedure is explained in Sections 9.6.2 and 9.6.3.)

  3. Cancel the duplication using the dupconf(1M) command.

    Example:

  4. Mount the file system.

9.4.3 Reconstructing the File System

Duplication must be canceled and then reexecuted when a duplicated file system is reconstructed or attributes are changed (reallocation facility on/off, cluster size change, etc.).

The reconstruction procedure follows.

  1. Cancel the duplication (explained in Section 9.4.2).

  2. Reconstruct the file system. (Use dupmkf(1M) if mkfs(1M) must be executed.)

  3. Set the duplication (explained in Section 9.4.1).

9.4.4 Relationship with Existing Commands

9.4.4.1 BACKUP COMMANDS

If the tar, mtar, cpio, dump, restore, or dd commands are used for backup and restoration, matching of the duplicated areas is retained after execution. The user need not take these operations into account.

9.4.4.2 EXISTING IOS COMMANDS

If duplication is used, be particularly careful when executing the following commands.

9.5 ERROR HANDLING

Duplication enables a reduction in the number of job stops caused by errors, and also of the error recovery thus incurred. However, only the file management area is duplicated. The data area is not. Therefore, it is necessary to prepare for errors by collecting regular backup copies as usual, even if duplication is in progress.

Home


9.6 ERROR TYPES AND RECOVERY

This section explains error isolation and recovery if errors occur in duplicated systems. The recovery procedure conforms to that described in Chapter 8, Disk Recovery. However, the work involved in recovering duplicated areas is additional to that explained in Chapter 8, Disk Recovery. This section describes the general procedure, centering on duplicated area recovery (duplication information matching). Therefore, see Chapter 8, Disk Recovery, for details of the procedure for error correction. Since the recovery procedure varies with the operation mode, examination of each user site is required.

If an error occurs in a portion that is not duplicated, use the existing recovery method. The command examples in the explanations are assumed to have the following configuration.

9.6.1 Isolating Errors

If an error occurs in a file system that is performing duplication, the procedures to be performed vary with the error portions and on the detailed contents of the errors.

Errors are classified as follows.

These errors are distinguished using the following methods:

Home


9.6.2 Recovery when the Master VD Is Closed

If an error occurs in a duplicated master VD, this system is closed and the subsequent I/O operations are performed only for the copy VD. The three recovery methods used if the master VD is closed are presented in this section.

Home


After completing these recovery procedures, execute the fsck command to determine whether there is any file system inconsistency. If an inconsistency is found, the conventional procedure (fsck execution, file restoration, etc.) can be used. However, before attempting the correction, confirm that neither the master nor copy VD is closed.

Home


9.6.3 Recovery when the Copy VD Is Closed

If an error occurs in the area of the duplicated copy VD, the copy VD is closed and the subsequent I/O operations are performed only for the master VD. The recovery methods used when the copy VD is closed are discussed in this section.

Home


9.6.4 Recovery when a Copy VD Error Occurs (Master VD Closed)

One of two recovery methods is used if an error occurs in the copy VD while the master VD is closed. Determine the method to be used according to the error severity.

Home


9.6.5 Recovery when a Master VD Error Occurs (Copy VD Closed)

One of two recovery methods can be used if an error occurs in the master VD while the copy VD is closed. Determine the method to be used, according to the error severity.

9.6.6 Recovery when an Error Occurs Outside the Duplication Range

The existing recovery is performed regardless of the duplication.

9.6.7 Recovery when an Error Occurs in VVL

VVL stores duplication information. If an error occurs, perform recovery as shown next.

As mentioned, if an error occurs for both VVL systems, recovery requires much time and labor. Therefore, if one system is closed, perform recovery processing immediately.

Home


9.7 NOTES

Home

Contents Previous Chapter Next Chapter Index