Jump to content United States-English
HP.com Home Products and Services Support and Drivers Solutions How to Buy
» Contact HP
More options
HP.com home
Using EMS HA Monitors > Chapter 2 Monitoring Disk Resources

Creating Disk Monitoring Requests

» 

Technical documentation

Complete book in PDF
» Feedback
Content starts here

 » Table of Contents

 » Glossary

 » Index

There are two ways to create disk monitor requests from:

  • the SAM interface to EMS to send alerts to HP OpenView ITO, ClusterView, or Network Node Manager.

  • MC/ServiceGuard to configure any disk monitor resource as a package dependency.

These requests are not exclusive: you can configure the disk monitor from both MC/ServiceGuard and the SAM interface to EMS. In fact, if you are using EMS to monitor disks for MC/ServiceGuard package dependencies, it is recommended you also configure EMS to send events to your system monitoring software, e.g. HP OpenView IT/Operations, so you are alerted when something threatens data or application availability.

The following sections take some common disk configurations in a high availability environment and give examples of the types of monitor requests you might want to create.

Disk Monitoring Request Suggestions

The examples listed in Table 2-7 “Suggestions for Creating Disk Monitor Requests” are valid for both RAID and mirrored configurations. For examples on configuring MC/ServiceGuard dependencies, see Chapter 1, "Configuring MC/ServiceGuard Package Dependencies".

Table 2-7 Suggestions for Creating Disk Monitor Requests

To be alerted when...

Resources to monitorMonitoring Parameters

Notify

 

Value

Option

you are at risk for data loss (most common for use with MC/ServiceGuard)

pv_summary

when value is

>=

SUSPECT

 

lv_summary

when value is

>=

INACTIVE_DOWN 

any disks fail

pv_pvlink/status/*

when value is

not equal

UP

any disks fail, and you want to know when they are back up

pv_pvlink/status/*

when value is

not equal

UP

RETURN

you want regular reminders to fix inoperative disks, controllers, busses, and host adapters, and you want notification when they are fixed

pv_pvlink/status/*

at each interval
(use a long polling interval, 1 hour or more)

=

DOWN

REPEAT
RETURN

any logical volume becomes unavailable

lv/status/*

when value is...

not equal

UP

you have lost a mirror in your 2-way mirroring environment

lv/copies/*

when value is...

<

2

 

The following screens step you through creating a disk monitor request. Assume you want to be alerted when any disks fail and when they are back up. Figure 2-2 “Example: Selecting All Instances of /vg/vgName/pv_pvlink/status” shows you can select all instances of pv_pvlink, so you only have to enter the parameters once for each volume group.You still need to create multiple pv_pvlink requests, one for each volume group on your system. Click OK to set monitoring parameters.

Figure 2-2 Example: Selecting All Instances of /vg/vgName/pv_pvlink/status

Example: Selecting All Instances of /vg/vgName/pv_pvlink/status

Assume you have a great need to know the status of your system at all times. You would need a short polling interval, perhaps between 30 and 120 seconds. (If you notice the disk monitor consumes too much CPU, you may want to set a longer polling interval.) Assume also that you want an Initial event sent to make sure the request is configured properly. You would want to set the Return option to send an event when disks come back up. You would configure the events to use opcmsg (ITO) protocol because you use HP OpenView IT/Operations as your system management tool. The parameters for your monitoring request would look like Figure 2-3 “Example: Configuring /vg/vgName/pv_pvlink/status Parameters to Notify When Disks Fail”.

Figure 2-3 Example: Configuring /vg/vgName/pv_pvlink/status Parameters to Notify When Disks Fail

Example: Configuring /vg/vgName/pv_pvlink/status Parameters to Notify When Disks Fail

All requests are created in a similar way. You need to make sure you perform these steps for all instances in all volume groups you want to monitor.

Resources to Monitor for RAID Arrays

These considerations are relevant to all supported RAID configurations listed at the beginning of this chapter. To adequately monitor a RAID system, create requests to monitor at least the following resources for all volume groups on a node:

/vg/vgName/pv_summary

This gives you an overview of the status of the entire physical volume group and is recommended when using EMS in conjunction with MC/ServiceGuard; see “Rules for Using the EMS Disk Monitor
with MC/ServiceGuard”
.

vg/vgName/pv_pvlink/status/*

This gives you the status of each PV link in the array and is redundant to pv_summary. It is recommended when using EMS outside of the MC/ServiceGuard environment, or if you require specific status on each physical device.

vg/vgName/lv_summary

This gives you the status of data availability on the array.

Figure 2-4 “RAID Array Example” represents a node with two RAID arrays and two PV links to each are

Figure 2-4 RAID Array Example

RAID Array Example

Each LUN on the RAID array is in it own volume group: vgdance and vgsing. Assume this is one node in a 2-node cluster and you want to be notified when there is a failover, when any physical device fails, and when any logical volume becomes unavailable.

To be notified when a package fails over, you must configure an EMS request that is the same as the package dependency you configured in MC/ServiceGuard. See Chapter 1, "Configuring MC/ServiceGuard Package Dependencies". For this example, assume the package UP values were set as "UP" and "PVG_UP".

To configure the EMS alerts, create the following requests:

Table 2-8 Sample Disk Monitoring Requests

Resource

Monitoring Parameters

Notify

Condition

Option

/vg/vgdance/pv_summary

when value is...

>

PVG_UP

RETURN

/vg/vgsing/pv_summary

when value is...

>

PVG_UP

RETURN

/vg/dance/lv_summary

when value is...

>=

INACTIVE

RETURN

/vg/vgsing/lv_summary

when value is...

>=

INACTIVE

RETURN

 

If pv_summary is SUSPECT, you know a physical device fails. If pv_summary status is SUSPECT, you may want to look at your lv_summary to see if you can still access all data. If lv_summary is DOWN or INACTIVE_DOWN, you do not have a complete copy of data.

Resources to Monitor for Mirrored Disks

This section is valid for mirrored disks created with MirrorDisk/UX. Mirroring is required to be PVG-strict if you are using the disk monitor. Mirrored configurations that are not PVG-strict will not give you a correct pv_summary.

To adequately monitor mirrored disks, create requests for the following resources for all volume groups on a node:

/vg/vgName/pv_summary

This gives you summary status of all physical volumes in a volume group. A high availability system must be configured PVG strict. If not, pv_summary cannot accurately determine disk availability.

vg/vgName/pv_pvlink/status/*

This gives you the status of each physical disk and links.

vg/vgName/lv_summary

This gives you the status of data availability for logical volumes.

vg/vgName/lv/copies/*

This gives you the total number of copies of data currently available.

Figure 2-5 “Mirrored Disks Example” represents two nodes with 2-way mirrored configuration with 10 disks on 2 busses. Both copies are in a single volume group. Assume you want to be notified when any physical device fails, and when you only have one copy of data, or when there is an MC/ServiceGuard failover. To configure this last request, you must duplicate your MC/ServiceGuard package dependency. See Chapter 1 "Configuring MC/ServiceGuard Package Dependencies".

Figure 2-5 Mirrored Disks Example

Mirrored Disks Example

To configure the EMS alerts, create the following requests on each node:

Resource

Monitoring Parameters

Notify

Condition

Option

/vg/vg01/pv_summary

when value is...

>=

PVG_UP

RETURN

/vg/vg01/lv_summary

when value is...

>=

INACTIVE

RETURN

/vg/vg01/lv/copies/*

when value is...

<=1

RETURN

Alerts need to be interpreted in relation to each other. In the table above, you would get an alert when PVG_UP is true. Although all data is available, the condition PVG_UP implies there are physical volumes that are not functioning and need to be fixed.You may want to examine lv/copies to see how many copies of data are accessible and determine how urgently you need to repair the failures. If you have 3-way mirroring and only 1 copy of data is available, for example, you may want to correct the failure immediately to eliminate the single point of failure. Table 2-9 “Example for Interpreting the pv_summary for Mirrored Disks” is an example of how the disk monitor determines whether data is available in a mirrored configuration with 5 disks on each bus.

Table 2-9 Example for Interpreting the pv_summary for Mirrored Disks

number of valid devices

meaning

pv_summary
value

10

all PVs and data accessible

UP

9

1 PV down, all data accessible

PVG_UP

8-5

if 5 PVs are from the same PVG, then all data is available

PVG_UP

if 2 or more physical volumes from different PVGs are DOWN, the disk monitor cannot conclude that all data is available

SUSPECT

4-1

some data missing

DOWN

0

no data available

 

Resources to Monitor for Lock Disks

Lock disks are used as a tie-breaker in a forming or reforming cluster, so if you are using a lock disk with your cluster, you should request a monitor for that disk and send an alert to your system management software if the lock disk is unavailable. If the lock disk is unavailable during cluster formation, the cluster may fail to reform. Requests to monitor the lock disk might look like this:

Resource

Monitoring Parameters

Notify

Condition

Option

/vg/vg02/pv_pvlink/c0t0d1

when value is

>=

BUSY

RETURN

The Repeat value in the Options will send an alert until the lock disk is available.

You need to create a request on each node in the cluster. Because the bus name and SCSI path to the lock disk may be different on each node, the resource instance may have a different name. It is merely a different path to the same lock disk.

Resources to Monitor for Root Volumes

In a high availability system, it is recommended that you mirror your root volume, and have them on separate links in separate PVGs. Note that the root volume should always be ACTIVE. Requests to monitor the root volume might look like this:

Resource

Monitoring Parameters

Notify

Condition

Option

/vg/vg00/pv_pvlink/c0t0d0

when value is...

>=

BUSY

REPEAT

/vg/vg00/pv_pvlink/c1t0d0

when value is...

>=

BUSY

REPEAT

/vg/vg00/lv_summary

when value is...

not equalUP

RETURN

/vg/vg00/lv/copies/lv01

when value is...

<

1

RETURN

If one of the root volumes is unavailable, you are alerted and told which one has failed (pv_pvlink/status). If you lose a root disk mirror, you are alerted. You are also notified when the mirror is restored.

Printable version
Privacy statement Using this site means you accept its terms Feedback to webmaster
© 1997 Hewlett-Packard Development Company, L.P.