| United States-English |
|
|
|
![]() |
Using EMS HA Monitors > Chapter 1 Installing and Using EMSUsing EMS HA Monitors |
|
There are two ways to use EMS HA Monitors:
The following are prerequisites to using EMS:
Resource classes are structured hierarchically, similar to a filesystem structure, although they are not actually files and directories. The classes supplied with this version of EMS are listed in Figure 1-2 “Event Monitoring Service Resource Class Hierarchy”. Resource instances are listed in bold, and instances that are replaced with an actual name are in bold italics. The full path of a resource includes the class, subclasses, and instance. An example of a full resource path for the physical volume status of the device /dev/dsk/c0t1d2 belonging to volume group vgDataBase, would be /vg/vgDataBase/pv_pvlink/status/c0t1d2. This section describes the steps from the SAM interface to EMS to create monitoring requests that notify non-MC/ServiceGuard management applications such as IT/Operations.This information for creating requests is also valid for monitors sold with other products (ATM or OTS, for example) and for user-written monitors written according to developer specifications in Writing Monitors for the Event Monitoring Service (EMS). To start the EMS configuration, double-click on the Event Monitoring Service icon in the Resource Management area in SAM. The main screen, shown in Figure 1-3 “Event Monitoring Service Screen”, shows all requests configured on that system; if you haven't created requests, the screen will be empty. All resources are divided into classes. When you double-click on Add Monitoring Request in the Actions menu, the top-level classes for all installed monitors are dynamically discovered and then listed. Some Hewlett-Packard products, such as ATM or HP OTS 9000, provide EMS monitors. If those products are installed on the system, then their top-level classes will also appear here. Similarly, top-level classes belonging to user-written monitors, created using the EMS Developer's Kit, will be discovered and displayed here. Traverse the hierarchy in the upper part of the screen in Figure 1-4 “The Top Level of the Resource Hierarchy in the Add a Monitoring Request Screen” and select a resource instance to monitor in the lower part of the screen as in Figure 1-5 “Choosing a Resource Instance in the Add a Monitoring Request Screen”. The * wildcard is a convenient way to create many requests at once. Most systems have more than one disk or network card, and many have several disks. To avoid having to create a monitor request for each disk, select * (All Instances) in the Resource Instance box. See Figure 1-5 “Choosing a Resource Instance in the Add a Monitoring Request Screen”. Wildcards are available only when all instances of a subclass are the same resource type. Wildcards are not available for resource classes. So, for example, a wildcard is available for the status instances in the /vg/vgName/pv_pvlink/status subclass, but no wildcard appears for the volume group subclasses under the /vg resource class. The screen in Figure 1-6 “Monitoring Request Parameters” shows where you specify when and how to send events. The following sections describe the monitoring parameters and some common applications of them. While the monitor may be polling disks every 5 minutes, for example, you may only want to be alerted when something happens that requires your attention. When you create a request, you specify the conditions under which you receive an alert. Here are the terms under which you can be notified:
If you select conditional notification, you may select one or more of these options:
For example, assume you have a request for notification if status > 3 for a resource with a values range of 1-7. You would get alerts each time the value equaled 4, 5, 6, or 7. If the updated version of the monitor has a new status value of 8, you would see new alerts when the resource equalled 8. The polling interval determines the maximum amount of elapsed time before a monitor knows about a change in status for a particular resource. The shorter the polling interval, the more likely you are to have recent data. However, depending on the monitor, a short polling interval may use more CPU and system resources. You need to weigh the advantages and disadvantages between being able to quickly respond to events and maintaining good system performance. The minimum polling interval depends on the monitor's ability to process quickly. For most resource monitors the minimum is 30 seconds. Disk monitor requests can be as short as 1 second. MC/ServiceGuard monitors resources every few seconds. You may want to use a short polling interval (30 seconds or less) when it is critical that you make a quick failover decision. You may want a polling interval of 5 minutes or so for monitoring less critical resources. You may want to set a very long polling interval (4 hours) to monitor failed disks that are not essential to the system, but which should be replaced in the next few days. You specify the protocol the EMS framework uses to send events in the Notify via: section of the screen in Figure 1-6 “Monitoring Request Parameters”. The options are:
Templates for configuring IT/Operations and Network Node Manager to display EMS events can be found on the Hewlett-Packard High Availability public web page at http://www.hp.com/go/ha. The notification comment is useful for sending task reminders to the recipients of an event. For example, if you have a disk monitor request that reports an alert that an entire mirror has failed, when that event shows up in IT/Operations, for example, you may want it to have the name of the person to contact if disks fail. If you have configured MC/ServiceGuard package dependencies, you may want to enter the package name as a comment in the corresponding pv_summary request. There are two ways to use the copy function:
To change the monitoring parameters of a request, select the monitoring request from the main screen and select Actions: Modify Monitoring Request. This section describes how to use SAM to create package dependencies on EMS resources. This creates an EMS request to monitor that resource and to notify MC/ServiceGuard when that resource reaches a critical user-defined level. MC/ServiceGuard will then failover the package. Here are some examples of how EMS might be used:
This information for creating requests is also valid for EMS monitors sold with other products (ATM or OTS, for example) and for user-written monitors written according to developer specifications in Writing Monitors for the Event Monitoring Service (EMS).
A package can depend on any resource monitored by an EMS monitor. To create package dependencies, choose create or modify a package from the Package Configuration interface under the High Availability Clusters subarea of SAM, Figure 1-7 “Package Configuration Screen”. You see a new option called "Specify Package Resource Dependencies." Click on "Specify Package Resource Dependencies..." to add EMS resources as package dependencies; you see a screen similar to Figure 1-8 “Package Resource Dependencies Screen”. If you click "Add Resource", you get a screen similar to Figure 1-7 “Package Configuration Screen”. When you select a resource, either from the "Add a Resource" screen, or from the "Package Resource Dependencies" screen by selecting a resource and clicking "Modify Resource Dependencies..." you get a screen similar to Figure 1-9 “Resource Parameters Screen”. To make a package dependent on an EMS resource, select a Resource Up Value from the list of Available Resource Values, then click "Add." The example in Figure 1-9 “Resource Parameters Screen” shows the possible values for pv_summary. Different resources show different available "Up" values.
If you select UP, the package fails over if the value is anything but UP. If you select UP and PVG-UP, the package fails over if the pv_summary value is not equal to UP or PVG_UP; in other words, if pv_summary were SUSPECT or DOWN. The polling interval determines the maximum amount of elapsed time before the monitor knows about a change in resource status. For critical resources, you may want to set a short polling interval, for example 30 seconds, which could adversely affect system performance. With longer polling intervals you gain system performance, but risk not detecting problems soon enough. You can also add resources as package dependencies by modifying the package configuration file in /etc/cmcluster/pkg.ascii. See Managing MC/ServiceGuard for details on how to modify this file. A example of the syntax is:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|||||||||||||||