Windows Server Cluster Monitoring


Overview

Applications Manager's Windows Cluster monitoring covers cluster details, cluster nodes, resource groups, cluster performance, networks, disk utilization, and storage statistics. You can also monitor cluster events by configuring Event Log rules.

Creating a new Windows Cluster monitor

Supported Versions: Windows 2016, Windows 2012, Windows 2012 R2, Windows 2008, Windows 2008 R2

Prerequisites for monitoring Windows Cluster metrics: Click here

Using the REST API to add a new Windows Cluster monitor: Click here

Follow the steps given below to create a new Windows Cluster monitor in Applications Manager:

  1. Click the New Monitor link.
  2. Select Windows Cluster under the Servers category.
  3. Specify the Display Name of the Windows Cluster.
  4. Enter the Virtual IP Address or the Listener IP Address of the cluster.
  5. Select the Version of the Windows Server from the drop-down menu.
  6. Either enter your Cluster Domain Administrator username and password, or select credentials from the Credential Manager drop-down menu. If you use Cluster Domain Administrator credentials, make sure the user account has permission to execute WMI queries on the 'root\mscluster' namespace on the cluster server nodes.
  7. Select the Node Discovery option. The available options are Do not Discover Nodes and Discover and Monitor Nodes:
    • Do not Discover Nodes - The cluster server nodes are not discovered as Windows Server monitors. If a node has already been added as a Windows server, it is associated internally with the cluster for collecting cluster-specific event logs.
    • Discover and Monitor Nodes - The cluster server nodes are discovered as Windows Server monitors, and their availability and performance are monitored. If a node has already been added as a Windows server, it is not discovered again; the existing server is associated internally with the cluster for collecting cluster-specific event logs.
  8. Select the Enable Event Log Monitoring option:
    • Checked - Enables event log monitoring on all the cluster server nodes. Events generated from the configured event log rules are propagated to the cluster. During event log collection on the servers, events for the cluster are collected as well and added to the database without generating an alert. Then, during data collection for the cluster, the cluster events from all the nodes are read from the database and an alert is generated for the configured event log rule.
    • Unchecked - Disables event log monitoring in the cluster.
      • Cluster Add: When adding a cluster, choosing Unchecked does not enable event log monitoring on the discovered nodes. If a node already exists, its current event log status is left as it is.
      • Cluster Update: When updating a cluster, choosing Unchecked disables event log monitoring on all the servers and the cluster. It also clears the event log related alarms and events from the database for all the servers and the cluster, so use this option only when necessary.
  9. Enter the polling interval time in minutes.
  10. Optionally, choose from the combo box the Monitor Group with which you want to associate the monitor.
  11. Click Add Monitor(s).
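
The two-phase event log flow described in step 8 (events stored during server collection without an alert, then read and alerted on during cluster collection) can be pictured with a small sketch. This is only an illustration of the idea, not how Applications Manager is implemented: the `Event`, `EventStore`, and `poll_cluster` names, the in-memory store, and the sample event IDs and messages are all hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    node: str       # cluster node on which the event was generated
    log_file: str   # log file type, e.g. "System"
    event_id: int   # sample value, chosen only for illustration
    message: str


class EventStore:
    """In-memory stand-in for the database that holds collected events."""

    def __init__(self):
        self._by_cluster = defaultdict(list)

    def save(self, cluster, event):
        # Phase 1: server-side collection stores the event; no alert is raised here.
        self._by_cluster[cluster].append(event)

    def drain(self, cluster):
        # Phase 2: cluster-side collection reads the stored events of all nodes.
        return self._by_cluster.pop(cluster, [])


def poll_cluster(store, cluster, rule_event_ids):
    """Return alert messages for stored events matching the rule's event IDs."""
    alerts = []
    for event in store.drain(cluster):
        if event.event_id in rule_event_ids:
            alerts.append(
                f"[{cluster}] {event.node}: event {event.event_id} - {event.message}"
            )
    return alerts


store = EventStore()
store.save("cluster-a", Event("node1", "System", 1001, "sample cluster event"))
store.save("cluster-a", Event("node2", "System", 2002, "sample unrelated event"))

alerts = poll_cluster(store, "cluster-a", rule_event_ids={1001})
print(alerts)
```

In this sketch only the event from `node1` matches the hypothetical rule, so a single alert is produced at cluster level, and the stored events are consumed by the cluster poll.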

Monitored Parameters

Go to the Monitors Category View by clicking the Monitors tab. Click Windows Cluster under the Servers section. The Windows Cluster bulk configuration view is displayed in three tabs:

  • Availability tab gives the availability history for the past 24 hours or 30 days.
  • Performance tab gives the health status and events for the past 24 hours or 30 days.
  • List view enables you to perform bulk admin configurations.

Click on the monitor name to view all the Windows Cluster attributes monitored under the following tabs:

Overview

Parameter | Description
CLUSTER DETAILS
Cluster Name/IP Address | The name or IP address of the cluster.
Quorum Owner Node | The name of the node that currently owns the quorum resource.
Quorum Path | The path to the quorum files.
Quorum Type | The current quorum type. The following are the possible values:
  • InputObject
  • Cluster
  • DiskOnly
  • NodeAndDiskMajority
  • NodeAndFileShareMajority
  • NodeMajority
Number of Nodes | The total number of nodes in the cluster.
Max Nodes | The maximum number of nodes that can participate in the cluster.
Number of Networks | The number of networks used by the server cluster for communication.
Resources Online | The count of resources that are currently online.
Resources Offline | The count of resources that are currently offline.
Resource Groups Online | The count of resource groups that are currently online.
Resource Groups Offline | The count of resource groups that are currently offline.
Disks in Use | The number of disks currently in use in the cluster.
DISK UTILIZATION
Disk Used Percentage | The total percentage of used disk space in the cluster.
Disk Free Percentage | The total percentage of free disk space in the cluster.
Disk Size | The total size of disk space, in megabytes.
Disk Used | The total used space on the disk, in megabytes.
Disk Free | The total free space available on the disk, in megabytes.
NODES
Node Name | The label by which the node is known.
State | The current state of the node. Node states can be:
  • Up - The node is physically plugged in, turned on, booted, and capable of executing programs.
  • Down - The node is turned off or not operational.
  • Joining - The node is in the process of joining a cluster.
  • Paused - The node is running but not participating in cluster operations.
  • Unknown - The operation was not successful.
RESOURCE CONTROL AND MULTICAST RR
Messages Outstanding | The length of the internal message queue.
RHS Processes | The number of Resource Host Monitor processes running on the node.
RHS Restarts | The number of Resource Host Monitor failures that have taken place on this node.
NETWORK RECONNECTIONS
Reconnect Count | The number of times the TCP connection was broken and reestablished.
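
As a rough illustration of how the node states in the table above might be derived, the sketch below decodes numeric state codes into the listed labels and counts nodes per state. The code-to-label mapping is an assumption based on the `State` property of the `MSCluster_Node` WMI class in the 'root\mscluster' namespace and should be verified for your Windows Server version; the source does not say how Applications Manager obtains these values. The helper names and sample rows are hypothetical.

```python
from collections import Counter

# Assumed mapping of MSCluster_Node.State codes to the labels listed above.
NODE_STATES = {
    -1: "Unknown",
    0: "Up",
    1: "Down",
    2: "Paused",
    3: "Joining",
}


def decode_node_state(code):
    """Translate a numeric state code; unmapped codes fall back to 'Unknown'."""
    return NODE_STATES.get(code, "Unknown")


def summarize_nodes(rows):
    """Count nodes per state from (node_name, state_code) pairs."""
    return Counter(decode_node_state(code) for _, code in rows)


# Hypothetical rows, shaped like the result of a WQL query such as
#   SELECT Name, State FROM MSCluster_Node
sample_rows = [("NODE1", 0), ("NODE2", 0), ("NODE3", 2), ("NODE4", 1)]

summary = summarize_nodes(sample_rows)
print(dict(summary))
```

Falling back to "Unknown" for unmapped codes is a design choice of this sketch, so that an unexpected value is still reported rather than raising an error.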


Performance

Parameter | Description
MULTICAST REQUEST REPLY
Messages Outstanding | The length of the internal message queue.
RESOURCE CONTROL MANAGER
RHS Processes | The number of Resource Host Monitor processes running on the node.
RHS Restarts | The number of Resource Host Monitor failures that have taken place on this node.
NETWORK RECONNECTIONS
Node Name | The label by which the node is known.
Reconnect Count | The number of times the TCP connection was broken and reestablished.
Normal Message Queue Length | The number of messages in the queue waiting to be sent.
Normal Message Queue Length Delta | The incoming message rate to the queue.
Urgent Message Queue Length | The number of urgent messages in the queue waiting to be sent.
Urgent Message Queue Length Delta | The incoming message rate to the queue.
RESOURCE TYPE STATS
Resource Failure | The number of times the Resource Host Monitor was terminated due to a resource failure.
Resource Failure Access Violation | The number of times the Resource Host Monitor was terminated due to a resource failure caused by an access violation.
Resource Failure Deadlock | The number of times the Resource Host Monitor was terminated due to a resource failure caused by a deadlock.

Networks

Parameter | Description
Name | The name of the network.
Address | The address of the entire network or subnet.
Role | The role of the network in the cluster. The following are the possible values:
  • None - The network is not used by the cluster.
  • Cluster - The network is used to carry internal cluster communication.
  • Client - The network is used to connect client systems to the cluster.
  • Both - The network is used to connect client systems and to carry internal cluster communication.
State | The current state of the network. The following are the possible values:
  • Unknown - The operation was not successful.
  • Unavailable - All of the network interfaces on the network are unavailable, which means the nodes that own the network interfaces are down.
  • Down - The network is not operational; none of the nodes on the network can communicate.
  • Partitioned - The network is operational, but two or more nodes on the network cannot communicate. Typically a path-specific problem has occurred.
  • Up - The network is operational; all of the nodes in the cluster can communicate.
NETWORK MESSAGES
Bytes Received | The Bytes Received/sec performance counter shows the number of new cluster message bytes received on the network per second.
Bytes Sent | The Bytes Sent/sec performance counter shows the number of new cluster message bytes sent over the network per second.
Messages Received | The Messages Received/sec performance counter shows the number of new cluster messages received on the network per second.
Messages Sent | The Messages Sent/sec performance counter shows the number of new cluster messages sent over the network per second.

Storage

Parameters | Description
Path | The path (including the drive letter, if present) of the clustered disk partition.
Volume Label | The volume label of the partition.
Size | The total size of the partition, in megabytes.
Used | The total used space in the partition, in megabytes.
Free | The total free space available in the partition, in megabytes.
Used Percentage | The percentage of used space in the partition.
Free Percentage | The percentage of free space in the partition.
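
The relationship between the Size, Used, Free, and percentage columns above can be shown with a short sketch. It assumes the percentages are simple ratios of the megabyte values, which the source does not state explicitly; the function name and sample figures are hypothetical.

```python
def disk_usage(size_mb, used_mb):
    """Derive free space and percentages from total and used space in megabytes."""
    if size_mb <= 0:
        raise ValueError("size_mb must be positive")
    if not 0 <= used_mb <= size_mb:
        raise ValueError("used_mb must be between 0 and size_mb")
    used_pct = round(used_mb / size_mb * 100, 2)
    return {
        "size_mb": size_mb,
        "used_mb": used_mb,
        "free_mb": size_mb - used_mb,
        "used_pct": used_pct,
        "free_pct": round(100 - used_pct, 2),
    }


# Sample partition: 100 GB in total, 25 GB used (values in megabytes).
partition = disk_usage(size_mb=102400, used_mb=25600)
print(partition)
```

Deriving the free percentage as 100 minus the used percentage keeps the two values consistent after rounding.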

Resource Groups

Parameters | Description
Name | The name of the resource group.
Current Node | The node on which the resource group is currently running.
Preferred Node | The preferred nodes in the cluster to which the resource group can fail over or fail back.
State | The current state of the resource group. The following are the possible values:
  • Unknown
  • Online
  • Offline
  • Failed
  • PartialOnline
  • Pending
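
To show how the resource group attributes above could be put to use, the sketch below counts groups per state (comparable to the Resource Groups Online and Offline counts in the Overview tab) and lists groups that are running on a node outside their preferred nodes. It is a minimal illustration with hypothetical class, function, and sample names; it does not reflect how Applications Manager computes these values.

```python
from collections import Counter
from dataclasses import dataclass, field

VALID_STATES = {"Unknown", "Online", "Offline", "Failed", "PartialOnline", "Pending"}


@dataclass
class ResourceGroup:
    name: str
    current_node: str
    preferred_nodes: list = field(default_factory=list)
    state: str = "Unknown"


def state_counts(groups):
    """Count resource groups per state; unrecognized states count as 'Unknown'."""
    return Counter(g.state if g.state in VALID_STATES else "Unknown" for g in groups)


def off_preferred(groups):
    """Names of groups that list preferred nodes but currently run elsewhere."""
    return [
        g.name
        for g in groups
        if g.preferred_nodes and g.current_node not in g.preferred_nodes
    ]


# Hypothetical sample data.
groups = [
    ResourceGroup("SQL-Group", "NODE2", ["NODE1"], "Online"),
    ResourceGroup("File-Group", "NODE1", ["NODE1", "NODE2"], "Online"),
    ResourceGroup("Print-Group", "NODE3", [], "Offline"),
]

counts = state_counts(groups)
moved = off_preferred(groups)
print(dict(counts), moved)
```

A group with no preferred nodes is not flagged here, on the assumption that an empty list means no preference has been configured.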

Events

Parameters | Description
Rule Name | The name of the Windows Cluster Event Log rule.
Log File Type | The log file type under which the Event Log rule was created. Windows Cluster events are generated in the 'System' log file, but Applications Manager users can create a rule for the cluster under any log file type. Hence, you can also see other events generated on all the servers at the cluster level.
Node Name | The name of the Windows cluster node on which the event was generated.
Source | The source associated with the event log.
Event ID | The event ID associated with the event log.
Type | The event type: Event of Any Type, Error, Warning, or Information. Note: For Security events, the type is either Success Audit or Failure Audit.
Description | The description of the incoming event.
Generated Time | The time at which the event was generated.

Applications

This tab provides insights into the applications running on the server and their monitoring status.

Parameters | Description
Monitors in this System | Displays information about the availability and health of the monitors configured on this server. Attributes shown here are Name, Type, Availability, and Health. To add new monitors, use the Add Monitors option.
Applications yet to be monitored | Lists applications discovered on the server that are not currently being monitored. Attributes shown here are Name, Type, and Port. To start monitoring, select one or more applications using the checkboxes and click the Enable Monitoring button.
