Thresholds let you define the status of an attribute based on specific conditions. A threshold is a predefined set of values associated to an attribute, based on which an Alarm can be generated. For example, you can define a threshold to show the status of the webserver as critical, if the response time exceeds 100 seconds. Likewise, you can define a threshold to show the status as clear, if the MBean's attribute - Active is equal to true.
To define a threshold, follow the steps given below:
Manual:This can be categorized as:
| Use this option for Expressions | Description |
|---|---|
| $10D_MVA | Ten Days Moving Average |
| $30D_MVA | Thirty Days Moving Average |
| $6H_MVA | Six Hours Moving Average |
| $7D_MVA | Seven Days Moving Average |
| $LastHourValue | Last Hour Average |
| $10H_MVA | Ten Hours Moving Average |
Now specify the conditions for the Critical, Warning or Clear severity. We provide two condition support for every severity when creating Threshold Profiles. You can set the monitored value of a particular severity to be > (greater than), < (lesser than), = (equal to), != (not equal to), <= (lesser than or equal to), >= (greater than or equal to) to a customizable Threshold Limit. You can set additional condition when defining your threshold limit for each severity by clicking on the More link at the right-hand corner of the Create New Threshold Profile table.
Note:
Specify the number of polls that can be scheduled before reporting an error. If you do not want to generate an alarm for the very first time the threshold condition is crossed, you can set Applications Manager to examine a certain X number of polls from a Y number of consecutive polls (both numbers are customizable), before generating an alarm. For e.g., If you want an email alarm to be generated only if the CPU Disk utilization is above 100%, consecutively for more than three polls out of five polls, then you can configure the number of polls to be examined out of a number consecutive polls before generating an alarm. By default, it takes the value from Global Settings. Refer Configuring Consecutive Polls for setting configurations for individual monitors.
Note: In the case of Automatic Adaptive profiles:
The thresholds thus defined can be associated with the attributes for determining the status of the attributes of the Monitor Group. You can also associate thresholds and actions directly through Global Alarm Configuration instead of the individual Monitor screen.
You can use New Threshold Profiles to apply Adaptive thresholds broadly across targets within your environment. For a Standard Threshold Profile, raw data will be assessed to the threshold limit and alert will be generated in every poll. For Anomaly Threshold Profiles, hourly average data is compared with the past archived data (this should be configured by the user). Adaptive thresholds are a combination of both standard and anomaly thresholds and has two categories - Auto and Manual. Applications Manager allows you to configure alarms by associating thresholds based on anomaly detection with the help of Machine Learning. It makes use of RPCA algorithm to determine the anomaly which helps in identifying sudden or unusual spikes in the data immediately upon occurrence and triggers alerts accordingly. Once an Auto-Adaptive profile is configured, a history of data is collected for the attribute of the monitor to understand its characteristic and behavior. After a few hours of data collection, it is compared with the current data to check abnormalities;if any and then an alarm is raised in case of anomalies. If an abnormal value is detected for the attribute, with it's deviation violating the threshold, then alerts will be generated and the anomaly details will be shown in the attribute's RCA message. The RCA message will contain the anomaly details of the attribute that caused the alarm, by displaying the amount of deviation found between the expected and actual values. If the collected value does not have any anomalies or the deviation does not meet 'Critical' or 'Warning' threshold, then the alarms will be cleared with 'Clear' severity. Note: Machine Learning based anomaly detection is applicable only to Numeric attributesand does not support the following attributes:
Manual Adaptive Thresholds - Baseline and Custom expression Adaptive Baseline Threshold compares raw data values with past archived data at every poll. Adaptive Custom Expression contains some configured expressions (Need to configure by the user from UI) and it calculates the value for that expression and compares it with raw data at every poll. |
Refer to Associating Threshold and Action with Attributes for more details.
Applications Manager gives you the ability to create a Threshold & Action profile report in .csv format. This report shows the threshold creation time, Monitor Group, Type, Threshold Profile, Threshold Status, Critical, Warning, and Clear Actions and more for catagorized and uncatagorized monitors. Here's how you can create the report:
The downloaded CSV report contains the following columns:
| Column Name | Description |
|---|---|
| Monitor Group | The top-level group the monitor belongs to. |
| Sub Group | The sub-group associated with the monitor. |
| Parent Monitor | The name of the parent monitor. |
| Monitor Name | The name of the monitor. |
| Monitor Type | The category or type of the monitor. |
| Creation Time | The date and time when the monitor was created. |
| Attribute Name | The specific attribute to which the threshold and actions apply. |
| Threshold Profile | The name of the profile defining the alert thresholds for the attribute. |
| Critical Condition | The condition configured for triggering a Critical severity alert. |
| Warning Condition | The condition configured for triggering a Warning severity alert. |
| Clear Condition | The condition configured for triggering a Clear severity. |
| Critical Action | The list of actions configured for a Critical severity. |
| Warning Action | The list of actions configured for a Warning severity. |
| Clear Action | The list of actions configured for a Clear severity. |
You can create Threshold Profiles from the Central Server in Applications Manager, the same as in the Professional Edition, and the profiles created automatically sync with the Probe Servers. However, these profiles can only be used and cannot be edited from the Probe Servers.
It allows us to track crucial metrics such as response times, resource utilization, error rates, and transaction performance. The real-time monitoring alerts promptly notify us of any issues or anomalies, enabling us to take immediate action.
Reviewer Role: Research and Development