AWS Systems Manager is a management service that helps you automatically collect software inventory, apply OS patches, create system images, and configure Windows and Linux operating systems. It provides centralized operational visibility and control over your AWS infrastructure through features like Run Command, State Manager, and Patch Manager, enabling you to manage resources at scale.
Monitoring AWS Systems Manager is essential for maintaining the reliability and efficiency of automated operations across your managed instances. Applications Manager's AWS Systems Manager monitoring tool provides real-time tracking of key parameters such as command execution status, delivery timeouts, and managed instance counts, helping administrators quickly identify and resolve command failures and delivery issues across their fleet.
To learn how to create a new AWS Systems Manager monitor, refer here.
Go to the Monitors Category View by clicking the Monitors tab. Click on the Systems Manager (SSM) instance available under Amazon in the Cloud Apps section. Displayed is the AWS Systems Manager bulk configuration view distributed into three tabs:
By clicking a monitor from the list, you'll be taken to the AWS Systems Manager dashboard which includes the following tabs:
| Parameter | Description |
|---|---|
| COMMAND EXECUTION | |
| Failed Commands | The total number of commands that failed during execution on managed instances between the poll interval. |
| Delivery Timed Out Commands | The total number of commands that could not be delivered to the SSM agent within the timeout period between the poll interval. |
| Succeeded Commands | The total number of commands that completed successfully between the poll interval. |
| Active Commands | The total number of commands running at the time of polling. |
| MANAGED INSTANCES | |
| Managed Instances | The total number of managed instances registered with Systems Manager in this region at the time of polling. |
| Parameter | Description |
|---|---|
| COMMAND HISTORY | |
| Command ID | The unique identifier of the command invocation. |
| Status | The status of the command invocation at the time of polling. |
| Requested Time | Indicates the timestamp when the command was requested. |
| Document Name | The name of the SSM document that was executed. |
| Comment | The comment provided when the command was invoked. |
| Targeted Instances | The number of targets to which the command was sent. |
| Failed Instances | The number of targets for which the status is Failed or Execution Timed Out. |
| Delivery Timed Out Instances | The number of targets for which the status is Delivery Timed Out. |
| Completed Instances | The number of targets for which the command invocation reached a terminal state. |
Note:
It allows us to track crucial metrics such as response times, resource utilization, error rates, and transaction performance. The real-time monitoring alerts promptly notify us of any issues or anomalies, enabling us to take immediate action.
Reviewer Role: Research and Development