
AWS ECS Monitoring


AWS ECS - Overview

Amazon Elastic Container Service (Amazon ECS) is a highly scalable, fast container management service that makes it easy to run, stop, and manage Docker containers on a cluster. Amazon ECS lets you launch and stop container-based applications with simple API calls, get the state of your cluster from a centralized service, and use many familiar Amazon EC2 features.

Creating a new AWS ECS monitor

To learn how to create a new AWS ECS monitor, refer here.

Monitored Parameters

Go to the Monitors Category View by clicking the Monitors tab. Click ECS under Amazon in the Cloud Apps section. The Amazon ECS bulk configuration view opens, organized into three tabs:

  • Availability tab gives the availability history for the past 24 hours or 30 days.
  • Performance tab gives the health status and events for the past 24 hours or 30 days.
  • List view tab enables you to perform bulk admin configurations.

Click a monitor in the list to open the AWS ECS dashboard, which has five tabs: Overview, Cluster Performance, Tasks, Services, and Container Instances.

Overview

CLUSTER INFORMATION

| Parameter | Description |
| --- | --- |
| Status | The status of the cluster (ACTIVE, PROVISIONING, DEPROVISIONING, FAILED, INACTIVE). |
| Cluster ARN | The Amazon Resource Name (ARN) that identifies the cluster. |
| Registered Container Instances | The number of container instances registered to the cluster. |

CLUSTER TASKS

| Parameter | Description |
| --- | --- |
| Running Tasks | The number of tasks that are in the RUNNING state. |
| Pending Tasks | The number of tasks that are in the PENDING state. |
| Running EC2 Tasks | The number of EC2 tasks that are in the RUNNING state. |
| Pending EC2 Tasks | The number of EC2 tasks that are in the PENDING state. |
| Running Fargate Tasks | The number of Fargate tasks that are in the RUNNING state. |
| Pending Fargate Tasks | The number of Fargate tasks that are in the PENDING state. |

CLUSTER SERVICES

| Parameter | Description |
| --- | --- |
| Active Services | The number of active services running in the cluster. |
| Active EC2 Services | The number of EC2 services that are running on the cluster in an ACTIVE state. |
| Draining EC2 Services | The number of EC2 services that are in the DRAINING state. |
| Active Fargate Services | The number of Fargate services that are running on the cluster in an ACTIVE state. |
| Draining Fargate Services | The number of Fargate services that are in the DRAINING state. |
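
The Overview parameters map closely onto fields of the Amazon ECS DescribeClusters API response. The sketch below is not how Applications Manager collects the data (the page does not say); it only illustrates how one cluster entry, requested with the STATISTICS option, could be flattened into the fields listed above. The `summarize_cluster` helper and the sample values are hypothetical.

```python
from typing import Any, Dict


def summarize_cluster(cluster: Dict[str, Any]) -> Dict[str, Any]:
    """Flatten one cluster entry of a DescribeClusters response.

    Assumes the call was made with include=["STATISTICS"], in which case
    per-launch-type counts arrive as a list of name/value string pairs.
    """
    stats = {s["name"]: int(s["value"]) for s in cluster.get("statistics", [])}
    return {
        "Status": cluster.get("status"),
        "Cluster ARN": cluster.get("clusterArn"),
        "Registered Container Instances": cluster.get(
            "registeredContainerInstancesCount", 0
        ),
        "Running Tasks": cluster.get("runningTasksCount", 0),
        "Pending Tasks": cluster.get("pendingTasksCount", 0),
        "Active Services": cluster.get("activeServicesCount", 0),
        "Running EC2 Tasks": stats.get("runningEC2TasksCount", 0),
        "Pending EC2 Tasks": stats.get("pendingEC2TasksCount", 0),
        "Running Fargate Tasks": stats.get("runningFargateTasksCount", 0),
        "Pending Fargate Tasks": stats.get("pendingFargateTasksCount", 0),
        "Active EC2 Services": stats.get("activeEC2ServiceCount", 0),
        "Draining EC2 Services": stats.get("drainingEC2ServiceCount", 0),
        "Active Fargate Services": stats.get("activeFargateServiceCount", 0),
        "Draining Fargate Services": stats.get("drainingFargateServiceCount", 0),
    }


# Made-up sample shaped like one element of response["clusters"].
sample_cluster = {
    "clusterArn": "arn:aws:ecs:us-east-1:123456789012:cluster/demo",
    "status": "ACTIVE",
    "registeredContainerInstancesCount": 2,
    "runningTasksCount": 5,
    "pendingTasksCount": 1,
    "activeServicesCount": 3,
    "statistics": [
        {"name": "runningEC2TasksCount", "value": "2"},
        {"name": "runningFargateTasksCount", "value": "3"},
        {"name": "pendingFargateTasksCount", "value": "1"},
    ],
}

summary = summarize_cluster(sample_cluster)
for field, value in summary.items():
    print(f"{field}: {value}")
```

In a live setup the input would typically come from `boto3.client("ecs").describe_clusters(clusters=[...], include=["STATISTICS"])`; the sample dictionary stands in for that call so the sketch runs without AWS credentials.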

Cluster Performance

CPU

| Parameter | Description |
| --- | --- |
| CPU Reservation | Amount of CPU units that are reserved by running tasks in the cluster (in percentage). |
| CPU Utilization | Amount of CPU units that are used in the cluster (in percentage). |

MEMORY

| Parameter | Description |
| --- | --- |
| Memory Reservation | Amount of memory that is reserved by running tasks in the cluster (in percentage). |
| Memory Utilization | Amount of memory that is used in the cluster (in percentage). |

CPU USAGE

| Parameter | Description |
| --- | --- |
| CPU Reserved | The CPU units reserved by tasks in the cluster. This metric is collected only for tasks that have a defined CPU reservation in their task definition. |
| CPU Utilized | The CPU units used by tasks in the cluster. This metric is collected only for tasks that have a defined CPU reservation in their container definition. |

MEMORY USAGE

| Parameter | Description |
| --- | --- |
| Memory Reserved | Amount of memory that is reserved by tasks in the cluster (in GB). |
| Memory Utilized | Amount of memory being used by tasks in the cluster (in GB). |

DISK I/O THROUGHPUT

| Parameter | Description |
| --- | --- |
| Storage Read Rate | Rate at which data is read from storage in the cluster (in kB/s). |
| Storage Write Rate | Rate at which data is written to storage in the cluster (in kB/s). |

CLUSTER NETWORK I/O

| Parameter | Description |
| --- | --- |
| Data Transmit Rate | Rate at which data is transmitted by the cluster (in kB/s). |
| Data Receive Rate | Rate at which data is received by the cluster (in kB/s). |
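
To make the percentage metrics concrete, here is a small worked calculation. It assumes the CloudWatch-style definition of reservation (units reserved by running tasks divided by the units registered by all container instances in the cluster); the page does not state the exact formula the product uses, and `percent_of_capacity` and the figures are illustrative only.

```python
def percent_of_capacity(amount: float, capacity: float) -> float:
    """Return amount as a percentage of capacity, rounded to two decimals.

    A cluster with no registered capacity reports 0.0 instead of
    raising a division error.
    """
    if capacity <= 0:
        return 0.0
    return round(100.0 * amount / capacity, 2)


# Hypothetical cluster: two container instances registering 2048 CPU units
# and 4096 MiB of memory each.
registered_cpu_units = 2 * 2048
registered_memory_mib = 2 * 4096

# Hypothetical totals reserved by the tasks currently running.
reserved_cpu_units = 1536
reserved_memory_mib = 2048

cpu_reservation = percent_of_capacity(reserved_cpu_units, registered_cpu_units)
memory_reservation = percent_of_capacity(reserved_memory_mib, registered_memory_mib)

print(f"CPU Reservation: {cpu_reservation}%")        # 37.5%
print(f"Memory Reservation: {memory_reservation}%")  # 25.0%
```

Utilization follows the same shape, with units actually consumed in place of units reserved.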

Tasks

Task Details

| Parameter | Description |
| --- | --- |
| Task ID | The unique identifier for the task. |
| Health Status | The health status of the task, which is determined by the health of the essential containers in the task (HEALTHY, UNHEALTHY, UNKNOWN). |
| Last Status | The last known status of the task (PROVISIONING, PENDING, ACTIVATING, RUNNING, DEACTIVATING, DEPROVISIONING, STOPPED). |
| Desired Status | The desired status of the task. |
| Launch Type | The launch type on which your task is running (EC2 or Fargate). |
| Connectivity | The connectivity status of the task (CONNECTED or DISCONNECTED). |
| Connectivity At | The timestamp for when the task entered the CONNECTED state. |
| Configured CPU Units | The number of CPU units configured for the task. |
| Configured Memory | The amount of memory configured for the task (in MB). |
| Number of Containers | The number of containers in the task. |

Task Info

| Parameter | Description |
| --- | --- |
| Task ID | The unique identifier for the task. |
| Container Instance ID | The unique identifier for the container instance. |
| Task Group | The name of the task group associated with your task. |
| Task Definition | The full description of the task definition. |
| Created At | The timestamp for when the task was created. |
| Started At | The timestamp for when the task was started. |
| Started By | The tag specified when the task was started. |
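
The Health Status parameter is derived from the essential containers of a task. The sketch below shows one way that roll-up can be expressed, following the rule as Amazon ECS documents it to the best of my understanding: any unhealthy essential container makes the task UNHEALTHY, otherwise any unknown one makes it UNKNOWN, otherwise the task is HEALTHY. The `task_health` function and the `essential` key (which in practice comes from the task definition rather than the task description) are assumptions made for illustration.

```python
from typing import Any, Dict, Iterable


def task_health(containers: Iterable[Dict[str, Any]]) -> str:
    """Roll container health up to a task-level health status.

    Only containers flagged as essential are considered. A container
    with no reported health counts as UNKNOWN.
    """
    statuses = [
        c.get("healthStatus", "UNKNOWN")
        for c in containers
        if c.get("essential", True)
    ]
    if "UNHEALTHY" in statuses:
        return "UNHEALTHY"
    if not statuses or "UNKNOWN" in statuses:
        return "UNKNOWN"
    return "HEALTHY"


# Made-up containers for a single task.
containers = [
    {"name": "web", "essential": True, "healthStatus": "HEALTHY"},
    {"name": "log-router", "essential": False, "healthStatus": "UNHEALTHY"},
]

# The unhealthy sidecar is not essential, so it does not affect the result.
print(task_health(containers))
```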

Services

Service Status

| Parameter | Description |
| --- | --- |
| Service Name | The name of the service. |
| Status | The status of the service (ACTIVE, DRAINING, INACTIVE). |
| CPU Utilization | The percentage of CPU units that are used in the service. |
| Memory Utilization | The percentage of memory that is used in the service. |
| Pending Tasks | The number of tasks in the service that are in the PENDING state. |
| Running Tasks | The number of tasks in the service that are in the RUNNING state. |
| Desired Tasks | The desired number of instantiations of the task definition to keep running on the service. |
| Launch Type | The launch type on which your service is running (EC2 or FARGATE). |
| Scheduling Strategy | The scheduling strategy to use for the service (REPLICA or DAEMON). |

Service Insights

| Parameter | Description |
| --- | --- |
| Service Name | The name of the service. |
| CPU Reserved | The CPU units reserved by tasks in the service. This metric is collected only for tasks that have a defined CPU reservation in their task definition. |
| CPU Utilized | The CPU units used by tasks in the service. This metric is collected only for tasks that have a defined CPU reservation in their container definition. |
| Memory Reserved | The memory that is reserved by tasks in the service (in GB). |
| Memory Utilized | The memory being used by tasks in the service (in GB). |
| Storage Read Rate | Rate at which data is read from storage in the service (in kB/s). |
| Storage Write Rate | Rate at which data is written to storage in the service (in kB/s). |
| Data Transmit Rate | Rate at which data is transmitted by the service (in kB/s). |
| Data Receive Rate | Rate at which data is received by the service (in kB/s). |
| TaskSet Count | The number of task sets in the service. |

Service Details

| Parameter | Description |
| --- | --- |
| Service Name | The name of the service. |
| Task Definition | The task definition to use for tasks in the service. |
| Platform Version | The platform version on which to run your service. |
| Created At | The timestamp for when the service was created. |
| Created By | The principal that created the service. |

Service Events

| Parameter | Description |
| --- | --- |
| Event ID | The ID of the event. |
| Service Name | The name of the service. |
| Generated Time | The date and time at which the event was generated. |
| Message | The message shown for the event. |
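
One common use of the Running Tasks and Desired Tasks parameters is spotting services that are running fewer tasks than requested. The following sketch assumes service entries shaped like the Amazon ECS DescribeServices response (`serviceName`, `status`, `runningCount`, `pendingCount`, `desiredCount`); the `services_below_desired` helper and the sample data are hypothetical and are not part of Applications Manager.

```python
from typing import Any, Dict, List


def services_below_desired(services: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Return ACTIVE services whose running task count is below the desired count."""
    shortfalls = []
    for svc in services:
        running = svc.get("runningCount", 0)
        desired = svc.get("desiredCount", 0)
        if svc.get("status") == "ACTIVE" and running < desired:
            shortfalls.append(
                {
                    "serviceName": svc.get("serviceName"),
                    "missing": desired - running,
                    # Pending tasks may close the gap once they start.
                    "pending": svc.get("pendingCount", 0),
                }
            )
    return shortfalls


# Made-up sample shaped like response["services"].
sample_services = [
    {"serviceName": "web", "status": "ACTIVE",
     "runningCount": 2, "pendingCount": 1, "desiredCount": 4},
    {"serviceName": "worker", "status": "ACTIVE",
     "runningCount": 3, "pendingCount": 0, "desiredCount": 3},
    {"serviceName": "legacy", "status": "DRAINING",
     "runningCount": 0, "pendingCount": 0, "desiredCount": 1},
]

shortfalls = services_below_desired(sample_services)
for item in shortfalls:
    print(f"{item['serviceName']}: {item['missing']} task(s) short, "
          f"{item['pending']} pending")
```

Draining services are skipped on purpose, since a service that is being removed is expected to run fewer tasks than its desired count.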

Container Instances

CPU

| Parameter | Description |
| --- | --- |
| Instance CPU Reserved Capacity | The percentage of CPU currently being reserved on a single EC2 instance in the cluster. |
| Instance CPU Utilization | The total percentage of CPU units being used on a single EC2 instance in the cluster. |

MEMORY

| Parameter | Description |
| --- | --- |
| Instance Memory Reserved Capacity | The percentage of memory currently being reserved on a single EC2 instance in the cluster. |
| Instance Memory Utilization | The total percentage of memory being used on a single EC2 instance in the cluster. |

CPU USAGE

| Parameter | Description |
| --- | --- |
| Instance CPU Used | Amount of CPU units being used on a single EC2 instance in the cluster. |
| Instance CPU Remaining | Amount of CPU units remaining after use on a single EC2 instance in the cluster. |
| Instance CPU Limit | Maximum amount of CPU units that can be assigned to a single EC2 instance in the cluster. |

MEMORY USAGE

| Parameter | Description |
| --- | --- |
| Instance Memory Used | Amount of memory being used on a single EC2 instance in the cluster (in GB). |
| Instance Memory Remaining | Amount of memory remaining after use on a single EC2 instance in the cluster (in GB). |
| Instance Memory Limit | Maximum amount of memory that can be assigned to a single EC2 instance in the cluster (in GB). |

FILESYSTEM UTILIZATION

| Parameter | Description |
| --- | --- |
| Instance FileSystem Utilization | Total amount of file system capacity being used on a single EC2 instance in the cluster (in percentage). |

NETWORK USAGE

| Parameter | Description |
| --- | --- |
| Instance Network Traffic Rate | Rate at which data is sent and received over the network on a single EC2 instance in the cluster (in kB/s). |

Container Instances

| Parameter | Description |
| --- | --- |
| Container Instance ID | The ID of the container instance. |
| Status | The status of the container instance (REGISTERING, REGISTRATION_FAILED, ACTIVE, INACTIVE, DEREGISTERING, DRAINING). |
| Running Tasks | Number of tasks that are in the RUNNING state per container instance. |
| Pending Tasks | Number of tasks that are in the PENDING state per container instance. |
| Version | The version counter for the container instance. Every time a container instance experiences a change that triggers a CloudWatch event, the version counter is incremented. |
| Agent Version | The version number of the Amazon ECS container agent. |
| Agent Connected | Indicates whether the agent is connected to Amazon ECS (True or False). |
| Instance ID | The EC2 instance ID of the container instance. |
| Registered At | The timestamp for when the container instance was registered. |

Container Instances Insights

| Parameter | Description |
| --- | --- |
| Container Instance ID | The ID of the container instance. |
| Memory Reserved Capacity | Amount of memory currently being reserved on the instance (in percentage). |
| Memory Utilization | Amount of memory currently being used on the instance (in percentage). |
| CPU Reserved Capacity | Amount of CPU currently being reserved on the instance (in percentage). |
| CPU Utilization | Amount of CPU currently being used on the instance (in percentage). |
| FileSystem Utilization | Amount of file system capacity being used on the instance (in percentage). |
| Network Traffic Rate | Rate at which data is sent and received over the network on the instance (in kB/s). |

Instances Resource Details

| Parameter | Description |
| --- | --- |
| Container Instance ID | The unique identifier for the container instance. |
| Available CPU | Amount of CPU units available to allocate to tasks. |
| Available Memory | Amount of memory available to allocate to tasks (in MB). |
| Reserved Ports | The ports that were reserved by the Amazon ECS container agent. |
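
The Available CPU, Available Memory, and Reserved Capacity parameters can be related to each other with simple arithmetic. The sketch below assumes a container instance entry shaped like the Amazon ECS DescribeContainerInstances response, where `registeredResources` and `remainingResources` are lists of named resources with an `integerValue`. It covers reservation only, not actual utilization, and the `instance_reservation` helper and sample figures are illustrative rather than the product's own method.

```python
from typing import Any, Dict, List


def resource_value(resources: List[Dict[str, Any]], name: str) -> int:
    """Look up the integer value of a named resource such as CPU or MEMORY."""
    for resource in resources:
        if resource.get("name") == name:
            return int(resource.get("integerValue", 0))
    return 0


def instance_reservation(instance: Dict[str, Any]) -> Dict[str, Dict[str, float]]:
    """Derive limit, available, reserved, and reserved percentage per resource."""
    result = {}
    for name in ("CPU", "MEMORY"):
        limit = resource_value(instance.get("registeredResources", []), name)
        available = resource_value(instance.get("remainingResources", []), name)
        reserved = limit - available
        reserved_pct = round(100.0 * reserved / limit, 2) if limit else 0.0
        result[name] = {
            "limit": limit,
            "available": available,
            "reserved": reserved,
            "reserved_pct": reserved_pct,
        }
    return result


# Made-up sample shaped like one element of response["containerInstances"].
sample_instance = {
    "registeredResources": [
        {"name": "CPU", "type": "INTEGER", "integerValue": 2048},
        {"name": "MEMORY", "type": "INTEGER", "integerValue": 4096},
    ],
    "remainingResources": [
        {"name": "CPU", "type": "INTEGER", "integerValue": 1536},
        {"name": "MEMORY", "type": "INTEGER", "integerValue": 1024},
    ],
}

usage = instance_reservation(sample_instance)
for name, values in usage.items():
    print(name, values)
```

With these sample values, 512 of 2048 CPU units (25%) and 3072 of 4096 MiB of memory (75%) are reserved on the instance.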
