Amazon Redshift Monitoring

Overview

Redshift is a fully managed, cloud-based data warehousing service designed for fast analytic queries on large datasets. It uses a cluster-based architecture with massively parallel processing (MPP) and columnar storage to deliver petabyte-scale warehousing capabilities.

Applications Manager’s Redshift monitoring tool helps you gain visibility into the performance, resource utilization, and availability of your Redshift clusters. The monitor collects essential metrics related to cluster status, database performance, resource usage, and configuration parameters. Monitoring these metrics enables administrators to identify performance bottlenecks, optimize query workloads, and ensure smooth data warehouse operations.

Creating a new Redshift monitor

To learn how to create a new Redshift monitor, refer here.

Monitored Parameters

Go to the Monitors Category View by clicking the Monitors tab. Click on the Redshift instance available under Amazon in the Cloud Apps section. Displayed is the Redshift bulk configuration view distributed into three tabs:

Availability tab shows the availability history for the past 24 hours or 30 days.
Performance tab shows health status and events for the past 24 hours or 30 days.
List view tab enables you to perform bulk admin configurations.

By clicking a monitor from the list, you'll be taken to the Amazon Redshift dashboard which includes the following tabs:

Performance Overview

Parameter	Description
CLUSTER INFORMATION
Health Status	Indicates the health of the cluster. Possible values: Healthy, Unhealthy.
Maintenance Mode	Indicates whether the cluster is in maintenance mode. Possible values: ON, OFF.
Cluster Status	Displays the current operational state of the cluster.
Cluster Availability Status	Shows the availability of the cluster for queries. Possible values: Available, Unavailable, Maintenance, Modifying, Failed.
Number of Nodes	Total number of compute nodes in the cluster.
Total Storage Capacity	The total storage capacity of the cluster, measured in GiB.
CPU UTILIZATION
CPU Utilization (%)	The average percentage of CPU utilization for the entire cluster at the time of polling, representing overall processing load across leader and compute nodes.
DISK UTILIZATION
Disk Utilization (%)	The average percentage of disk space used across the cluster at the time of polling, representing overall storage consumption.
DATA THROUGHPUT
Rate of Read Throughput	Total amount of data read from disk per second by the cluster between the polling interval (MB/second).
Read Throughput	Total amount of data read from disk between the polling interval (MB).
Rate of Write Throughput	Total amount of data written to disk per second by the cluster between the polling interval (MB/second).
Write Throughput	Total amount of data written to disk between the polling interval (MB).
NETWORK THROUGHPUT
Inbound Network Throughput	The average rate at which the cluster receives data over the network between the polling interval (MB/second).
Outbound Network Throughput	The average rate at which the cluster sends data over the network between the polling interval (MB/second).
CLUSTER IOPS
Read IOPS	The average number of disk read operations performed per second by the cluster between the polling interval.
Write IOPS	The average number of disk write operations performed per second by the cluster between the polling interval.
CLUSTER LATENCY
Read Latency	The average time taken by the cluster to complete disk read operations between the polling interval (seconds).
Write Latency	The average time taken by the cluster to complete disk write operations between the polling interval (seconds).
DATABASE CONNECTIONS
Database Connections	The average number of active database connections to the cluster between the polling interval.
COMMIT QUEUE LENGTH
Commit Queue Length	The maximum number of transactions waiting to be committed during the poll interval.
ACTIVE USER TABLES
Active User Tables	The total number of active user tables in the cluster between the polling interval, excluding Redshift Spectrum tables.
EXCEEDED SCHEMA QUOTAS
Exceeded Schema Quotas	The maximum number of schemas that have exceeded their configured storage quotas between the polling interval.
CONCURRENCY SCALING SECONDS
Concurrency Scaling Seconds	Total seconds consumed by concurrency scaling clusters actively processing queries between the polling interval.
CONCURRENCY SCALING CLUSTERS
Active Concurrency Scaling Clusters	The maximum number of concurrency scaling clusters actively processing queries between the polling interval.
Max Concurrency Scaling Clusters	The maximum number of concurrency scaling clusters allowed for the cluster based on parameter group settings at the time of polling.

Note:Concurrency Scaling Seconds and Concurrency Scaling Clusters graph metrics are disabled from data collection by default and mapped under performance polling as Concurrency Scaling Clusters. To enable data collection, navigate to Settings → Performance Polling, select the Optimize Data Collection tab, choose Amazon Redshift as the monitor type and Concurrency Scaling Clusters as the metric name, then set the preferred time interval.

Database Performance

Parameter	Description
QUERY DURATION
Long Query Duration (>10 min)	Average time taken to complete queries that exceed 10 minutes between the polling interval (seconds).
Medium Query Duration (1–10 min)	Average time to complete queries that run between 1 and 10 minutes between the polling interval (seconds).
Short Query Duration (<1 min)	Average time to complete queries taking less than 1 minute between the polling interval (seconds).
QUERY THROUGHPUT
Long Query Throughput (>10 min)	Average number of long queries completed per second between the polling interval (queries/second).
Medium Query Throughput (1–10 min)	Average number of medium queries completed per second between the polling interval (queries/second).
Short Query Throughput (<1 min)	Average number of short queries completed per second between the polling interval (queries/second).
QUERY LIFECYCLE PHASES
Query Planning Time	Average time queries spent in the QueryPlanning stage between the polling interval (milliseconds).
Query Waiting Time	Average time queries spent in the QueryWaiting stage between the polling interval (milliseconds).
Query Commit Time	Average time queries spent in the QueryCommit stage between the polling interval (milliseconds).
QUERY DATA READ TIME
Query Executing Read Time	Average time queries spent in the QueryExecutingRead stage between the polling interval(milliseconds).
Query Executing Unload Time	Average time queries spent in the QueryExecutingUnload stage between the polling interval (milliseconds).
QUERY DATA MODIFICATION TIME
Query Executing Insert Time	Average time queries spent in the QueryExecutingInsert stage between the polling interval (milliseconds).
Query Executing Delete Time	Average time queries spent in the QueryExecutingDelete stage between the polling interval (milliseconds).
Query Executing Update Time	Average time queries spent in the QueryExecutingUpdate stage between the polling interval (milliseconds).
Query Executing CTAS Time	Average time queries spent in the QueryExecutingCtas stage between the polling interval (milliseconds).
Query Executing Copy Time	Average time queries spent in the QueryExecutingCopy stage between the polling interval (milliseconds).

Note:Query Lifecycle Phases, Query Data Read Time, and Query Data Modification Time graph metrics are disabled from data collection by default and mapped under performance polling as Query Runtime Breakdown. To enable data collection, navigate to Settings → Performance Polling, select the Optimize Data Collection tab, choose Amazon Redshift as the monitor type and Query Runtime Breakdown as the metric name, then set the preferred time interval.

Node

Parameter	Description
NODE PERFORMANCE DETAILS
Node ID	The unique identifier of the node in the cluster.
CPU Utilization	The average percentage of CPU utilization for the entire Amazon Redshift cluster at the time of polling, representing the overall processing load for specific leader or compute nodes (%).
Disk Utilization	The average percentage of disk space used on an individual node within the Amazon Redshift cluster at the time of polling, indicating storage consumption for that specific leader or compute node (%).
Read Throughput	The total amount of data read from disk by an individual node within the Amazon Redshift cluster during the polling interval, indicating read throughput for that specific leader or compute node (MB/s).
Write Throughput	The total amount of data written to disk by an individual node within the Amazon Redshift cluster between the polling interval (MB/s).
Inbound Network Throughput	The average rate at which an individual node within the Amazon Redshift cluster receives data over the network between the polling interval (MB/s).
Outbound Network Throughput	The average rate at which an individual node within the Amazon Redshift cluster sends data over the network during the polling interval, indicating outbound network throughput for that specific leader or compute node (MB/s).
NODE IOPS DETAILS
Node ID	The unique identifier of the node in the cluster.
Read IOPS	The average number of disk read operations performed by an individual node within the Amazon Redshift cluster per second between the polling interval (operations/s).
Write IOPS	The average number of disk write operations performed by an individual node within the Amazon Redshift cluster per second during the polling interval, indicating write IOPS for that specific leader or compute node (operations/s).
Read Latency	The average time taken by an individual node in seconds, within the Amazon Redshift cluster to complete disk read I/O operations between the polling interval (seconds).
Write Latency	The average time taken by an individual node in seconds, within the Amazon Redshift cluster to complete disk write I/O operations between the polling interval (seconds).

Note:Node Performance Details and Node IOPS Details metrics are disabled from data collection by default and mapped under performance polling as Node Metrics. To enable data collection, navigate to Settings → Performance Polling, select the Optimize Data Collection tab, choose Amazon Redshift as the monitor type and Node metrics as the metric name, then set the preferred time interval.

Configuration

Parameter	Description
CLUSTER CONFIGURATION
Node Type	The node type of nodes in the cluster.
Endpoint Address	The DNS address of the cluster.
Cluster Creation Time	The date and time that the cluster was created.
Availability Zone	The name of the Availability Zone in which the cluster is located.
NETWORK CONFIGURATION
VPC ID	The identifier of the VPC the cluster is in, if the cluster is in a VPC.
Publicly Accessible	Indicates whether the cluster can be accessed from a public network. Possible values: Yes, No.
Cluster Subnet Group	The name of the subnet group that is associated with the cluster. This parameter is valid only when the cluster is in a VPC.
Automated Snapshot Retention Period	The number of days that automatic cluster snapshots are retained.
DATABASE CONFIGURATION
Database Name	The name of the initial database created with the cluster; defaults to dev if not specified.
Master Username	The admin username for the cluster. This name is used to connect to the database specified in the DBName parameter.
Parameter Group	The name of the parameter group associated with the cluster.
Parameter Apply Status	Indicates whether parameter changes have been applied to the cluster.
Encryption	Indicates whether data in the cluster is encrypted at rest. Possible values: Enabled, Disabled.
Endpoint Port	The port that the database engine is listening on.
MAINTENANCE CONFIGURATION
Cluster Version	The version ID of the Amazon Redshift engine running on the cluster.
Allow Version Upgrade	Indicates if major version upgrades are applied automatically during maintenance windows.
Preferred Maintenance Window	The weekly UTC time range when system maintenance can occur.
Next Schedule	The number of days until the next scheduled maintenance window.

Amazon Redshift Monitoring

Overview

Creating a new Redshift monitor

Monitored Parameters

Performance Overview

Database Performance

Node

Configuration

Loved by customers all over the world

"Standout Tool With Extensive Monitoring Capabilities"

"I like Applications Manager because it helps us to detect issues present in our servers and SQL databases."

Carlos Rivero

Trusted by over 6000+ businesses globally