Apache Cassandra Monitoring


Overview

Applications Manager provides out-of-the-box performance metrics and helps you visualize the health and availability of your Apache Cassandra NoSQL database.

Creating a new Cassandra DB monitor

Supported Versions: Apache Cassandra 2.0 and above.

Prerequisite: Click here to know about the necessary prerequisites to monitor an Apache Cassandra database.

Using the REST API to add a new Cassandra monitor: Click here

To create a Cassandra database server monitor, follow the steps given below:

  1. Click the New Monitor link.
  2. Select Cassandra.
  3. Specify the Display Name of the Cassandra monitor.
  4. Enter the HostName or IP Address of the host where the Cassandra server runs.
  5. Enter the JMX Port on which the server is listening. The default is 7199; if it has been changed, check the JMX_PORT value in the cassandra-env.sh file.
  6. To discover only this node, disable the Discover all nodes in the Cluster option. It is enabled by default, which means all the nodes in the cluster are discovered.
  7. If JMX authentication is enabled, select the Is Authentication Required field and enter the UserName and Password of the JMX credentials used to connect to the Cassandra server.
  8. Specify the Polling Interval in minutes.
  9. If you are adding a new monitor from the Central Server, select a Probe Server.
  10. Optionally, choose from the combo box the Monitor Group with which you want to associate the Cassandra server. You can associate the monitor with multiple groups.
  11. Click Add Monitor(s). This discovers the Cassandra server from the network and starts monitoring it.
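
Step 5 points to the JMX_PORT setting in cassandra-env.sh. As a minimal sketch, the lookup can be scripted as shown here. The function name `find_jmx_port` and the sample text are hypothetical, and the sketch assumes the file uses a plain `JMX_PORT="<number>"` assignment; it falls back to the default of 7199 when no such line is found.

```python
import re

# Matches uncommented assignments such as JMX_PORT="7199" or JMX_PORT=7199.
_JMX_PORT_RE = re.compile(r'^\s*JMX_PORT\s*=\s*"?(\d+)"?', re.MULTILINE)

DEFAULT_JMX_PORT = 7199  # default named in step 5


def find_jmx_port(env_text: str) -> int:
    """Return the JMX_PORT assigned in cassandra-env.sh text, or 7199 if absent."""
    match = _JMX_PORT_RE.search(env_text)
    return int(match.group(1)) if match else DEFAULT_JMX_PORT


# Illustrative snippet only, not a complete cassandra-env.sh file.
sample = '''
# Port over which Cassandra is available for JMX connections
JMX_PORT="7200"
'''
print(find_jmx_port(sample))  # prints 7200
```

To use it against a real installation, read the contents of your own cassandra-env.sh and pass the text to `find_jmx_port`; the location of that file depends on how Cassandra was installed.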

Note:
If you are unable to add the monitor even after enabling JMX, try starting Cassandra with the following JVM argument, replacing [YOUR_IP] with the IP address of the Cassandra host:
 -Djava.rmi.server.hostname=[YOUR_IP]
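
Before changing JVM arguments, it can help to confirm that the JMX port is reachable at all from the machine running Applications Manager. The sketch below is only a plain TCP connection test; the function name `jmx_port_reachable` and the host used in the example are hypothetical. A successful connection does not prove that JMX itself works, since the RMI hostname problem described in the note can still occur, but a failed connection points to a firewall, port, or address problem instead.

```python
import socket


def jmx_port_reachable(host: str, port: int = 7199, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False


if __name__ == "__main__":
    # Hypothetical target; replace with the Cassandra host and its JMX port.
    print(jmx_port_reachable("127.0.0.1", 7199))
```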

Monitored Parameters

Click an individual monitor in the list to view the following information, organized into the tabs below:

Monitor Information

This tab displays basic information such as the name of the server, the host name, and the JMX port on which the server is running. Additional parameters such as Token, Gossip Active, Load, Generation No, Uptime, DataCenter, and Rack Name are also displayed. These correspond to the output of the nodetool command 'nodetool.bat/sh -h <hostname> -p <portnumber> info'.
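
To compare the values in this tab with what nodetool reports, the command can be run and its output parsed. This is a minimal sketch, assuming nodetool is on the PATH (use nodetool.bat on Windows) and that the output consists of `Key : Value` lines. The helper names and the sample output are illustrative, and the exact field names vary by Cassandra version.

```python
import subprocess
from typing import Dict, List


def nodetool_args(host: str, port: int, command: str) -> List[str]:
    """Build the argument list for 'nodetool -h <host> -p <port> <command>'."""
    return ["nodetool", "-h", host, "-p", str(port), command]


def parse_info(output: str) -> Dict[str, str]:
    """Parse 'Key : Value' lines into a dict, splitting on the first colon."""
    fields = {}
    for line in output.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip():
            fields[key.strip()] = value.strip()
    return fields


def node_info(host: str, port: int = 7199) -> Dict[str, str]:
    """Run 'nodetool info' against a node and return its parsed fields."""
    result = subprocess.run(nodetool_args(host, port, "info"),
                            capture_output=True, text=True, check=True)
    return parse_info(result.stdout)


# Illustrative output only; real field names and layout vary by version.
sample = """\
Gossip active          : true
Load                   : 120.5 KB
Generation No          : 1700000000
Uptime (seconds)       : 3600
Data Center            : datacenter1
Rack                   : rack1
"""
info = parse_info(sample)
print(info["Data Center"], info["Rack"])  # prints: datacenter1 rack1
```

Only `parse_info` is exercised on the sample text; `node_info` needs a running Cassandra node that accepts JMX connections.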

Performance

| Parameters | Description |
|---|---|
| Availability and Performance History | Displays the availability and performance history for the last six hours. |
| CPU Usage (available up to version 2.1.13) | CPU usage, as a percentage. |
| Memory Utilization | Used, free, and total memory of the server, in megabytes. |
| Exceptions (available up to version 2.1.13) | Count of various exceptions such as not found, invalid, unavailable, and timed out. |
| Storage Stats | Storage load, in GB. |
| Operation Stats (available up to version 2.1.13) | Number of range, read, and write operations per second since executor start. |
| Operation Latency (available up to version 2.1.13) | Latency of range, read, and write operations since the last time the attribute was read. |
| Bloom Filter Statistics (available from version 2.0.9 onwards) | |
| Bloom Filter False Positives | Number of false positives (a false positive occurs when the bloom filter reports that a nonexistent row exists). |
| Bloom Filter False Positive Ratio | Fraction of all bloom filter checks that result in a false positive. |
| Bloom Filter Disk Space Used | Size of the bloom filter files on disk. |
| Memtable Statistics (available from version 2.0.9 onwards) | |
| Memtable Columns Count | Total number of columns present in the memtable. |
| Memtable Data Size | Amount of data stored in the memtable, including column-related overhead. |
| Memtable Switch Count | Number of times a flush has resulted in the memtable being switched out. |
| Client Details | |
| Connected Native Clients | Number of native clients currently connected to the server instance. |
| Connected Thrift Clients | Number of Thrift clients currently connected to the server instance. |
| Average Operational Latency (available up to version 2.1.13) | |
| Average Range Latency | Average latency per range operation since the last poll. |
| Average Read Latency | Average latency per read operation since the last poll. |
| Average Write Latency | Average latency per write operation since the last poll. |
| Operational Timeouts (available up to version 2.1.13) | |
| Recent Timeouts | Number of read, write, and range slice operation timeouts since the last poll. |
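
Several metrics above are described as averages "since the last poll". One common way to derive such a value from cumulative counters is to divide the change in total latency by the change in operation count between two polls. The sketch below illustrates that arithmetic only; whether Applications Manager computes its averages this way is an assumption, and the function name, parameter names, and numbers are hypothetical.

```python
def average_latency(prev_total_us: int, prev_count: int,
                    cur_total_us: int, cur_count: int) -> float:
    """Average latency per operation (microseconds) between two polls.

    Inputs are cumulative totals sampled at the previous and current poll.
    Returns 0.0 when no operations completed in the interval.
    """
    ops = cur_count - prev_count
    if ops <= 0:
        return 0.0
    return (cur_total_us - prev_total_us) / ops


# Hypothetical samples: 900 reads took 450,000 microseconds in the interval.
print(average_latency(1_000_000, 2000, 1_450_000, 2900))  # prints 500.0
```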

Pending Tasks

| Parameters | Description |
|---|---|
| Commitlog/Compaction - Pending Tasks (available up to version 2.1.13) | |
| Commitlog - Pending Tasks | Number of Commitlog tasks waiting in the queue to be executed. |
| Compaction - Pending Tasks | Number of Compaction tasks waiting in the queue to be executed. |
| Commands/Responses - Pending Tasks (available up to version 2.1.13) | |
| Commands - Pending Tasks | Number of Command tasks waiting in the queue to be executed. |
| Responses - Pending Tasks | Number of Response tasks waiting in the queue to be executed. |
| Task Statistics of Thread Pools (available up to version 2.1.13) | Status of the various thread pool stages, including active, pending, completed, and blocked tasks. These correspond to the output of the nodetool command 'nodetool.bat/sh -h <hostname> -p <portnumber> tpstats'. |
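
To cross-check the thread pool figures against nodetool, the tpstats output can be parsed into per-pool counters. This is a minimal sketch, assuming each pool row is a name followed by five integer columns (active, pending, completed, blocked, all time blocked). That layout, the `parse_tpstats` name, and the sample text are assumptions for illustration, and the columns may differ between Cassandra versions.

```python
from typing import Dict, NamedTuple


class PoolStats(NamedTuple):
    active: int
    pending: int
    completed: int
    blocked: int
    all_time_blocked: int


def parse_tpstats(output: str) -> Dict[str, PoolStats]:
    """Map each thread pool name to its counters; skip all other lines."""
    pools = {}
    for line in output.splitlines():
        parts = line.split()
        # A pool row is a name followed by exactly five integer columns.
        if len(parts) == 6 and all(p.isdigit() for p in parts[1:]):
            pools[parts[0]] = PoolStats(*map(int, parts[1:]))
    return pools


# Illustrative output only; real output lists more pools and sections.
sample = """\
Pool Name                    Active   Pending      Completed   Blocked  All time blocked
ReadStage                         0         3            103         0                 0
MutationStage                     1         0           2001         0                 0
"""
stats = parse_tpstats(sample)
backlog = {name: s.pending for name, s in stats.items() if s.pending > 0}
print(backlog)  # prints {'ReadStage': 3}
```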

Cluster Stats

| Parameters | Description |
|---|---|
| Cluster Details | Specifies details on live, leaving, moving, joining and unreachable nodes. |
| Cluster Node Details | Health status and storage load (in GB) of the nodes in the cluster. |

Messages

This tab contains the Dropped Message and Message Timeout statistics:

| Parameter | Description |
|---|---|
| Recent Total Timeouts | Number of timeouts since the last poll. |

Configurations

This tab displays the configuration values defined in the cassandra.yaml configuration file.
