Schedule demo

Elasticsearch Monitoring


Elasticsearch - An Overview

Elasticsearch is a highly scalable, distributed, open source RESTful search and analytics engine. It is multitenant-capable with an HTTP web interface and schema-free JSON documents. Based on Apache Lucene, Elasticsearch is one of the most popular enterprise search engines today and is capable of solving a growing number of use cases like log analytics, real-time application monitoring, and click stream analytics.

Monitoring Elasticsearch - What we do

Let’s take a look at what you need to see to monitor Elasticsearch, the performance metrics to gather and how you can ensure that your search server is up and operating as expected with Applications Manager's Elasticsearch monitoring:

  • Resource Utilization Details - Applications Manager automatically discovers Elasticsearch servers, monitors memory and CPU and notifies you of changes in resource consumption of thread pool queues.
  • Real-Time Data - You get up-to-the-second insight into cluster runtime metrics, individual cluster nodes, real-time threads and configurations.
  • Cluster and Node Monitoring - Stay on top of your cluster and node health in real-time with fine-grained statistics of performance from Disk I/O Java to Memory usage metrics.
  • Search and Indexing Performance - Gain complete control of your indexes and mappings. Monitor query latency, file system cache usage and request rates and take action if it surpasses a threshold.
  • Fix Performance Problems Faster - Get instant notifications when there are performance issues. Become aware of performance bottlenecks and take quick remedial actions before your end users experience issues.

Creating a new Elasticsearch monitor

Using the REST API to add a new Elasticsearch monitor:Click here

To create an Elasticsearch Monitor, follow the steps given below:

  • Click on New Monitor link. Choose ElasticsearchCluster.
  • Specify the Display Name of the Elasticsearch monitor.
  • Enter the HostName or IP Address of the host where Elasticsearch Cluster runs.
  • Enter the Port of the Elasticsearch Cluster. By default, it will be 9200.
  • Enter the polling interval time in minutes.
  • Click Test Credentials button, if you want to test the access to Elasticsearch server.
  • Choose the Monitor Group from the combo box with which you want to associate Elasticsearch Monitor (optional). You can choose multiple groups to associate your monitor.
  • Click Add Monitor(s). This discovers Elasticsearch from the network and starts monitoring.
Note:
  • Security/Firewall Requirements - The Elastic Search Cluster host and port should be accessible from the machine where Applications Manager is installed.
  • User Privilege - The required user credentials should be provided.

Monitored Parameters

Go to the Monitors Category View by clicking the Monitors tab. Click on the Elasticsearch and ElasticsearchCluster monitors under the Web Server/Services Table. Displayed is the Elasticsearch or the ElasticsearchCluster bulk configuration view distributed into three tabs:

  • Availability tab displays the Availability history for the past 24 hours or 30 days.
  • Performance tab displays the Health Status and events for the past 24 hours or 30 days.
  • List view enables you to perform bulk admin configurations.

Click on the monitor name to see all the server details listed under the following tabs:

Elasticsearch Cluster

Overview

ParameterDescription
Node Details
Node NameThe name of the node
Node TypeThe type of the node (Client or Data or Master-Eligible or Master-Data)
Avg Query TimeThe first phase of search operation is Query. The time taken to process the query in all shards
Avg Fetch TimeThe second phase of search operation is Fetch. The time taken to retrieve the query result, only from the shards which have the requested data.
CLUSTER OVERVIEW
Cluster StatusThe status of the cluster depending on the replicas of the cluster.
Total NodesThe total number of nodes in the cluster.
Total IndicesThe total number of indices in the cluster.
Total ShardsThe total number of shards in the cluster.
Total DocsThe total number of documents present in the cluster.

Cluster Details

ParameterDescription
NODES SPLITUP
Client NodeThe total number of Client Nodes in the cluster.
Data NodeThe total number of Data Nodes in the cluster.
Master NodeThe total number of Master Eligible Nodes in the cluster.
Data-Master NodeThe total number of Data Nodes, which also acts as Master Eligible Nodes in the cluster.
SHARDS COUNT
Active ShardsThe number of Active Shards present in the cluster.
Active Primary ShardsThe number of Primary Shards that are Active in the cluster.
Relocating ShardsThe number of Relocating Shards present in the cluster.
Initializing ShardsThe number of Initializing Shards present in the cluster.
Unassigned ShardsThe number of Unassigned Shards present in the cluster.
Delayed Unassigned ShardsThe number of Delayed Unassigned Shards present in the cluster.
Total ShardsThe number of Shards present in the cluster.
Top 20 Pending Tasks by Priority
Insert OrderThe order of the task in which the pending task is inserted into the queue.
PriorityThe priority assigned for the particular task.
SourceThe source for the pending task.
Wait Time by PriorityThe total waiting time of the pending task in that queue based on priority (in milliseconds).
Top 20 Pending Tasks by Wait Time
Insert OrderThe order of the task in which the pending task is inserted into the queue.
Priority by Wait TimeThe priority assigned for the particular task based on Wait Time.
SourceThe source for the pending task
Wait TimeThe total waiting time of the pending task in that queue (in milliseconds).

Indices

PARAMETERDESCRIPTION
Indices Overview
Index NameThe name of the index representing a collection of documents.
DocumentsIndicates the number of documents that are available in the particular index.
Indexing LatencyAmount of time taken to index a document in the particular index (in millisecond).
Indexing RateThe number of documents that are indexed per second.
Query LatencyAmount of time taken to process the query in the particular index (in millisecond).
Query RateThe number of queries that are processed by the index per second.
Fetch LatencyAmount of time taken to run the query and retrieve the data in the particular index (in millisecond).
Fetch RateThe number of queries that are run and retrieved data by the index per second.
Current MergesIndicates the number of merges that have occurred in the particular index.
Merge TimeAmount of time taken to merge segments in the particular index (in millisecond).
Flush TimeAmount of time taken to flush one or more indices to disk (in millisecond).
Refresh TimeAmount of time taken to refresh an index (in millisecond).

Configuration

PARAMETERDESCRIPTION
CONFIGURATION DETAILS
Cluster NameThe name of the cluster.
Total NodesThe total number of nodes in the cluster.
Master Node NameThe name of the Master Node in the cluster.
Master Node PortThe port on which the Master node of Elasticsearch runs.
Master Node IPThe IP address in which the Master Node runs.
Publish PortThe publish port of the cluster.

Elasticsearch

Overview

PARAMETERDESCRIPTION
AVERAGE SYSTEM LOAD
Avg. System LoadThe average value of the amount of load that is being processed by the system (in the last 1 minute, 5 minutes, and 15 minutes).
CPU UTILIZATION
CPU UtilizationAmount of CPU currently being utilized by the node (in %). 
SEARCH TIME
Average Query TimeThe first phase of search operation is Query. The time taken to process the query in all shards
Average Fetch TimeThe second phase of search operation is Fetch. The time taken to retrieve the query result, only from the shards which have the requested data.
SEGMENT TIME
Average Merge TimeThe average time taken for segment merging in a node. (A shard in elasticsearch is a Lucene index, broken down into segments. Segments are, periodically, merged into larger segments to keep the index size at bay and expunge deletes.)
Average Refresh TimeThe average time spent in refreshing an index. (Refresh time increases with the number of file operations for the Lucene index).
INDEXING TIME
Average Index TimeThe average time taken to index a document. (Documents are indexed i.e stored and made searchable.)
Average Delete TimeThe average time taken to delete an existing index.
Indexed CountThe number of documents indexed.
Deleted CountThe number of deleted documents.
Indexing RateThe number of documents that are indexed per second.
GET TIME
Average Get TimeThe average time taken to retrieve information about one or more indexes
Existing CountThe number of get requests that were present.
Missing CountThe number of get requests that were missing.
FLUSH TIME
Average Flush TimeThe average time taken to flush one or more indices to disk. (The flush process of an index basically frees memory from the index by flushing data to the index storage and clearing the internal transaction log.)
WARMER TIME
Average Warmer TimeThe average time taken to perform a warmup search on an index. (Index warming allows to run registered search requests to warm up the index before it is available for search.)
PERCOLATE TIME
Average Percolate TimeThe average time spent running percolator queries. (One of Elasticsearch's core feature is the ability to do search in reverse with the percolator. The percolator automatically indexes the query terms with the percolator queries. This allows the percolator to percolate documents more quickly.)

Memory Details

The total space used in the Direct Buffer pool.

PARAMETERDESCRIPTION
HEAP MEMORY
Used Heap PercentThe percentage of JVM heap currently in use.
Free Heap PercentThe percentage of JVM heap currently free
NON-HEAP MEMORY
Used Non-Heap PercentThe percentage of non-heap memory currently in use.
Free Non-Heap PercentThe percentage of non-heap memory currently free.
GARBAGE COLLECTION
GC Time - YoungThe total time spent on young-generation garbage collections.
GC Time - OldThe total time spent on old-generation garbage collections.
GC Count - YoungThe total number of young-generation garbage collections.
GC Count - OldThe total number of old-generation garbage collections.
BUFFER POOLS
Direct Buffer Space UsedThe total space used in the Direct Buffer pool.
Mapped Buffer Space UsedThe total space used in the Mapped Buffer pool.
Direct Buffer Connection CountThe total connections to Direct Buffer pool.
Mapped Buffer Connection CountThe total connections to Mapped Buffer pool.

I/O Details

PARAMETERDESCRIPTION
DISK I/O COUNT
Disk Read CountThe number of read ( from the disk) requests by Elasticsearch.
Disk Write CountThe number of write ( to the disk) requests by Elasticsearch.
DISK I/O SIZE
Disk Read SizeThe total size of read requests ( from the disk) by Elasticsearch.
Disk Write SizeThe total size of write requests ( to the disk) by Elasticsearch.
CACHE DETAILS
Cache NameThe name of the cache.
Total Size (MB)The size of the cache.
EvictionsThe number of evictions from the filter cache.
BREAKER DETAILS
Breaker NameThe name of the Circuit Breaker. (Circuit breakers are designed to deal with situations when request processing needs more memory than available. This would mean OOM (OutOfMemoryException). So sometimes it is better to fail a query instead of getting OOM, because when OOM appears JVM becomes not responsive.)
Limit Size (MB)The limit size of the particular Breaker.
Used Size (MB)The used size of the particular Breaker.
TrippedThe total number of times the breaker circuit tripped.

Thread Pools

PARAMETERDESCRIPTION
THREAD DETAILS
Thread NameThe name of the thread.
Configured ThreadsThe number of threads of current configured type.
QueueThe number of thread of current type in queue.
ActiveThe number of active threads of current type.
RejectedThe number of rejected threads of current type.
LargestThe number of largest threads of current type.

Network

PARAMETERDESCRIPTION
TRANSPORT
Transmitted BytesThe number of bytes sent by the network. (Transport metrics about cluster communication)
Received BytesThe number of bytes received by the network. (Transport metrics about cluster communication)
Transmitted PacketsThe number of data packets sent by the network. (Transport metrics about cluster communication)
Received PacketsThe number of data packets received by the network. (Transport metrics about cluster communication)
TCP CONNECTOR
Active ConnectionsThe number of active TCP connections.
Passive ConnectionsThe number of passive TCP connections.
HTTP CONNECTOR
Current ConnectionsThe number of http connections currently active.
Total ConnectionsThe total number of http connections.

Configuration

PARAMETERDESCRIPTION
CONFIGURATION DETAILS
Cluster NameThe name of the cluster.
Node NameThe name of the node in the cluster.
Node TypeThe type of the node (Client/Data/Master-Eligible/Data-Master).
HostThe IP address of the Host.
ElasticSearch VersionThe version of the installed Elasticsearch.
PortThe port in which Elasticsearch runs.
ElasticSearch HomeThe home directory of Elasticsearch.
Total ProcessorsThe total number of processors in the current node
Java VersionThe version of Java running in the node.
Java VendorThe Java vendor.

Loved by customers all over the world

"Standout Tool With Extensive Monitoring Capabilities"

It allows us to track crucial metrics such as response times, resource utilization, error rates, and transaction performance. The real-time monitoring alerts promptly notify us of any issues or anomalies, enabling us to take immediate action.

Reviewer Role: Research and Development

carlos-rivero
"I like Applications Manager because it helps us to detect issues present in our servers and SQL databases."
Carlos Rivero

Tech Support Manager, Lexmark

Trusted by over 6000+ businesses globally