OpManager -

The network, server and virtualization monitoring software

(Download PDF)

Network Monitoring – The Must Haves

 

The definition of Network monitoring no longer confines to just setting up a solution that is capable of garnering information from the network. With the corporate world looking at IT as that critical function having a say on day–to–day business, the choice of a network monitoring solution must factor–in the business aspects such as increased employee productivity and saving on infrastructure costs, besides the obvious need of the solution having to aide the business with all the ‘must–have’ functionality either built–in or by way of facilitating useful plug–ins. No two networks are same and so it is only fair to expect a network monitoring solution to work on a ‘one size fits all’ principle in the not–so–perfect IT world.

When we talk about IT directly impacting the business, it could be anything from a seemingly simple problem like a web page taking eternity to load or a poor LAN connectivity, to more serious ones like an important email from a prospect not making it to your inbox, a CRM database crash, or even dealing with a mischief–maker within. While these issues can be addressed by a variety of vertical solutions in the market, there is nothing like the convenience of a single–point access to visualize the entire network to manage the fault, performance, configuration, and security or the other resources within. With a wide range of IT functionality fitting into the ‘network monitoring’ umbrella, it is important to look for the ‘must–haves’ to help align your IT with the business goals. It serves to keep in mind that to an administrator, network performance, security, fault management, and reliability are not mutually exclusive. A solution that serves all of these on one platter, keeps the administrator and his network happy! Anything more is a welcome bonus!

The must–haves

Let us take a closer look at the main concerns of an administrator and what data he will mine for in the minutiae of information gathered by the solution :

Automatic Discovery

With the corporate networks getting more complex due to huge and distributed infrastructure, automatic discovery leaves little room for manual errors. Constant upgrades and additions to the network is nothing new and this calls for a provision to initiate a discovery on demand too. So, a solution must be capable of automating the discovery and it must also accommodate a forced discovery.

Smart Classification & Mapping

The infrastructure to be managed include network devices, servers, applications, and other resources on the network. Different parameters determine the health of performance of these resources on the network. Classification of the infrastructure based on the type, and provision to map or logically group devices like clustered environments or geographically distributed resources, empowers the administrator and helps him visualize his network and manage.

In-depth Performance Monitoring

Performance degradation is an administrator’s nightmare. Any network resource can pull down the performance of a network, and the factors affecting the performance could be internal or external. Faults such as a hardware resource outage, a WAN link failure, a database application crash etc., have a cascading effect and the impact is larger than we perceive. The key areas an administrator must keep an eye on to assure a network that is 100% available include, availability & uptime monitoring, system resources monitoring and bandwidth monitoring.

Security Management

A secured network is a good, healthy network. The challenges here are huge as the administrator cannot make the slightest compromise. Like any other aspect of network monitoring, ‘prevention is better than cure’ is the motto here too. While intrusion detection, intrusion prevention etc may not usually be a part of a network monitoring solution, support for plugging–in even a third–party utility augurs well with the network security administrators. The areas an administrator focuses on to secure his network starts at keeping a close tab on the system log messages including Windows Event Logs, Syslogs on Unix–based devices, Firewall logs etc.

Intelligent Alerting

Any good network monitoring solution must have an intelligent alerting mechanism using which the IT team can productively collaborate and work efficiently. As we discussed earlier, any component or a resource in a network can play truant and pull down the network. A delay in preventing a fault from occurring, or repairing a damage in a lesser turn–around time requires a fool–proof alerting mechanism where the concerned engineer gets to know the source of the problem by way of a meaningful alert.

Scalability of the Solution

All said and done, a network monitoring solution must not take a beating and crash or it must do so with a warning at the least! A server on which you host the monitoring solution, or the monitoring application itself is as susceptible as the other resources on the network. Having a redundant server take over and provide un–interrupted monitoring service is an administrator’s dream.

How ManageEngine OpManager fits in

ManageEngine OpManager has a single big advantage of either loosely or tightly integrating with the other applications in the ManageEngine Suite. Built on a robust platform, OpManager addresses all the key requirements an administrator or an IT Manager looks for in a network monitoring solution. Let us quickly look at how each of the above functionality is addressed in ManageEngine OpManager:

OpManager Discovery –View demo

OpManager provides various options as part of its automatic discovery feature. Further, the devices are automatically classified based on the category into Routers, Switches, Servers etc. OpManager relies on the standard SNMP ⁄ ICMP pings to perform deep discovery. The discovery options include the following:

IP Range–based Discovery

The option to discover by specifying a range is useful when there is a need to manage devices just within that range and proves less costly as opposed to scanning a complete network. When a new device is plugged–into the network, the network monitoring application must be able to only that specific device.

CIDR Discovery

CIDR is Class–less Inter-Domain Routing, an IP addressing scheme that replaces the system based on classes A,B, and C.

File–based Discovery

This option is useful when an administrator has to discover a random set of devices across networks. Discovery must be initiated for a list containing the host–names or the IP addresses of the devices to be managed.

On–demand Discovery

When a new device is plugged–into the network, the network monitoring application must be able discover only that specific device.

Infrastructure Maps and Business Views –View demo

The discovered devices are grouped by category for easy monitoring. Besides, OpManager allows you to create your own views to group the devices logically and manage them from one place. This option helps you manage the devices under each geographical location and assign authorized access to the business views.

Network Topology and Mapping

Monitoring almost every resource on the network –View demo

OpManager supports industry–standard monitoring protocols that include SNMP, WMI, Telnet, and SSH, besides allowing custom scripting. It monitors almost every network resource and reports the performance.

Availability & Uptime monitoring

A quick, safe option to determine the performance of a network and its resources is to simply check its availability. Something as simple as a ping determines if a device is up or down. Checking for availability is often the start point from where an administrator can drill down to the root cause. Proactive monitoring for availability and uptime ensures that you have a plan B in place if the server or link cannot be made functional in a given time.

System Resources monitoring

Any device is rendered unavailable if the hardware shows poor performance. An overloaded server may have its CPU and Memory eaten up by the applications hosted on it and make the server useless. By watching the performance of the resources, the system administrators can effect capacity planning, distribute the resources, or upgrade the resources much before a problem is encountered.

Bandwidth monitoring

Any business relies heavily on the WAN, which is an indispensable and costly side of an enterprise network. A network administrator must be wary of the impending outages and equip accordingly so that the business is not affected. Keeping a tab on the WAN link latency, bandwidth utilization, the round–trip–time (RTT), etc enables the administrator to avoid serious outages, and also aids in capacity planning.

Network Performance Monitoring

Securing your network with OpManager –View demo

With OpManager, you will be the first to know of any security threat to your network. Be it an unauthorized user activity, or an outside intrusion, OpManager catches it all.

The network security administrators are left to face many threats to the network security from the external world and also from within. The Firewall security events such as intrusion detection, virus attacks, denial of service attack, etc., anomalous behaviors, employee web activities etc, provide a wealth of information on potential threats. Even a small compromise can prove costly to the business. Ability to visualize enterprise security and detect security compromises is a essential component of a network monitoring solution.

Syslogs

Log monitoring as a means of ensuring security, is incomplete without monitoring the syslog, the events of Unix-based systems such as Linux, Solaris, HP–UX, IBM AIX, and other devices supporting syslog like routers, switches etc. Seasoned administrators monitor Syslogs to keep a tab on unauthorized authentications, user activity, network connections, critical system or application failures, et all.

Firewall logs

The network security administrators are left to face many threats to the network security from the external world and also from within. The Firewall security events such as intrusion detection, virus attacks, denial of service attack, etc., anomalous behaviors, employee web activities etc, provide a wealth of information on potential threats. Even a small compromise can prove costly to the business. Ability to visualize enterprise security and detect security compromises is a essential component of a network monitoring solution.

Netflow

While it is ideal to take the required measures to prevent an attack, a means to get the acts together even when a damage is done is needed to ensure there are no further attacks. It is no secret that perpetrators of malicious attacks find new means to gain entry into corporate networks, making them the biggest targets. Irrespective of the size of an enterprise, such attacks are unaffordable in terms of financial implications and data compromises. Detecting the onslaught of any virus attack or worm attack and getting to the root of the problem becomes necessary. The network administrator must be able to ‘watch’ for the spikes and predict the trend. This paves way to get to the source from which an attack originates and nail down the problem.

Alerting Mechanism –View demo

OpManager’s alerting is characterized by its real–time alerting capabilities in addition to maintaining a alert history. OpManager pro–actively checks for faults in a network by querying the devices periodically, listens for traps from devices, and also processes the system logs to generate corresponding OpManager alerts. Threshold–based alerting aids in quicker resolution time.

Network Fault Management

Alerting – Pull and Push modes

A network monitoring system must be able to proactively seek out faults by talking to the network resources (the pull mode) and must also be capable of receiving or listening to fault messages from the resources (the push mode). Receiving and processing SNMP traps, converting the log messages on various monitoring systems into application alerts etc, facilitates fault management from a single console.

Thresholds–based Alerting

Setting thresholds for the various monitored resources enables proactive alerting. The important parameters where thresholds help are, response time (very useful when you have customer SLAs), resource utilization, transfer rate, connection counts etc. A provision to re–arm an alert makes the threshold functionality complete.

Notifying Alerts

The purpose of alerting is to get the administrator or the IT staff attend to the problem as early as possible and prevent huge downtimes. So the alerts need to make it to the concerned engineer’s inbox or mobile besides other mechanisms like a pop–up on his screen etc. Some of the quick alerting mechanisms the administrators or other network engineers prefer include Email alerts, SMS alerts, a provision to plug–in some self–healing scripts, or even log a trouble–ticket with a helpdesk system based on the criticality or priority of the alert.

Scalable, Robust Application Architecture

Failover or redundancy support is necessary to achieve uninterrupted service. Implementing a redundancy system helps you to overcome failures such as a database crash or a loss in network connectivity. OpManager is highly available with complete support for redundancy. You can set up MySQL or MSSQL replication which ensures that the network monitoring task is un–interrupted. The failover to a secondary server in the event of the primary server failing, is seamless, and the users do not experience any downtime.

Failover for high availability

Summary

ManageEngine OpManager is a feature–rich network monitoring solution which is further powered by its capability to seamlessly integrate with the other offerings in the ManageEngine Suite as add-ons or plug–ins. It integrates with ServiceDesk Plus for trouble ticketing, with NetFlow Analyzer for detailed traffic analysis, with DeviceExpert for hassle–free configuration management, and with Firewall Analyzer for tight security management. A single point, intuitive console to manage and troubleshoot the network problems, reduces the hassles of having to deal with mutliple vendors and several standard and proprietory protocols. In short, just one–neck–to–hang when in trouble. Give it a try if you have not already and stay home celebrating a happy new year!