  OpenNMS / NMS-6748

Having hundreds of scheduled outages hurts the performance of the OpenNMS server

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.10.14, 1.12.8, 1.13.3
    • Fix Version/s: 1.12.9, 1.13.4
    • Component/s: Architecture
    • Security Level: Default (Default Security Scheme)
    • Labels:

      Description

      A customer manages scheduled outages from a third-party server, using the ReST API to create and maintain the maintenance windows for the monitored machines.

      There are about 1500 different definitions in poll-outages.xml.
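
      To confirm how many definitions a given file holds, a small script can count them. This is only a sketch: it assumes the file's root element contains one <outage> child per definition, and the namespace URI and outage names in the sample are placeholders, not values taken from this installation.

```python
import xml.etree.ElementTree as ET


def count_outages(xml_text: str) -> int:
    """Count <outage> elements that are direct children of the root."""
    root = ET.fromstring(xml_text)
    # "{*}" matches any (or no) XML namespace; requires Python 3.8+.
    return len(root.findall("{*}outage"))


# Hypothetical sample; a real file would be read from disk instead.
SAMPLE = """<outages xmlns="http://example.org/assumed-namespace">
  <outage name="maint-a" type="specific"/>
  <outage name="maint-b" type="specific"/>
</outages>"""

print(count_outages(SAMPLE))  # 2
```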

      I profiled that server with Java Flight Recorder and found that the hottest methods on that installation are related to checking scheduled outages.

      The OpenNMS server is polling more than 25000 services and collecting data from around 7000 devices. In addition, the configuration packages for Pollerd, Collectd and Threshd all have those 1500 scheduled outages associated with them.
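
      The following sketch illustrates why these numbers multiply. It is not the OpenNMS code and not the fix applied for this issue; it assumes, for illustration only, that each check walks every outage attached to a package, and compares that with a hypothetical per-node index. The Outage class and all function names are invented for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Outage:
    name: str
    start: int          # window start, arbitrary time units
    end: int            # window end (exclusive)
    nodes: frozenset    # nodes covered by this outage


def linear_check(outages, node, now):
    """Walk every outage; return (in_outage, comparisons made)."""
    comparisons = 0
    for outage in outages:
        comparisons += 1
        if node in outage.nodes and outage.start <= now < outage.end:
            return True, comparisons
    return False, comparisons


def build_index(outages):
    """Map each node to the outages that mention it (built once)."""
    index = {}
    for outage in outages:
        for node in outage.nodes:
            index.setdefault(node, []).append(outage)
    return index


def indexed_check(index, node, now):
    """Inspect only the outages that mention this node."""
    candidates = index.get(node, [])
    active = any(o.start <= now < o.end for o in candidates)
    return active, len(candidates)


# 1500 outages, each covering a single (hypothetical) node.
outages = [Outage(f"maint-{i}", 100, 200, frozenset({f"node-{i}"}))
           for i in range(1500)]
index = build_index(outages)

print(linear_check(outages, "node-9999", 150))   # (False, 1500)
print(indexed_check(index, "node-9999", 150))    # (False, 0)
print(25_000 * 1_500)                            # 37500000 comparisons per pass
```

      Under that assumption, a node that is in no outage pays the full 1500 comparisons on every check, so one pass over 25000 services costs about 37.5 million comparisons, repeated on every polling and collection cycle.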

      As a consequence, CPU usage on the OpenNMS server stays above 70% all the time (the server has 32 CPUs according to /proc/cpuinfo, OpenNMS has a 16 GB heap, and the server has 96 GB of RAM).

            People

            • Assignee:
              agalue Alejandro Galue
            • Reporter:
              agalue Alejandro Galue
            • Votes:
              0
            • Watchers:
              1
