OpenNMS / NMS-10254

Visualize Minions/Locations status on the WebUI

    Details

    • Type: Enhancement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 19.1.0, 20.1.0, 21.1.0, 22.0.1, Meridian-2018.1.0
    • Fix Version/s: 24.1.3
    • Component/s: Minion, Web UI - General
    • Security Level: Default (Default Security Scheme)
    • Labels:
    • Sprint:
      Horizon - July 25th 2018, Horizon - August 1st 2018, Horizon - August 8th 2018, Horizon - August 15th 2018, Horizon - August 22nd 2018, Horizon - August 29th 2018, Horizon - September 05 2018

      Description

      This might be an epic that could be divided into multiple issues/tasks, but I'll leave that decision to the developers.

      Currently, the only way to know whether a Minion is working is to have the Minions requisition and auto-provisioning enabled, so that the Minion-Heartbeat service can do its job.

      That said, I believe that functionality is optional, and we need a more reliable way to easily display the status of all Minions.

      The administration page for Minions is supposed to provide this, as it has a "status" column, but unfortunately that column is not maintained. The ReST API for Minions uses the DAO to retrieve the data from the database, but the status column is NOT populated in the monitoringsystems table.

      Implementing this and providing a filter to quickly show the unreachable Minions would be very useful (think of it as the outages page, but for Minions).
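As a rough illustration of the filter described above, the sketch below derives a status from the age of each Minion's last heartbeat and lists the unreachable ones. It is written in Python for brevity and is not OpenNMS code: the `Minion` record, its `last_heartbeat` field, the "up"/"down" strings, and the 2-minute threshold are all assumptions made for the example.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone
from typing import List, Optional

# Hypothetical threshold: a Minion silent for longer than this counts as unreachable.
HEARTBEAT_TIMEOUT = timedelta(minutes=2)


@dataclass(frozen=True)
class Minion:
    # Hypothetical record; field names are illustrative, not the OpenNMS schema.
    minion_id: str
    location: str
    last_heartbeat: datetime


def minion_status(minion: Minion, now: datetime) -> str:
    """Return "up" if the last heartbeat is recent enough, otherwise "down"."""
    return "up" if now - minion.last_heartbeat <= HEARTBEAT_TIMEOUT else "down"


def unreachable_minions(minions: List[Minion],
                        now: Optional[datetime] = None) -> List[Minion]:
    """Keep only the Minions currently considered unreachable."""
    if now is None:
        now = datetime.now(timezone.utc)
    return [m for m in minions if minion_status(m, now) == "down"]
```

Passing `now` explicitly keeps the check deterministic, which makes the filter easy to exercise against fixed timestamps.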

      Besides that, we need something similar for Locations. We might have more than one Minion per Location, but if we lose all the Minions on a given Location, OpenNMS won't be able to monitor the nodes on that Location, and the user might get a false picture of the state of those nodes.

      Similar to a problem we've faced in the past with Scheduled Outages, it would be extremely beneficial to have a banner on the Node Page that shows one of two labels:

      1) A "warning" label when the Location is "compromised" (one or more Minions on a Location are unreachable, but at least one is reachable).

      2) A "critical" label when the Location is "down" (all the Minions on that Location are unreachable, meaning that the node cannot be monitored).
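The two rules above can be sketched as a small classification function. This is only an illustration in Python, not OpenNMS code: the function names, the "ok" result for a fully reachable Location, and the "unknown" result for a Location with no Minions at all are assumptions that the ticket does not specify.

```python
from typing import Dict, Iterable, List, Tuple


def location_status(minion_reachable: Iterable[bool]) -> str:
    """Classify a Location from the reachability of its Minions."""
    flags = list(minion_reachable)
    if not flags:
        return "unknown"   # assumption: no Minions registered for this Location
    if all(flags):
        return "ok"        # assumption: no banner needed
    if any(flags):
        return "warning"   # "compromised": some unreachable, at least one reachable
    return "critical"      # "down": every Minion is unreachable


def status_by_location(minions: Iterable[Tuple[str, bool]]) -> Dict[str, str]:
    """Group (location, reachable) pairs and classify each Location."""
    grouped: Dict[str, List[bool]] = {}
    for location, reachable in minions:
        grouped.setdefault(location, []).append(reachable)
    return {loc: location_status(flags) for loc, flags in grouped.items()}
```

A Node Page banner would then only need the status of the node's own Location, showing nothing for "ok".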

      That way, if there is a problem with the Minions and/or the Location, and the node doesn't reflect its actual state (or the graphs are empty), the operator can easily associate that with a Minion-related issue.

            People

            • Assignee: Benjamin Reed (ranger)
            • Reporter: Alejandro Galue (agalue)
            • Votes: 0
            • Watchers: 2
