  1. OpenNMS
  2. NMS-7963

JSoup doesn't properly parse encoded HTML characters, which confuses the XML Collector

    Details

      Description

      When the XML Collector is used to parse HTML documents, the JSoup library converts the HTML into a well-formed XML document.

      The problem is that when the document contains encoded characters, such as an HTML entity standing for the "ç" in "Curaçao", the JSoup document must be initialized in a special way in order to parse the data properly and avoid exceptions.
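
      The issue does not say which exception was thrown. One plausible cause, stated here as an assumption, is that an HTML named entity survives the HTML-to-XML conversion: XML predefines only five entities (amp, lt, gt, quot, apos), so a strict XML parser rejects anything else unless it is declared. The sketch below uses only the JDK XML parser, not JSoup or the XML Collector, and the class and method names are hypothetical. It shows a named entity failing to parse while the equivalent numeric character reference parses fine.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical demo class; not part of OpenNMS or JSoup.
public class EntityParseDemo {

    /**
     * Parses the given string as XML and returns the text content of the
     * root element, or null if the input is not well-formed XML.
     */
    static String rootText(String xml) {
        try {
            DocumentBuilder builder =
                DocumentBuilderFactory.newInstance().newDocumentBuilder();
            // DefaultHandler rethrows fatal errors without logging to stderr.
            builder.setErrorHandler(new DefaultHandler());
            return builder.parse(new InputSource(new StringReader(xml)))
                          .getDocumentElement()
                          .getTextContent();
        } catch (SAXException e) {
            // Raised for well-formedness errors such as an undeclared entity.
            return null;
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        // HTML named entity: not declared in XML, so parsing fails.
        System.out.println(rootText("<name>Cura&ccedil;ao</name>"));
        // Numeric character reference for the same character: valid XML.
        System.out.println(rootText("<name>Cura&#231;ao</name>"));
    }
}
```

      If this is indeed the failure mode, the converted document has to carry such characters either literally or as numeric references. In JSoup the serialization side is controlled through Document.OutputSettings (syntax and escape mode); whether that is the change applied for this issue is not stated here.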

      On a customer installation, this problem generated a DatacollectionFailed event on 17.0.0-SNAPSHOT.

            People

            • Assignee: Alejandro Galue (agalue)
            • Reporter: Alejandro Galue (agalue)