Details
Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Meridian-2015.1.0, 16.0.3
Security Level: Default (Default Security Scheme)
Description
When the XML Collector parses HTML documents, the JSoup library is used to convert the HTML into a well-formed XML document.
The problem is that when the document contains encoded characters such as "Cura&ccedil;ao" for "Curaçao", the JSoup document must be initialized in a special way in order to parse the data properly and avoid exceptions.
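To illustrate why such characters cause trouble, here is a minimal sketch using only the JDK's XML parser. It is not the XML Collector's code, and the class and method names (EntityDemo, parsesAsXml) are hypothetical. It assumes the failure comes from HTML named entities such as &ccedil; that XML does not define: a standard XML parser rejects the named form but accepts the equivalent numeric character reference.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical demo class, not part of the XML Collector.
class EntityDemo {

    // Returns true if the given text parses as well-formed XML.
    static boolean parsesAsXml(String xml) throws Exception {
        DocumentBuilder builder =
                DocumentBuilderFactory.newInstance().newDocumentBuilder();
        // DefaultHandler rethrows fatal errors without printing to stderr.
        builder.setErrorHandler(new DefaultHandler());
        try {
            builder.parse(new ByteArrayInputStream(
                    xml.getBytes(StandardCharsets.UTF_8)));
            return true;
        } catch (SAXException e) {
            // Covers SAXParseException, e.g. a reference to an undeclared entity.
            return false;
        }
    }

    public static void main(String[] args) throws Exception {
        // HTML named entity: not declared in plain XML, so parsing fails.
        System.out.println(parsesAsXml("<p>Cura&ccedil;ao</p>"));
        // Numeric character reference for the same character: parses fine.
        System.out.println(parsesAsXml("<p>Cura&#xe7;ao</p>"));
    }
}
```

This suggests that the XML produced from the HTML should carry such characters either literally or as numeric references. How the fix configures JSoup to achieve that is not stated in this ticket.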
On a customer installation running 17.0.0-SNAPSHOT, this problem was generating a DatacollectionFailed.