The HTML language module

Since: 6.45.0

Minimum Java Runtime: Java 8

The HTML language module uses jsoup for parsing.

XPath rules are supported, but the DOM is not a typical XML/XPath DOM. E.g. text nodes are normal nodes. This might change in the future.