Jsoup 1.2.3 processes HTML 5

Aug 04, 2010

Jsoup, a free Java library for processing HTML, is available in version 1.2.3 with enhanced HTML 5 support.

Jsoup, a free Java library for processing HTML, is available in version 1.2.3 with enhanced HTML 5 support.

As the parser has always implicitly supported HTML 5 tags, it now knows element definitions of the new standards. The tool can also generate an HTML-5-standards compliant page parse tree for further processing.

The second important innovation in Jsoup automatically detects the character set of a scanned document and decodes the input before parsing. There are also new selectors as well as small fixes and improvements.

Jsoup runs on Java version 1.5 and is under MIT / X license. On the Jsoup homepage there are Jar files for download and instructions in the Cookbook-style and the API reference.

Related content

comments powered by Disqus

Issue 272/2023

Buy this issue as a PDF

Digital Issue: Price $12.99
(incl. VAT)

Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters

News