Skip to content
Commit fcc5767f authored by Volker Krause's avatar Volker Krause
Browse files

Improve HTML to text conversion

Specifically, this avoid generating spurious whitespaces and does line
breaks that more closely follow the expected layout in HTML.

This helps extractors that have to use the text representation due to the
corresponding extractors containing no meaningful DOM structure to work
with.
parent 93603367
Pipeline #274670 skipped
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment