Increase robustness against different HTML to text conversions
This is a preparational step for improvements to the layout of texts produced from HTML input and makes those extractor scripts rely less on specific whitespace sequences.
Please register or sign in to comment