# ghc-tagsoup
TagSoup is a library for parsing HTML/XML. It supports the HTML 5 specification, and can be used to parse either well-formed XML, or unstructured and malformed HTML from the web. The library also provides useful functions to extract information from an HTML document, making it ideal for screen-scraping.
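
As a rough illustration of the screen-scraping use case, the sketch below parses a small, deliberately malformed HTML fragment and pulls out the link targets and the visible text. It assumes the `tagsoup` package is installed and uses `parseTags`, `isTagOpenName`, `fromAttrib`, and `innerText` from `Text.HTML.TagSoup`. The sample input and the helper names `linkTargets` and `pageText` are hypothetical, chosen only for this example.

```haskell
import Text.HTML.TagSoup (parseTags, isTagOpenName, fromAttrib, innerText)

-- Hypothetical sample input: the <p> elements are never closed and
-- </html> is missing, as is common in HTML found on the web.
sampleHtml :: String
sampleHtml =
  "<html><body><p>Hello <a href=\"/docs\">docs</a>\
  \<p>Bye <a href=\"/home\">home</a></body>"

-- Collect the href attribute of every opening <a> tag.
-- fromAttrib returns an empty string when the attribute is absent.
linkTargets :: String -> [String]
linkTargets html =
  [ fromAttrib "href" tag
  | tag <- parseTags html
  , isTagOpenName "a" tag
  ]

-- Concatenate the text nodes of the document, ignoring all markup.
pageText :: String -> String
pageText = innerText . parseTags

main :: IO ()
main = do
  mapM_ putStrLn (linkTargets sampleHtml)
  putStrLn (pageText sampleHtml)
```

Because `parseTags` produces a flat list of tags rather than a tree, ordinary list functions and comprehensions are enough for simple extraction like this, and unclosed elements do not need special handling.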