A library for parsing (possibly malformed) HTML/XML documents http://hackage.haskell.org/package/tagsoup
Go to file
2024-01-05 22:56:02 +01:00
ghc-tagsoup.spec automatic version update by autodist [release 0.14.8-1mamba;Tue Nov 14 2023] 2024-01-05 22:56:02 +01:00
README.md update to 0.13.3 [release 0.13.3-1mamba;Mon Apr 13 2015] 2024-01-05 22:56:02 +01:00

ghc-tagsoup

TagSoup is a library for parsing and extracting information from (possibly malformed) HTML/XML documents. It supports the HTML 5 specification, and can be used to parse either well-formed XML, or unstructured and malformed HTML from the web. The library also provides useful functions to extract information from an HTML document, making it ideal for screen-scraping.