User:Joseph Crowe/Easy HTML Parser

Easy HTML Parser is an open source Python library allowing the parsing, manipulation and serialization of HTML documents. It is designed to be simple and easy to use. It extends the HTMLParser class from Python's standard library. As of June 2011, it consists of approximately 500 lines of Python code.

Features

 * Parsing of a superset of HTML including broken markup, to produce a mutable tree of objects representing the document.
 * Selection of nodes in the document tree using filters.
 * Reproduction of HTML markup representing a modified document.