Beautiful Soup is a self-contained parser that makes screen-scraping easy. It parses both well-formed and malformed HTML and XML, and offers methods for traversing the parse tree and extracting specific parts of a document.
beautifulsoup Web python xml parser html License:Python-License
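As a rough illustration of the parsing and extraction described above, here is a minimal sketch. It assumes the current `beautifulsoup4` distribution (imported as `bs4`) together with Python's built-in `html.parser` backend; the release this listing refers to may use a different import path. The sample markup and variable names are made up for the example.

```python
# Assumption: the modern `beautifulsoup4` package is installed (pip install beautifulsoup4).
from bs4 import BeautifulSoup

# Deliberately sloppy markup: neither <p> tag is ever closed.
html = "<html><body><p>First paragraph<p>Second <a href='/next'>link</a></body></html>"

# Build the parse tree with Python's built-in HTML parser.
soup = BeautifulSoup(html, "html.parser")

# Extract specific parts of the document.
first_link = soup.find("a")          # first <a> element in the tree
link_target = first_link["href"]     # attribute access works like a dict
link_text = first_link.get_text()    # text contained in the element
paragraphs = soup.find_all("p")      # every <p> element, however it is nested

# Traverse the tree: step from the link up to its enclosing element.
enclosing_tag = first_link.parent.name
```

How the unclosed tags end up nested depends on the parser backend chosen, so code that relies on exact tree shape may behave differently with another backend.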