Code License: MIT License
Labels: html, python, parser, tokenizer, liberalxml, ruby

A Ruby/Python HTML parser/tokenizer based on the WHATWG HTML5 specification, for maximum compatibility with major desktop web browsers.

0.11 Release Features

Known Issues (0.11)

Documentation

Using HTML5Lib

Getting help/getting involved