| ID | Type | Status | Priority | Milestone | Owner | Summary + Labels | Port |
|---|---|---|---|---|---|---|---|
| 25 | Defect | New | Medium | ---- | ---- | Lack of trailing empty line generates a parse error | ---- |
| 30 | Defect | Accepted | Medium | ---- | jgraham.cantab | Missing codecs in python 2.3 | ---- |
| 35 | Defect | Accepted | Medium | ---- | jgraham.html | Poor performance parsing numeric entities | Ruby |
| 47 | Defect | New | Medium | ---- | ---- | [PATCH] Sanitizer passes uppercase tags through untouched | ---- |
| 52 | Defect | New | Medium | ---- | ---- | thin and thick not in CSS whitelist | ---- |
| 55 | Defect | Accepted | High | ---- | jgraham.html | trunk python UnicodeError: UTF-16 stream does not start with BOM | ---- |
| 57 | Defect | New | Medium | ---- | ryansking | /trunk/testdata external set as https | ---- |
| 59 | Defect | New | Medium | ---- | ---- | maximum recursion depth exceeded in tree traversal (python) | Python |
| 61 | Defect | New | Medium | ---- | ---- | Integration of CSS Parser | ---- |
| 62 | Defect | New | Medium | ---- | ---- | Sanitizer does not allow stripping of tags | ---- |
| 63 | Defect | New | Medium | ---- | ---- | Problems with UTF8 | Ruby |
| 66 | Defect | New | Medium | ---- | ---- | Check for valid utf-8 in inputstream.rb gives false negatives when $KCODE is set to "UTF8" [w/fix] | Ruby |
| 68 | Defect | New | Medium | ---- | ---- | chardet is no longer maintained but rchardet | Ruby |
| 69 | Enhancement | Accepted | Medium | ---- | ---- | charsUntil is slow (Python) | Python |
| 73 | Defect | New | Medium | ---- | ---- | problem with reading from stdin | Python |
| 74 | Defect | Accepted | Medium | ---- | jgraham.html | AttributeError: 'module' object has no attribute 'isValidEncoding' | ---- |
| 75 | Defect | New | Medium | ---- | ---- | [PATCH] Filters should pass contentModelFlag changes to source | ---- |
| 76 | Defect | New | Medium | ---- | ---- | Validator complains about type and global attrs on input tags. | ---- |
| 77 | Defect | Accepted | Medium | ---- | ryansking | Only first instance of white space is stripped | Ruby |
| 79 | Defect | New | Medium | ---- | ---- | getElementById doesn't work with minidom | ---- |
| 80 | Defect | Accepted | Medium | ---- | ---- | TypeError when serializing some pages to BeautifulSoup | ---- |
| 81 | Defect | New | High | ---- | ---- | Version info | Python |
| 82 | ---- | New | Critical | ---- | ---- | Zip archive is messed up | Python |
| 86 | ---- | New | ---- | ---- | ---- | BeautifulSoup treebuilder string attribute is missing | ---- |
| 87 | Defect | Accepted | High | ---- | ---- | <isindex action prompt> not supported | ---- |
| 88 | Defect | Accepted | Critical | ---- | jgraham.html | Reading from stdin broken | Python |
| 89 | ---- | New | ---- | ---- | ---- | Installation using setup.py fails under Windows | ---- |
| 90 | ---- | New | ---- | ---- | ---- | All files doubled in archive html5lib-0.11.1.zip | ---- |
| 92 | ---- | New | ---- | ---- | ---- | Possible to make IE run script after roundtripping in html5lib | ---- |
| 93 | ---- | New | ---- | ---- | ---- | Quote attributes containing weird whitespace or '<' | ---- |
| 95 | Task | Accepted | Medium | ---- | jgraham.html | Implement scripting-disabled case | ---- |
| 96 | ---- | New | ---- | ---- | ---- | a better intToUnicodeStr | ---- |
| 98 | ---- | New | ---- | ---- | ---- | Encoding issue: 'ascii' codec instead of appropriate one. | ---- |
| 103 | ---- | New | ---- | ---- | ---- | Can't easy_install/pip install html5lib==dev | ---- |