| Commit message (Collapse) | Author | Age | Files | Lines |
... | |
| |
| |
| |
| |
| | |
Compare token tag names where we can get away with it, since that's
faster.
|
| |
| |
| |
| | |
getTagType already checks for it
|
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| | |
Many bugfixes:
* getTagType now always returns TAG_UNKNOWN for non-HTML namespaced
elements
* Fix doctype public identifiers being compared case-sensitively
* Fix adoption agency algorithm iteration count (again :D)
* Add <font color> etc. to foreign content accepted element list
* Fix pushInTemplate parser option
* Fix SVG/MathML tags being used in table scope
* Use table scope where appropriate (IN_TABLE)
* minidom: fix parseHTMLFragment for non-HTML namespaces
* minidom: fix UTF-8 validator/converter/whatever
* Also, fix some test case parsing bugs/omissions (so they actually
run :P)
* Update readme
|
| |
| |
| |
| |
| |
| |
| | |
* Fix CDATA section bracket state bug
* Fix peekStr bug
* Simplify peekStrNoCase
* Replace toUpperAscii calls with toLowerAscii
|
| |
| |
| |
| |
| |
| |
| | |
* Implement template stuff in minidom
* Foreign content fixes
* Fix </tbody> in "in row" switching to "in body" instead of
"in table body"
|
| | |
|
| |
| |
| |
| |
| |
| | |
Now they are functions.
Also, slightly reduce the number of nested templates.
|
| | |
|
| | |
|
| |
| |
| |
| | |
aesthetics
|
| | |
|
| |
| |
| |
| | |
Now it runs without errors.
|
| | |
|
| |
| |
| |
| | |
WIP
|
| |
| |
| |
| | |
Now it is done outside of the main loop.
|
| | |
|
| |
| |
| |
| | |
doesn't work otherwise
|
| |
| |
| |
| | |
not a mandatory dependency anymore
|
| |
| |
| |
| | |
Needed for tree tests, but gitignore blocked it.
|
| |
| |
| |
| | |
this needs isNone, not isSome
|
| |
| |
| |
| |
| |
| |
| | |
This removes Chakasu as a hard dependency.
Now users of this library must either implement encoding support
themselves, or use minidom_cs (which still depends on Chakasu).
|
| |
| |
| |
| |
| |
| |
| | |
This replaces DOMBuilder.document with a getter function, mainly
for consistency and flexibility. (Also, it removes the need to
convert back DOMBuilder.document into a document node after parsing
has finished.)
|
| | |
|
| |
| |
| |
| |
| | |
These were made disregarding canReinterpret and would crash any
parseHTML call with a stream that cannot be re-interpreted.
|
| | |
|
| |
| |
| |
| | |
had to be fixed too
|
| | |
|
| |
| |
| |
| |
| | |
Passing `nil' there was an unfortunate mistake that requires an API
breakage to fix.
|
| | |
|
| | |
|
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| | |
minidom:
* add fragment parsing algorithms
* document parseHTML
htmlparser:
* fix table body/in caption being mixed up in resetInsertionMode
* fix frameset-ok not being initialized to true
* fix opts.ctx not being used
* naively parse tags in `match' instead of using the tokenizer
htmltokenizer:
* remove special-cased compile-time tokenizer mode
* change sbuf to an array (from a seq), and store length in a separate
variable instead of constantly resizing it
* do not check for eof in emit_current (it never occurs)
|
| |
| |
| |
| |
| | |
Nim's JSON parser is slow, in nimvm even more so.
Use a pre-generated entity_gen.nim file instead.
|
| | |
|
| | |
|
| |
| |
| |
| |
| |
| |
| |
| | |
* Fix several bugs in adoptionAgencyAlgorithm, and factor out several
"find index" operations
* Fix some frameset, table col related bugs
* minidom: simplify moveChildren, assert on adding children with an
existing parent
|
| | |
|
| | |
|
| |
| |
| |
| |
| | |
* Fix misunderstanding: the stack grows *downwards*.
* Add some comments
|
| | |
|
| | |
|
| | |
|
| |
| |
| |
| | |
So that we do not have to replace it in the parser.
|
| | |
|
| | |
|
| |
| |
| |
| | |
Still not nil-free, because insertBefore & insertText needs nil.
|
| |
| |
| |
| |
| | |
Makes it possible to set an initial value for openElements and the
form pointer, as required by the HTML fragment parsing algorithm.
|
| |
| |
| |
| |
| | |
Use this to enable the unicodeCharsProblematic test, by importing
runestream.
|
| | |
|
| | |
|
| |
| |
| |
| |
| | |
Call consume() so that the buffer is filled if we are not at EOF yet
(through checkBufLen).
|