Commit message (Collapse) | Author | Age | Files | Lines | ||
---|---|---|---|---|---|---|
... | ||||||
| * | Enforce no exceptions in all modules | bptato | 2024-08-01 | 5 | -0/+20 | |
| | | | | | | | | Chagashi never throws; this is just to make consumers' jobs easier | |||||
| * | charset_map: reduce size | bptato | 2024-08-01 | 8 | -12568/+5637 | |
| | | | | | | | | | | | | | | | | | | | | | | | | | | * remove unused JIS X 0212 encoding table * skip JIS X 0208 empty rows * merge Shift_JIS encoding table with JIS X 0208 table * skip EUC-KR empty rows & columns * RLE EUC-KR & GBK top + bottom left sections * skip GB18030 PUA section * use RLE EUC-KR and GB18030 decoder ranges for encoding too * fix gitignore shaves off ~154K from the binary | |||||
| * | Version 0.5.3 | bptato | 2024-07-28 | 4 | -4/+7 | |
| | | ||||||
| * | Version 0.5.2 | bptato | 2024-06-14 | 3 | -2/+5 | |
| | | ||||||
| * | decoder: better documentation | bptato | 2024-06-14 | 1 | -29/+42 | |
| | | ||||||
| * | decoder: add UnsafeSlice stringifier | bptato | 2024-06-14 | 1 | -0/+7 | |
| | | ||||||
| * | Misc bugfixes | bptato | 2024-06-14 | 2 | -19/+19 | |
| | | | | | | | | | | | | * simplify decode iterator: only yield once, etc. * fix decode iterator done not resetting `n' * fix partial UTF-8 at the end of a string eating the internal buffer | |||||
| * | Version 0.5.1 | bptato | 2024-06-13 | 3 | -3/+3 | |
| | | ||||||
| * | decoder: fix decode() iterator with error mode "fatal" | bptato | 2024-06-13 | 1 | -1/+3 | |
| | | ||||||
| * | Version 0.5.0 | bptato | 2024-06-13 | 3 | -3/+10 | |
| | | ||||||
| * | Unify decoder and validator APIs | bptato | 2024-06-13 | 8 | -221/+317 | |
| | | | | | | | | | | | | | | | | | | The validator/decoder split was an abomination, and having to buffer your input queue for UTF-8 inputs was kind of annoying too. Now we have an extra instruction that also instructs the reader to use data from its input queue. Currently this is only used for UTF-8, but we may extend it to other ASCII-compatible charsets too in the future. | |||||
| * | charset: simplify getLocaleCharset | bptato | 2024-04-30 | 2 | -17/+14 | |
| | | ||||||
| * | Update code style | bptato | 2024-04-30 | 7 | -82/+82 | |
| | | ||||||
| * | Remove unnecessary casts | bptato | 2024-04-30 | 2 | -10/+10 | |
| | | ||||||
| * | makefile: actually use NIM variable | bptato | 2024-04-30 | 1 | -4/+4 | |
| | | ||||||
| * | decodercore: fix wrong assertion | bptato | 2024-03-06 | 2 | -1/+1 | |
| | | | | | | | | + remove an artifact | |||||
| * | add a test | bptato | 2024-03-05 | 1 | -0/+1 | |
| | | | | | | | | just a sanity check for myself, it's a bug in Chawan | |||||
| * | Version 0.4.2 | bptato | 2024-02-26 | 3 | -2/+5 | |
| | | ||||||
| * | Fix compiler crash on 1.6.14 | bptato | 2024-02-26 | 3 | -5/+6 | |
| | | | | | | | | seems like it doesn't like `out' | |||||
| * | Version 0.4.1 | bptato | 2024-02-23 | 3 | -2/+5 | |
| | | ||||||
| * | Make validator actually work | bptato | 2024-02-22 | 3 | -16/+24 | |
| | | | | | | | | | | The API made little sense; `n' was being set almost randomly to "some byte at the current position." | |||||
| * | Update news | bptato | 2024-02-21 | 1 | -1/+1 | |
| | | ||||||
| * | Import Chagashi | bptato | 2024-02-21 | 59 | -0/+101329 | |
| | ||||||
* | Add 'lib/chame0/' from commit '93f31143b25349ed071c6f564ecce45a5c2b351c' | bptato | 2025-01-18 | 38 | -0/+10236 | |
|\ | | | | | | | | | | | git-subtree-dir: lib/chame0 git-subtree-mainline: cb1ec272443328c80d9e4bfb17f4853ff2745b16 git-subtree-split: 93f31143b25349ed071c6f564ecce45a5c2b351c | |||||
| * | Version 1.0.3 | bptato | 2025-01-03 | 3 | -2/+5 | |
| | | ||||||
| * | Conform strict defs | bptato | 2025-01-03 | 7 | -37/+40 | |
| | | ||||||
| * | Version 1.0.2 | bptato | 2024-11-22 | 4 | -7/+40 | |
| | | ||||||
| * | Update docs | bptato | 2024-11-22 | 3 | -28/+26 | |
| | | ||||||
| * | htmltokenizer: get rid of some redundant checks | bptato | 2024-10-21 | 2 | -166/+112 | |
| | | ||||||
| * | Update gitignore | bptato | 2024-10-21 | 1 | -2/+2 | |
| | | ||||||
| * | htmltokenizer: simplify char ref end state | bptato | 2024-10-21 | 1 | -35/+37 | |
| | | | | | | | | + some other small code quality improvements | |||||
| * | update tests | bptato | 2024-10-21 | 2 | -6/+6 | |
| | | ||||||
| * | update docs | bptato | 2024-10-21 | 2 | -17/+16 | |
| | | ||||||
| * | Update docs | bptato | 2024-10-05 | 3 | -5/+8 | |
| | | ||||||
| * | Version 1.0.1 | bptato | 2024-07-28 | 4 | -4/+8 | |
| | | ||||||
| * | htmlparser: small improvements | bptato | 2024-07-06 | 7 | -15/+13 | |
| | | | | | | | | | | | | | | * remove unnecessary stringify + function * remove test artifacts * update chagashi submodule * add test to skipDirs | |||||
| * | Update readme | bptato | 2024-06-21 | 1 | -4/+5 | |
| | | ||||||
| * | Version 1.0.0 | bptato | 2024-06-13 | 3 | -4/+10 | |
| | | ||||||
| * | Update readme | bptato | 2024-06-13 | 1 | -14/+7 | |
| | | ||||||
| * | minidom_cs: switch to chagashi | bptato | 2024-06-13 | 3 | -33/+42 | |
| | | ||||||
| * | Reorganize tests | bptato | 2024-06-13 | 20 | -6/+62 | |
| | | | | | | | | not willing to put up with nimble any longer, makefile it is. | |||||
| * | htmlparser: simplify a bit | bptato | 2024-06-13 | 1 | -13/+6 | |
| | | ||||||
| * | Update docs, todo | bptato | 2024-06-04 | 3 | -3/+16 | |
| | | ||||||
| * | Remove parse error callback | bptato | 2024-05-05 | 4 | -635/+120 | |
| | | | | | | | | It never worked properly, it's under-specified, and I don't need it. | |||||
| * | entity: reduce size somewhat | bptato | 2024-05-05 | 3 | -1060/+764 | |
| | | | | | | | | with GCC this shaves 64K off the output | |||||
| * | Version 0.14.5 | bptato | 2024-04-09 | 3 | -2/+8 | |
| | | ||||||
| * | doc: update manual | bptato | 2024-04-09 | 1 | -3/+9 | |
| | | ||||||
| * | tags: remove duplicate definitionURL | bptato | 2024-04-09 | 2 | -8/+7 | |
| | | | | | | | | | | | | it causes problems with the naive implementation of TagType enum extension (as we do it in Chawan); better to just convert it dynamically in the very very very rare case that it's needed. | |||||
| * | htmltokenizer: fix entities starting with `z' | bptato | 2024-04-09 | 3 | -2/+12 | |
| | | | | | | | | | | lower-case `z' was missing from the list of generated end indices, which broke character entities starting with `z'. | |||||
| * | Update readme | bptato | 2024-03-22 | 1 | -0/+4 | |
| | |