summary refs log tree commit diff stats
path: root/lib/packages/docutils/docutils.nimble.old
diff options
context:
space:
mode:
authorAlexander Kernozhitsky <sh200105@mail.ru>2024-05-29 06:42:07 +0200
committerGitHub <noreply@github.com>2024-05-29 06:42:07 +0200
commitb172b34a245959c7d5e8f4df3c3dcbe88b7ba6fa (patch)
treed12583026fdc68f64ba4c3af04a56b2f942637b3 /lib/packages/docutils/docutils.nimble.old
parentd923c581c118b9ea891785bbb828c3cdede587b4 (diff)
downloadNim-b172b34a245959c7d5e8f4df3c3dcbe88b7ba6fa.tar.gz
Treat CJK Ideographs as letters in `isAlpha()` (#23651)
Because of the bug in `tools/parse_unicodedata.nim`, CJK Ideographs were
not considered letters in `isAlpha()`, even though they have category
Lo. This is because they are specified as range in `UnicodeData.txt`,
not as separate characters:

```
4E00;<CJK Ideograph, First>;Lo;0;L;;;;;N;;;;;
9FEF;<CJK Ideograph, Last>;Lo;0;L;;;;;N;;;;;
```

The parser was not prepared to parse such ranges and thus omitted almost
all CJK Ideographs from consideration.

To fix this, we need to consider ranges from `UnicodeData.txt` in
`tools/parse_unicodedata.nim`.
Diffstat (limited to 'lib/packages/docutils/docutils.nimble.old')
0 files changed, 0 insertions, 0 deletions