mu - Soul of a tiny new machine. More thorough tests → More comprehensible and rewrite-friendly software → More resilient society.

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	.	Kartik K. Agaram	2021-09-01	1	-0/+1
\|
*	a few more examples of combining characters	Kartik K. Agaram	2021-09-01	1	-2/+45
\|
*	.	Kartik Agaram	2021-09-01	16	-10431/+14919
\|
*	rendering code-points with combining characters	Kartik K. Agaram	2021-09-01	4	-24/+95
\| \| \| \| \| \| \|	There's a new example app showing this ability. Still to go: support for combining characters when rendering text and streams.
*	tag combining character code-points	Kartik K. Agaram	2021-08-31	1	-727/+727
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Unfortunately the Unicode database doesn't actually provide obvious metadata for combining characters. The process I followed is as follows. I noticed that GNU Unifont provides the following files for download: - unifont-13.0.06.hex: All Plane 0 glyphs - unifont_sample-13.0.06.hex: The above .hex file with combining circles added Downloading and diffing the two yields all code-points with combining circles. I assume they are exactly the combining characters I care about. One mechanical difficulty is cross-correlating the above files that include the code-point in each line with font.subx which does not. I got things to work by modifying the above files in place until they have the same format as font.subx, using the following Vim commands on each file: :%s\|.\{64\}\|10/size^M00/is-combine^M&\| :%s\|^.\{32\}$\|08/size^M00/is-combine^M&00000000000000000000000000000000\| :%s\|..\|& \|g :%s\|10 /s iz e\|10/size\| :%s\|08 /s iz e\|08/size\| :%s\|00 /i s- co mb in e\|00/is-combine\| Now I can update the metadata with a Vim macro which jumps to the next hunk and increments /is-combine on the previous line.
*	start hacky experiment to support combining chars	Kartik K. Agaram	2021-08-31	5	-6/+4384
\| \| \| \| \| \| \| \| \| \| \|	https://en.wikipedia.org/wiki/Combining_character The plan: just draw the combining character in the same space as the previous character. This will almost certainly not work for some Unicode blocks (tibetan?) This commit only changes the data/memory/disk model to make some space. As always in Mu, we avoid bit-mask tricks even if that wastes memory.
*	.	Kartik K. Agaram	2021-08-31	2	-2/+2
\|
*	fix a typo from commit a479f0d0837	Kartik K. Agaram	2021-08-31	1	-1/+1
\| \| \| \| \| \| \| \|	Yet another gnarly reason to start checking all arg metadata in linux/pack.subx or something like that. With this bug most of my programs (including browser-slack!) were working even though the instruction stream was almost certainly misdecoded halfway through every attempt to draw glyphs.
*	slack: start rendering unicode	Kartik K. Agaram	2021-08-30	1	-24/+25
\|
*	.	Kartik Agaram	2021-08-30	61	-6942/+16397
\|
*	.	Kartik K. Agaram	2021-08-30	2	-5/+5
\| \| \| \|	Open question fixed.
*	first rendering of non-latin script	Kartik K. Agaram	2021-08-30	2	-2/+127
\| \| \| \| \|	Open question: why does column 0 get cropped? The spacing also seems excessive. Are we taking up 3 grid points?
*	.	Kartik K. Agaram	2021-08-30	1	-1/+63
\|
*	fix bad terminology: grapheme -> code point	Kartik K. Agaram	2021-08-29	15	-193/+210
\| \| \| \| \| \| \| \| \| \|	Unix text-mode terminals transparently support utf-8 these days, and so I treat utf-8 sequences (which I call graphemes in Mu) as fundamental. I then blindly carried over this state of affairs to bare-metal Mu, where it makes no sense. If you don't have a terminal handling font-rendering for you, fonts are most often indexed by code points and not utf-8 sequences.
*	.	Kartik K. Agaram	2021-08-29	2	-8/+8
\|
*	.	Kartik K. Agaram	2021-08-29	3	-8/+12
\|
*	load Font in a non-contiguous area of memory	Kartik K. Agaram	2021-08-29	5	-13/+44
\|
*	.	Kartik K. Agaram	2021-08-29	13	-26/+3
\|
*	.	Kartik K. Agaram	2021-08-29	1	-1/+1
\|
*	.	Kartik K. Agaram	2021-08-29	1	-3/+3
\|
*	improve translation scripts	Kartik K. Agaram	2021-08-29	2	-8/+38
\|
*	.	Kartik K. Agaram	2021-08-29	1	-2217/+2217
\|
*	inline SubX translation	Kartik K. Agaram	2021-08-29	4	-132/+98
\| \| \| \| \|	We can't really translate purely SubX code anyway at the top-level. Stop exposing those scripts.
*	retreat to 640KB	Kartik K. Agaram	2021-08-29	3	-17/+5
\|
*	build still broken	Kartik K. Agaram	2021-08-29	3	-2/+30
\| \| \| \| \|	Now we load all the code, but it overwrites the extended BIOS area. 640KB is no longer enough. Need to rethink loading strategy.
*	import a few more unicode blocks from Unifont	Kartik K. Agaram	2021-08-29	2	-6/+8527
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	shell/ is currently broken; we've overflowed available contiguous space for code. Block names based on https://www.compart.com/en/unicode/block: 0x0000 - 0x007f Basic Latin 128 0x0080 - 0x00ff Latin-1 Supplement 128 0x0100 - 0x017f Latin Extended-A 128 0x0180 - 0x024f Latin Extended-B 208 0x0250 - 0x02af IPA Extensions 96 0x02b0 - 0x02ff Spacing Modifier Letters 80 0x0300 - 0x036f Combining Diacritical Marks 112 0x0370 - 0x03ff Greek and Coptic 135 0x0400 - 0x04ff Cyrillic 256 0x0500 - 0x052f Cyrillic Supplement 48 0x0530 - 0x058f Armenian 91 0x0590 - 0x05ff Hebrew 88 0x0600 - 0x06ff Arabic 255 0x0700 - 0x074f Syriac 77 0x0750 - 0x077f Arabic Supplement 48 0x0780 - 0x07bf Thaana 50 0x07c0 - 0x07ff NKo 62 0x0800 - 0x083f Samaritan 61 0x0840 - 0x085f Mandaic 29 0x0860 - 0x086f Syriac Supplement 11 0x08a0 - 0x08ff Arabic Extended-A 84 0x0900 - 0x097f Devanagari 128 0x0980 - 0x09ff Bengali 96 0x0a00 - 0x0a7f Gurmukhi 80 0x0a80 - 0x0aff Gujarati 91 0x0b00 - 0x0b7f Oriya 91 0x0b80 - 0x0bff Tamil 72 0x0c00 - 0x0c7f Telugu 98 0x0c80 - 0x0cff Kannada 89 0x0d00 - 0x0d7f Malayalam 118 0x0d80 - 0x0dff Sinhala 91 0x0e00 - 0x0e7f Thai 87 0x0e80 - 0x0eff Lao 82 0x0f00 - 0x0fff Tibetan 211 0x1000 - 0x109f Myanmar 160 0x10a0 - 0x10ff Georgian 88 But don't trust the block sizes above. Thanks to gdb[1] for this helper: define z print 2 * (0x$arg1 - 0x$arg0 + 1) end e.g: (gdb) z 10a0 10ff 192 [1] https://sourceware.org/gdb/current/onlinedocs/gdb/Define.html
*	render wide glyphs in the font	Kartik K. Agaram	2021-08-29	2	-5/+36
\|
*	bugfix in commit 8e182e394	Kartik K. Agaram	2021-08-29	1	-1/+3
\| \| \| \| \|	Any command in shell that rendered the screen resulted in an infinite loop. But it took me forever to even realize it was an infinite loop.
*	.	Kartik K. Agaram	2021-08-29	1	-28/+50
\|
*	.	Kartik K. Agaram	2021-08-29	1	-8/+7
\|
*	.	Kartik K. Agaram	2021-08-29	1	-3/+3
\|
*	.	Kartik K. Agaram	2021-08-29	1	-1/+2
\|
*	width-aware drawing primitives	Kartik K. Agaram	2021-08-29	6	-52/+215
\| \| \| \|	No support yet for drawing wide graphemes.
*	.	Kartik K. Agaram	2021-08-28	6	-29/+21
\|
*	.	Kartik K. Agaram	2021-08-28	1	-59/+44
\|
*	support unused screen-cells in fake screens	Kartik K. Agaram	2021-08-28	2	-0/+31
\| \| \| \| \| \|	We'll need this when rendering 16-bit glyphs. They'll occupy two 8x16 display units on screen, but the grapheme is a single unit as far as fake screens are concerned.
*	.	Kartik K. Agaram	2021-08-28	1	-179/+187
\|
*	.	Kartik K. Agaram	2021-08-28	4	-80/+80
\| \| \| \|	Convert some old code to current idioms.
*	font data structure now supports 16-bit glyphs	Kartik K. Agaram	2021-08-28	3	-138/+266
\| \| \| \|	We can't yet render the latter 8 bits.
*	reorganize font before adding non-ASCII	Kartik K. Agaram	2021-08-27	4	-240/+240
\|
*	compute-offset: literal index	Kartik K. Agaram	2021-08-25	3	-5/+132
\|
*	.	Kartik K. Agaram	2021-08-25	1	-6/+6
\|
*	another long-overdue bugfix	Kartik K. Agaram	2021-08-22	2	-3/+87
\| \| \| \| \| \| \|	If I forgot a 'var', Mu would interpret the ':' in the var declaration as a named block, and all parsing after would be thrown off. Perhaps I should use separate characters for defining blocks vs vars.
*	fix a long-standing bug in Mu's translator	Kartik K. Agaram	2021-08-22	2	-0/+47
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	While all test pass, this change is disquieting. When I first designed Mu I deliberately chose to exclude literal strings from most primitive instructions both for type-checking and to avoid silently passing through strange constructions. Nobody really needs to add a string to a number, and am I sure no SubX instruction will cause a memory safety issue when passed a string literal instead of a number? But clearly I have no tests encoding this desire. And any string literal could be replaced by an integer literal containing the exact same value, so what are we protecting against anyway. Let me fix the bug for now. If I run into problems I'll come back and do this right.
*	start throwing error on labels too far for /disp8	Kartik K. Agaram	2021-08-22	2	-0/+70
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	While I'm doing this I might as well lay out a story I don't seem to have told before in this commit log. I translated Mu programs to Linux before I did so to bare metal like I do in the top-level these days. The translator programs still run from the linux/ directory. However they don't always have good error messages. As long as I was translating to Linux this wasn't a huge deal because I always translated Mu programs using the bootstrap translator in linux/bootstrap/ -- which has great error messages. However, linux/bootstrap/ can't build bare-metal programs because boot.subx uses real-mode instructions that aren't supported. As a hack I created a script called misc_checks that at least tries to run everything besides boot.subx -- even though translation can never succeed. If I run it and get to errors about unknown variables I know everything besides boot.subx raised no errors. Having labels too far in /disp8 args is is the single biggest reason we need the misc_checks hack. Hopefully it's now obsolete.
*	.	Kartik K. Agaram	2021-08-22	2	-3/+4
\|
*	start throwing error on duplicate label	Kartik K. Agaram	2021-08-22	24	-76/+385
\| \| \| \| \| \| \| \| \| \|	One less error that's only in the bootstrap phase. On the other hand, for simplicity I got rid of the ability to override the Entry label. One less special case, but we're also going further from the ability to run subsets of layers. We haven't really been exercising it for a long time, though (commit 7842, March 2021 when we made baremetal the default).
*	.	Kartik K. Agaram	2021-08-22	2	-16/+12
\|
*	.	Kartik Agaram	2021-08-15	94	-14489/+19517
\|
*	move gap buffer code to top-level	Kartik K. Agaram	2021-08-15	4	-2026/+0
\| \| \| \|	Now that it's been used in a second app without needing any changes.