mu - Soul of a tiny new machine. More thorough tests → More comprehensible and rewrite-friendly software → More resilient society.

	Commit message (Collapse)	Author	Age	Files	Lines
*	5461 - SubX is now self-hosting!	Kartik Agaram	2019-07-24	1	-0/+0
\| \| \| \| \|	All extant SubX programs generate identical binaries using either the C++ or the self-hosted SubX translators.
*	5460	Kartik Agaram	2019-07-24	1	-0/+0
\| \| \| \| \| \| \| \| \| \| \|	A little more resizing of buffers. apps/hex.subx is now building an identical binary. I'm now aborting on allocation failures. That requires disabling a couple of tests. (I'm not quite confident enough of this decision to delete them outright.) I want to treat all segfaults as bugs, and machine code is no place to add boilerplate checks for return values of standard library functions.
*	5455	Kartik Agaram	2019-07-22	1	-0/+0
\| \| \| \| \|	Clean up. All apps now translating correctly except for the phases of the self-hosted translator. Next step: SubX-in-SubX in SubX-in-SubX.
*	5454	Kartik Agaram	2019-07-22	1	-0/+0
\| \| \| \|	Bugfix fifteen -- on the C++ side.
*	5453	Kartik Agaram	2019-07-22	1	-0/+0
\| \| \| \|	All that debugging and it turns out the bug is on the C++ side!
*	5452	Kartik Agaram	2019-07-22	1	-0/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Snapshot while debugging survey.subx by print. I can see the error in 1 minute with this command: subx run apps/survey < a.pack (where a.pack is obtained from `ntranslate 049.subx 05[0-8].subx`) By contrast, using the trace requires 4.5 minutes: subx --trace run apps/survey < a.pack It generates a trace of 4.4GB with almost 83M lines. The trace takes 2 minutes to load.. oops, I forgot to load labels with `--debug`.
*	5451	Kartik Agaram	2019-07-22	1	-0/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Even though the standard library is building and passing tests, the binaries it generates aren't exactly bit for bit identical with the originals. Comparing using `diff_ntranslate`, it looks like the data segment starting address isn't computed right in survey.subx (`compute-addresses`) when I start translating layer 058. Deleting some tests brings the code segment to a p_offset where bits 8-11 (the lowest 4 bits excluding the lowermost byte) are cleared and everything works. However, if bits 8-11 are set, then they don't make it to p_vaddr and p_paddr. Tried reproducing with a unit test, but the unit test passes fine.
*	5443 - standard library is now self-hosted	Kartik Agaram	2019-07-22	1	-0/+0
\| \| \| \| \| \| \|	Translates 5k lines of input in 26 seconds. I'm not sure why I need to grow the label table. It was already 512 entries long, and I'm only using 373 so far.
*	5442	Kartik Agaram	2019-07-22	1	-0/+0
\| \| \| \| \| \| \| \| \|	We can now translate layers 49-72 using the self-hosted translator. The translator has now demonstrated translation over 4k lines. Most verbose phase output is 325KB, even if the final binary is 15KB. Emulation is too slow now, so I'm back to debug by print on a Linux machine.
*	5438 - raise error on uppercase hex	Kartik Agaram	2019-07-21	1	-0/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We can now translate layers 49-56 using the self-hosted translator (`translate` and `ntranslate`). As a follow-up to commit 5404, the self-hosted translator is a little more strict than the C++ translator in 3 places: a) All .subx files must define a data segment. b) All .subx files must define an `Entry` label. c) All numbers must be in lowercase hex. In all cases, where programs work with the C++ translator but violate the self-hosted translator's assumptions, we must make sure we raise errors rather than silently emit bad code.
*	5437	Kartik Agaram	2019-07-21	1	-0/+0
\| \| \| \|	Break a dependency from `print-int32` to `from-hex-char`.
*	5436 - support newlines in dquotes	Kartik Agaram	2019-07-21	1	-0/+0
\|
*	5435 - redo 5426	Kartik Agaram	2019-07-21	1	-0/+0
\| \| \| \| \|	We can now translate layers 49-55 using translate and ntranslate. Next step is to support '\n' in dquotes.subx.
*	5434	Kartik Agaram	2019-07-21	1	-0/+0
\|
*	5433	Kartik Agaram	2019-07-21	1	-0/+0
\|
*	5429	Kartik Agaram	2019-07-20	1	-0/+0
\| \| \| \|	Fix CI.
*	5426	Kartik Agaram	2019-07-20	1	-0/+0
\|
*	5422	Kartik Agaram	2019-07-19	1	-0/+0
\| \| \| \| \|	Various buffer sizes needed to be grown for ex11. But the next bottleneck is that we need to code-generate run-tests.
*	5419	Kartik Agaram	2019-07-19	1	-0/+0
\| \| \| \| \| \| \| \| \| \| \|	Bugfix fourteen: we need different address computation logic for code vs data labels. It's really about different categories of instructions having different address computation logic. This subtle distinction will make good error messages hard. But that's a problem for later. Now there's just one example program not translating.
*	5417	Kartik Agaram	2019-07-18	1	-0/+0
\| \| \| \|	Clean up.
*	5416	Kartik Agaram	2019-07-18	1	-0/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Figured out what's going in with bug fourteen: displacement operands aren't always used relative to the PC. Does this mean I need to track instruction boundaries past pack? :'( No, I just need different logic for labels in code vs data segments. This was an interesting bug for reminding me of the difference between the emulator-level trace and the application-level trace. The former has 1.5 million lines, while the latter has a dozen. Luckily, just dumping the latter immediately made obvious what the issue was. Though this experience does suggest some further ideas for debugging tools: slice trace by line and phase slice trace by start and end label debug UI for SubX translator 2D layout: rows = lines of code; columns = translator phases each 'cell' in this layout contains a list of log lines shows what came in, what was emitted easily collapse any cell These are domain-specific tools. Special-cased to the SubX translator phases.
*	5415	Kartik Agaram	2019-07-17	1	-0/+0
\| \| \| \| \| \| \| \|	Bugfix thirteen: displacement calculations were wrong because current offset was not being updated properly as words were being read and emitted. Now 10/12 example programs are translated correctly.
*	5409	Kartik Agaram	2019-07-17	1	-0/+0
\| \| \| \| \|	Bugfix eleven: segment flags were incorrectly computed. examples/ex1 now verified! Added to CI.
*	5408	Kartik Agaram	2019-07-16	1	-0/+0
\| \| \| \| \| \| \| \| \| \| \| \|	Bugfix ten: type error in `convert`. I was calling `rewind-stream` on a `buffered-file`. examples/ex1 is now just one nibble off the canonical. I have found one missing feature in the self-hosted translator, though: dquotes doesn't support newlines in strings, even though the C++ version does. dquotes parses them right, but the value initialized in the data segment is wrong.
*	5407	Kartik Agaram	2019-07-15	1	-0/+0
\| \| \| \| \| \| \|	Bugfix nine: flush(out) after translation is done. Still one remaining bug from comparing ELF binaries: emit-segments prints nothing for some reason.
*	5406	Kartik Agaram	2019-07-15	1	-0/+0
\| \| \| \| \| \| \|	Bugfix eight: incorrect segment count in ELF header. The generated examples/ex1 is still not right. But it has the second segment now. Or almost all of it. Final byte is missing for some reason.
*	5404 - subx/examples/ex1 now translating	Kartik Agaram	2019-07-15	1	-0/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The result isn't an identical binary to before, and it segfaults when run. But it's bugfix seven. A couple of places where we make .subx files a little more strict: a) All .subx files must define a data segment. Even if they have no data. b) All .subx files must define an `Entry` label for the binary to start at. Earlier we used to default to the start of the code label. That's not too hard to add; we'd just need to: i) rename `get` to `get-or-abort` ii) clone a third variant of `get-or-insert` called `get` that returns null if the key is not found. iii) use `get` rather than `get-or-abort` when looking up the `Entry` label.
*	.	Kartik Agaram	2019-07-13	1	-0/+0
\| \| \| \|	Clean up.
*	survey.subx now passing all tests	Kartik Agaram	2019-07-13	1	-0/+0
\|
*	`test-convert-computes-addresses` bugfix six	Kartik Agaram	2019-07-13	1	-0/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	map of how far we've gotten by now (functions with '' independently tested): ✓ compute-offsets ✓ compute-addresses* ✓ emit-output ✓ emit-headers ✓ emit-elf-header ✓ emit-hex-array* ✓ first emit-elf-program-header-entry ✓ emit-hex-array* ? second emit-elf-program-header-entry emit-hex-array* emit-segments*
*	fixed fifth bug, hit sixth	Kartik Agaram	2019-07-13	1	-0/+0
\|
*	.	Kartik Agaram	2019-07-13	1	-0/+0
\| \| \| \|	Clean up.
*	fixed fourth bug, hit fifth	Kartik Agaram	2019-07-13	1	-0/+0
\|
*	fixed one bug, hit another	Kartik Agaram	2019-07-12	1	-0/+0
\| \| \| \| \| \| \| \| \| \|	I carefully logged the segment a label is declared in but forgot to actually save it in the table. This has been a theoretic concern for some time, but I've never seen it actually happen until now. SubX is just too low level. Now I get past the first two phases but code generation fails to find the 'Entry' label.
*	compute-offsets test now passing	Kartik Agaram	2019-07-12	1	-0/+0
\| \| \| \|	The final integration test-convert-computes-addresses is still failing.
*	one failure remaining in test-compute-offsets	Kartik Agaram	2019-07-11	1	-0/+0
\| \| \| \| \| \| \| \| \|	'curr-segment-name' is now a string, and it's stored in a register rather than a global. Paradoxically, this leaks less than before. Before, every call to `get-or-insert-slice` leaked memory. Now we leak one string for every new segment. Which is trivial.
*	.	Kartik Agaram	2019-07-10	1	-0/+0
\|
*	.	Kartik Agaram	2019-07-10	1	-0/+0
\| \| \| \|	I think we're calling the wrong variant here.
*	start distinguishing table lookups from inserts	Kartik Agaram	2019-07-10	1	-0/+0
\|
*	zero out new rows returned by get-or-insert	Kartik Agaram	2019-07-10	1	-0/+0
\|
*	.	Kartik Agaram	2019-07-10	1	-0/+0
\| \| \| \| \|	Another batch of incorrectly signed conditional jumps. (Follow-up to commit 5180.)
*	.	Kartik Agaram	2019-07-10	1	-0/+0
\| \| \| \|	Fix infinite loop in the 2 remaining failing tests; now it's a segfault.
*	.	Kartik Agaram	2019-07-10	1	-0/+0
\|
*	mostly done with emit-output	Kartik Agaram	2019-07-09	1	-0/+0
\| \| \| \| \|	Some nooks and crannies will need light final debugging with xxd, but emit-hex-output covers most of the logic.
*	done with emit-segments	Kartik Agaram	2019-07-09	1	-0/+0
\| \| \| \|	Only failures now are the first two tests in survey.subx.
*	clean up after a few calls	Kartik Agaram	2019-07-09	1	-0/+0
\|
*	preserve truthiness of non-booleans	Kartik Agaram	2019-07-09	1	-0/+0
\| \| \| \| \|	Everywhere we check if something is true, we check it by comparing against 0, not 1.
*	this looks like a bug	Kartik Agaram	2019-07-09	1	-0/+0
\| \| \| \| \|	But of course the test is still infinite-looping so we can't be sure. I'm still just reading the code.
*	.	Kartik Agaram	2019-07-09	1	-0/+0
\|
*	move 'segment-start' to a global variable as well	Kartik Agaram	2019-07-09	1	-0/+0
\| \| \| \| \|	It's ugly to have function locals in global variables, but we'll figure out later how to deal with it.