mu - Soul of a tiny new machine. More thorough tests → More comprehensible and rewrite-friendly software → More resilient society.

	Commit message (Collapse)	Author	Age	Files	Lines
*	5001 - drop the :(scenario) DSL	Kartik Agaram	2019-03-12	19	-2052/+3144
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I've been saying for a while[1][2][3] that adding extra abstractions makes things harder for newcomers, and adding new notations doubly so. And then I notice this DSL in my own backyard. Makes me feel like a hypocrite. [1] https://news.ycombinator.com/item?id=13565743#13570092 [2] https://lobste.rs/s/to8wpr/configuration_files_are_canary_warning [3] https://lobste.rs/s/mdmcdi/little_languages_by_jon_bentley_1986#c_3miuf2 The implementation of the DSL was also highly hacky: a) It was happening in the tangle/ tool, but was utterly unrelated to tangling layers. b) There were several persnickety constraints on the different kinds of lines and the specific order they were expected in. I kept finding bugs where the translator would silently do the wrong thing. Or the error messages sucked, and readers may be stuck looking at the generated code to figure out what happened. Fixing error messages would require a lot more code, which is one of my arguments against DSLs in the first place: they may be easy to implement, but they're hard to design to go with the grain of the underlying platform. They require lots of iteration. Is that effort worth prioritizing in this project? On the other hand, the DSL did make at least some readers' life easier, the ones who weren't immediately put off by having to learn a strange syntax. There were fewer quotes to parse, fewer backslash escapes. Anyway, since there are also people who dislike having to put up with strange syntaxes, we'll call that consideration a wash and tear this DSL out. --- This commit was sheer drudgery. Hopefully it won't need to be redone with a new DSL because I grow sick of backslashes.
*	5000	Kartik Agaram	2019-03-12	1	-1/+1
\| \| \| \|	Fix spurious errors in `test_layers` on non-Linux platforms.
*	4999	Kartik Agaram	2019-03-10	7	-1/+3
\| \| \| \| \| \| \| \|	Fix CI. pack.subx was passing in emulation but not natively. Commit 4954 on Feb 10 was a real dud. First I find I forgot to reclaim space for locals (commit 4996). Now I find I haven't been tracking registers properly either.
*	4998	Kartik Agaram	2019-03-09	1	-0/+5
\| \| \| \| \| \| \|	Start running binaries natively in test_layers as well. CI is still broken; need to investigate where my SubX emulation has a discrepancy with native x86.
*	4997	Kartik Agaram	2019-03-09	4	-32/+40
\|
*	4996 - back on pack.subx	Kartik Agaram	2019-03-08	11	-309/+412
\| \| \| \| \| \| \| \| \|	Yet another redrawing of responsibilities between convert and its helpers. In the process I discovered a bug in `write-stream-buffered` which ended up taking me through a detour to extract `browse_trace` into its own tool. It turns out just having long buffers is enough to need browse_trace. Simple operations like clearing a stream swamp a flat view of the trace.
*	4994	Kartik Agaram	2019-03-03	1	-14/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Bring back support for incrementally printing the trace to the screen (stderr). I previously thought I didn't need this as long as I'm always incrementally saving to the 'last_run' trace file. But I quickly ran into a use for it: when I want to see a complete trace including switching into the sandbox's trace and back out again. So there are now two separate commandline flags: --trace to save the trace to file --dump to print the trace to screen The former won't handle sandbox traces. But the latter will. I'm deemphasizing --dump in the help message since it should be rarely used. One other situation where I've used stderr in the past is for just raw convenience. But trying to always use the trace was a foolish consistency. Conclusion: a) For simple debugging, feel free to just use cout/cerr. Delete them before committing. b) If the prints get too complex, switch to the trace and browse_trace tool. c) If using nested sandboxes, emit to stderr, redirect to file, and browse_trace. I've gone back and forth on these questions in the past; now I'm trying to be a little more rigorous about capturing reasoning.
*	4992	Kartik Agaram	2019-02-27	1	-0/+3
\|
*	4988	Kartik Agaram	2019-02-25	6	-0/+0
\|
*	4987 - support `browse_trace` tool in SubX	Kartik Agaram	2019-02-25	24	-448/+539
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I've extracted it into a separate binary, independent of my Mu prototype. I also cleaned up my tracing layer to be a little nicer. Major improvements: - Realized that incremental tracing really ought to be the default. And to minimize printing traces to screen. - Finally figured out how to combine layers and call stack frames in a single dimension of depth. The answer: optimize for the experience of `browse_trace`. Instructions occupy a range of depths based on their call stack frame, and minor details of an instruction lie one level deeper in each case. Other than that, I spent some time adjusting levels everywhere to make `browse_trace` useful.
*	4986 - spending some time improving SubX traces	Kartik Agaram	2019-02-23	2	-10/+15
\| \| \| \| \| \|	Now that our test runs are getting longer, debugging is again becoming a bottleneck. Time to start using trace depths along with `mu browse-trace` from the top-level.
*	4984	Kartik Agaram	2019-02-22	1	-1/+1
\|
*	4983	Kartik Agaram	2019-02-22	7	-66/+67
\| \| \| \| \| \| \|	Standardize name for 'end of file' sentinel. `eof` seems like an ordinary variable, and `EOF` looks too much like a register (particularly in code like `if (EAX == EOF)`), so we'll go with `Eof`. Consistent capitalization for globals, and constants are globals too.
*	4981 - no, go back to 3 phases	Kartik Agaram	2019-02-18	45	-234/+130
\| \| \| \| \| \| \| \| \| \| \| \| \|	Considering how much trouble a merge phase would be (commit 4978), it seems simpler to just add the extra syntax for controlling the entry point of the generated ELF binary. But I wouldn't have noticed this if I hadn't taken the time to write out the commit messages of 4976 and 4978. Even if we happened to already have linked list primitives built, this may still be a good idea considering that I'm saving quite a lot of code in duplicated entrypoints.
*	4978 - maybe we need another phase	Kartik Agaram	2019-02-17	1	-0/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Phase 1: coalesce different instances/fragments for each segment, correctly prepending later fragments. Phase 2: pack bitfields into bytes. Phase 3: compute addresses for labels, compute the ELF header. Phase 4: convert hex bytes to binary. But ugh, phase 1 involves linked lists and I'll have to go down a rabbit hole building up more standard library functions.
*	4977	Kartik Agaram	2019-02-16	1	-4/+8
\|
*	4976 - recommend that operand order be fixed	Kartik Agaram	2019-02-16	1	-14/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I've been allowing operands in any order just because it simplifies implementation. I don't actually rely on this flexibility; all the .subx programs in this repo consistently use a single ordering. Why is a hard-coded canonical order hard to implement? The order that seems most logical to me is complicated by the "reg" bits in the ModR/M byte: - In instructions that interpret it as an `/r32` operand, it needs to be deemphasized because it refers to a different argument of the instruction than the `/mod`, `/rm32`, `/base`, `/index` and `/scale` operands that capture the bulk of instruction decoding complexity and so should be emphasized. `/r32` can also be unused, which strengthens the case for deemphasizing it. - In instructions that interpret the "reg" bits as a `/subop` operand, it should be colocated with the opcode because it performs the same function: specifying the operation the instruction performs. In both cases, the bits in the `reg` bitfield are conceptually unrelated to the other bitfields in the same byte. But they sometimes want to be close to the opcode bytes on the left, and at other times need to be deemphasized rightward. Fixing both these possibilities seems complicated and stateful, particularly since all operands are optional in general. On the other hand, just pulling operands you need to create each byte, regardless of where in the instruction they occur, that's nicely stateless.
*	4975 - new plan for Phase 2	Kartik Agaram	2019-02-16	1	-4/+36
\| \| \| \| \| \|	So far I've been assuming that I'd just statelessly convert each line in a .subx file. But over the past week or so of constant interruptions I gradually realized that code and data need different parsers.
*	4974	Kartik Agaram	2019-02-15	1	-1/+1
\| \| \| \|	Fix CI.
*	4973	Kartik Agaram	2019-02-15	20	-89/+120
\| \| \| \| \|	Support immediate operands in the data segment in all the ways we support them in the code segment.
*	4972	Kartik Agaram	2019-02-15	1	-1/+3
\|
*	4971 - clean up a few scripts	Kartik Agaram	2019-02-15	10	-29/+57
\| \| \| \| \| \| \| \| \| \| \| \| \|	Some of them are no longer useful; drop them. For the rest, have useful usage messages. And also be a little more principled in where we introduce CFLAGS, and where we expect it to come in from the commandline. I'm choosing not to call gen/run/dgen/drun from test_layers because it makes test_layers harder for newcomers to read. The scripts aren't the first thing people should see, they're just useful once you're up and running hacking on SubX.
*	4970	Kartik Agaram	2019-02-15	1	-4/+4
\|
*	4969	Kartik Agaram	2019-02-15	1	-2/+2
\|
*	4968	Kartik Agaram	2019-02-14	7	-0/+275
\|
*	4966	Kartik Agaram	2019-02-14	4	-24/+23
\| \| \| \|	Standardize how we show register allocation decisions.
*	4965	Kartik Agaram	2019-02-14	7	-25/+31
\|
*	4964	Kartik Agaram	2019-02-14	2	-3/+3
\|
*	4963	Kartik Agaram	2019-02-14	4	-9/+9
\|
*	4962	Kartik Agaram	2019-02-14	1	-38/+40
\| \| \| \| \|	Build the C++ version optimized by default when building/running all apps. We have enough apps now that the cost of optimized builds is worthwhile.
*	4961	Kartik Agaram	2019-02-14	31	-209/+253
\|
*	4960	Kartik Agaram	2019-02-13	2	-8/+6
\| \| \| \| \|	I think I don't need to special-case packing for different segments. That should massively cut down on the number of tests.
*	4959	Kartik Agaram	2019-02-13	2	-12/+30
\| \| \| \| \|	It's always seemed ugly to explain the rules for segment names. Let's just always require a fixed name for the code and data segments.
*	4958 - all my apps fit in 4KB so far	Kartik Agaram	2019-02-11	1	-0/+16
\| \| \| \|	(excluding tests)
*	4956	Kartik Agaram	2019-02-11	2	-1/+69
\|
*	4955	Kartik Agaram	2019-02-10	8	-20/+380
\| \| \| \|	Starting to build up Phase 2 (apps/pack) out of recently designed primitives.
*	4954	Kartik Agaram	2019-02-10	9	-39/+141
\|
*	4952	Kartik Agaram	2019-02-05	9	-13/+474
\|
*	4951	Kartik Agaram	2019-02-03	16	-78/+273
\| \| \| \|	Cleaner way to compare streams in tests.
*	4950	Kartik Agaram	2019-02-03	15	-0/+0
\|
*	4949	Kartik Agaram	2019-02-02	24	-42/+62
\|
*	4948	Kartik Agaram	2019-02-02	2	-6/+341
\| \| \| \| \|	This seems like the final helper we need for Phase 2. Now to build the business logic itself.
*	4947	Kartik Agaram	2019-02-01	2	-0/+2
\| \| \| \| \| \| \| \|	Bugfix: has-metadata? was corrupting registers Seems uneconomic to write tests for stuff like this. Assembly is just not the right layer to try to come up with a general solution or process. Keep running your code and wait to find signs of breakage.
*	4946	Kartik Agaram	2019-02-01	2	-2/+66
\|
*	4945	Kartik Agaram	2019-02-01	7	-0/+70
\|
*	4944	Kartik Agaram	2019-01-30	1	-4/+4
\| \| \| \|	Improve syntax highlighting in the Readme.
*	4943	Kartik Agaram	2019-01-30	2	-3/+2
\|
*	4941	Kartik Agaram	2019-01-21	1	-4/+17
\|
*	4940	Kartik Agaram	2019-01-21	1	-4/+4
\|
*	4939	Kartik Agaram	2019-01-21	3	-3/+253
\|