mu - Soul of a tiny new machine. More thorough tests → More comprehensible and rewrite-friendly software → More resilient society.

	Commit message (Collapse)	Author	Age	Files	Lines
*	6082 - bugfix in spilling register vars	Kartik Agaram	2020-03-06	1	-3/+263
\|	In the process I'm starting to realize that my approach to avoiding spills isn't ideal. It works for local variables but not to avoid spilling outputs.
\|	To correctly decide whether to spill to an output register or not, we really need to analyze when a variable is live. If we don't do that, we'll end up in one of two bad situations:
\|	  a) Don't spill the outermost use of an output register (or just the outermost scope in a function). This is weird because it's hard to explain to the programmer why they can overwrite a local with an output above a '{' but not below.
\|	  b) Disallow overwriting entirely. This is easier to communicate but quite inconvenient. It's nice to be able to use eax for some temporary purpose before overwriting it with the final result of a function.
\|	If we instead track liveness, things are convenient and also easier to explain. If a temporary is used after the output has been written that's an obvious problem: "you clobbered the output". (It seems more reasonable to disallow multiple live ranges for the output. Once an output is written it can only be shadowed in a nested block.)
\|	That's the bad news. Now for some good news: One lovely property Mu the language has at the moment is that live ranges are guaranteed to be linear segments of code. We don't need to analyze loop-carried dependences. This means that we can decide whether a variable is live purely by scanning later statements for its use. (Defining 'register use' is slightly non-trivial; primitives must somehow specify when they read their output register.)
\|	So we don't actually need to worry about a loop reading a register with one type and writing to another type at the end of an iteration. The only way that can happen is if the write at the end was to a local variable, and we're guaranteeing that local variables will be reclaimed at the end of the iteration.
\|	So, the sequence of tasks:
\|	  a) compute register liveness
\|	  b1) verify that all register variables used at any point in a program are always the topmost use of that register.
\|	  b2) decide whether to spill/shadow, clobber or flag an error.
\|	There's still the open question of where to attach liveness state. It can't be on a var, because liveness varies by use of the var. It can't be on a statement because we may want to know the liveness of variables not referenced in a given statement. Conceptually we want a matrix of locals x stmts (flattened). But I think it's simpler than that. We just want to know liveness at the time of variable declarations. A new register variable can be in one of three states w.r.t. its previous definition: either it's shadowing it, or it can clobber it, or there's a conflict and we need to raise an error.
\|	I think we can compute this information for each variable definition by an analysis similar to existing ones, maintaining a stack of variable definitions. The major difference is that we don't pop variables when a block ends.
\|	Details to be worked out. But when we do I hope to get these pending tests passing.
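The liveness scan described in this message can be sketched roughly as follows. This is an illustration in Python, not code from Mu: the `Stmt` record, the `is_live_after` function and the `live_at_end` parameter (standing in for function outputs) are all hypothetical names. The sketch relies only on the stated property that live ranges are linear, so a forward scan over later statements is enough.

```python
from dataclasses import dataclass, field


@dataclass
class Stmt:
    # Hypothetical statement record: the registers a statement reads and writes.
    reads: set = field(default_factory=set)
    writes: set = field(default_factory=set)


def is_live_after(stmts, index, reg, live_at_end=frozenset()):
    """Return True if the value held in `reg` after stmts[index] is still needed.

    Scans later statements in order: a read before any write means the value
    is live; a write without a read means the old value is dead.
    `live_at_end` (an assumption of this sketch) lists registers that remain
    live when the statements run out, e.g. function outputs.
    """
    for stmt in stmts[index + 1:]:
        if reg in stmt.reads:
            return True
        if reg in stmt.writes:
            return False
    return reg in live_at_end


# eax is used as a temporary, then overwritten with the final result.
example = [
    Stmt(writes={"eax"}),                  # 0: temporary placed in eax
    Stmt(reads={"eax"}, writes={"ecx"}),   # 1: temporary consumed
    Stmt(writes={"eax"}),                  # 2: final result written to eax
]
```

Here the temporary is live after statement 0, dead after statement 1, and the final write is live only if eax is listed in `live_at_end`. How this feeds the shadow/clobber/error decision is left open, as in the message itself.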
*	6079 - optimize register spills	Kartik Agaram	2020-03-05	1	-23/+211
\|	The second var to the same register in a block doesn't need to spill. We're never going to restore the var it's shadowing.
*	6074	Kartik Agaram	2020-02-29	1	-21/+25
\|
*	6073	Kartik Agaram	2020-02-29	1	-11/+28
\|
*	6071 - array indexing for non-int power-of-2 types	Kartik Agaram	2020-02-29	1	-11/+163
\|
*	6070	Kartik Agaram	2020-02-29	1	-1/+1
\|
*	6069	Kartik Agaram	2020-02-29	1	-24/+24
\|
*	6062	Kartik Agaram	2020-02-27	1	-10/+92
\|
*	6061	Kartik Agaram	2020-02-27	1	-22/+22
\|
*	6055 - record types and the 'get' instruction	Kartik Agaram	2020-02-27	1	-6/+487
\|	This is a lot of code for a single test, and it took a long time to get my data model just right. But the test coverage seems ok because it feels mostly like straight-line code. We'll see.
\|	I've also had to add a lot of prints. We really need app-level trace generation pretty urgently. That requires deciding how to turn it on/off from the commandline. And I've been reluctant to start relying on the hairy interface that is POSIX open().
*	6054	Kartik Agaram	2020-02-24	1	-3/+21
\|
*	6053	Kartik Agaram	2020-02-23	1	-2/+2
\|
*	6052	Kartik Agaram	2020-02-23	1	-10/+21
\|
*	6051	Kartik Agaram	2020-02-23	1	-13/+25
\|
*	6050	Kartik Agaram	2020-02-23	1	-26/+34
\|
*	6049	Kartik Agaram	2020-02-23	1	-11/+11
\|
*	6048	Kartik Agaram	2020-02-21	1	-28/+9
\|
*	6047	Kartik Agaram	2020-02-21	1	-2/+2
\|
*	6045	Kartik Agaram	2020-02-21	1	-16/+16
\|
*	6044	Kartik Agaram	2020-02-21	1	-68/+68
\|
*	6043	Kartik Agaram	2020-02-21	1	-0/+52
\|	Test for 'index'.
*	6041 - array indexing starting to work	Kartik Agaram	2020-02-21	1	-5/+99
\|	And we're using it now in factorial.mu!
\|	In the process I had to fix a couple of bugs in pointer dereferencing.
\|	There are still some limitations:
\|	  a) Indexing by a literal doesn't work yet.
\|	  b) Only arrays of ints supported so far.
\|	Looking ahead, I'm not sure how I can support indexing arrays by non-literals (variables in registers) unless the element size is a power of 2.
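To illustrate why a power-of-2 element size matters when the index lives in a register: the byte offset can then be computed with a shift instead of a general multiply. The sketch below is in Python, and the helper names `shift_for` and `element_offset` are hypothetical, not taken from Mu.

```python
def shift_for(element_size):
    """Return log2(element_size) if it is a power of 2, else None."""
    if element_size <= 0 or element_size & (element_size - 1):
        return None
    return element_size.bit_length() - 1


def element_offset(index, element_size):
    """Byte offset of element `index`, computed with a shift only."""
    shift = shift_for(element_size)
    if shift is None:
        # This sketch deliberately has no fallback multiply.
        raise ValueError(f"element size {element_size} is not a power of 2")
    return index << shift
```

For example, `element_offset(3, 4)` shifts the index left by 2, while an element size of 12 is rejected.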
*	6037 - first passing test for pointer lookup	Kartik Agaram	2020-02-20	1	-67/+192
\|
*	6036	Kartik Agaram	2020-02-20	1	-5/+6
\|
*	6035	Kartik Agaram	2020-02-20	1	-18/+18
\|
*	6034	Kartik Agaram	2020-02-20	1	-20/+20
\|
*	6033 - save pointer lookup state while parsing	Kartik Agaram	2020-02-20	1	-30/+37
\|
*	6032 - make room for '*' pointer lookups in stmts	Kartik Agaram	2020-02-20	1	-31/+89
\|
*	6031 - bugfix in selecting codegen pattern	Kartik Agaram	2020-02-20	1	-6/+23
\|
*	6030	Kartik Agaram	2020-02-20	1	-11/+9
\|
*	6029	Kartik Agaram	2020-02-20	1	-4/+4
\|
*	6028	Kartik Agaram	2020-02-20	1	-4/+4
\|
*	6023 - bug: vars with both stack-offset and reg	Kartik Agaram	2020-02-18	1	-10/+20
\|	This was initially disquieting; was I writing enough tests? Then I noticed I had TODOs for some missing checks.
*	6022 - initial sketch of array length	Kartik Agaram	2020-02-18	1	-0/+75
\|	This is a particularly large abstraction leak: SubX arrays track their lengths in bytes, and therefore Mu as well.
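The leak can be made concrete with a small sketch: a length stored in bytes only becomes an element count once the element size is known. This is Python for illustration, and `length_in_elements` is a hypothetical helper, not a Mu or SubX primitive.

```python
def length_in_elements(length_in_bytes, element_size):
    """Convert a byte length, as an array stores it, to an element count."""
    if element_size <= 0:
        raise ValueError("element size must be positive")
    if length_in_bytes % element_size:
        raise ValueError("byte length is not a multiple of the element size")
    return length_in_bytes // element_size
```

The same 12-byte array reports 3 elements at size 4 and 12 elements at size 1, which is the detail a caller has to keep in mind.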
*	6022	Kartik Agaram	2020-02-18	1	-1/+4
\|	Forgot to actually use the new type-dispatch in commit 6017.
*	6021	Kartik Agaram	2020-02-18	1	-0/+22
\|
*	6020	Kartik Agaram	2020-02-18	1	-55/+47
\|	Some deduplication, though this may be a premature abstraction.
*	6019 - finish supporting all branch primitives	Kartik Agaram	2020-02-18	1	-6/+233
\|	I'd been thinking I didn't need unconditional `break` instructions, but I just realized that non-local unconditional breaks have a use. Stop over-thinking this, just support everything.
\|	The code is quite duplicated.
*	6017 - simplify type-dispatch for primitives	Kartik Agaram	2020-02-17	1	-26/+66
\|	We'll be doing type-checking in a separate phase in future. For now we need only to distinguish between literals and non-literals for x86 primitive instructions.
\|	I was tempted to support x86 set__ instructions for this change:
\|	  https://c9x.me/x86/html/file_module_x86_id_288.html
\|	That will happen at some point. And I'll simplify a bunch of branches for results of predicate functions when it happens.
*	6016	Kartik Agaram	2020-02-17	1	-5/+5
\|
*	6014	Kartik Agaram	2020-02-17	1	-58/+58
\|
*	6011	Kartik Agaram	2020-02-16	1	-5/+3
\|
*	6009 - significantly cleaner lexing	Kartik Agaram	2020-02-16	1	-132/+32
\|	This cleans up a bunch of little warts that had historically accumulated because of my bull-headedness in not designing a grammar up front. Let's see if the lack of a grammar comes up again.
\|	We now require that there be no space in variable declarations between the name and the colon separating it from its type.
*	6008	Kartik Agaram	2020-02-16	1	-17/+17
\|	Allow comments at the end of all kinds of statements.
\|	To do this I replaced all calls to next-word with next-mu-token.. except one. I'm not seeing any bugs yet, any places where comments break things. But this exception makes me nervous.
*	6005	Kartik Agaram	2020-02-14	1	-4/+40
\|	Support calling SubX code from Mu. I have _zero_ idea how to make this safe.
\|	Now we can start writing tests. We can't use commandline args yet. That requires support for kernel strings.
*	6000 - clean up after no-local branches	Kartik Agaram	2020-02-09	1	-10/+181
\|
*	5998 - redo code-generation for 'break'	Kartik Agaram	2020-02-09	1	-77/+130
\|	I've been saying that we can convert this:
\|	  {
\|	    var x: int
\|	    break-if-=
\|	    ...
\|	  }
\|	..into this:
\|	  {
\|	    68/push 0/imm32
\|	    {
\|	      0f 84/jump-if-= break/disp32
\|	      ...
\|	    }
\|	    81 0/subop/add %esp 4/imm32
\|	  }
\|	All subsequent instructions go into a nested block, so that they can be easily skipped without skipping the stack cleanup.
\|	However, I've been growing aware that this is a special case. Most of the time we can't use this trick:
\|	  for loops
\|	  for non-local breaks
\|	  for non-local loops
\|	In most cases we need to figure out all the intervening variables on the stack and emit code to clean them up.
\|	And now it turns out even for local breaks like above, the trick doesn't work. Consider what happens when there's a loop later in the block:
\|	  {
\|	    var x: int
\|	    break-if-=
\|	    ...
\|	  }
\|	If we emitted a nested block for the break, the local loop would become non-local. So we replace one kind of state with another.
\|	Easiest course of action is to just emit the exact same cleanup code for all conditional branches.
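A rough sketch of the "figure out all the intervening variables on the stack" step, in Python rather than Mu. It assumes a hypothetical representation in which each enclosing block is a `(label, [variable sizes in bytes])` pair, listed outermost to innermost, and it ignores variables held in registers. The emitted text reuses the `81 0/subop/add %esp N/imm32` form quoted in this message; everything else is illustrative.

```python
def bytes_to_reclaim(block_stack, target_label):
    """Total stack bytes to drop before jumping out to `target_label`.

    Walks from the innermost block outward, summing variable sizes up to and
    including the target block, whose own cleanup code the jump also skips.
    """
    total = 0
    for label, var_sizes in reversed(block_stack):
        total += sum(var_sizes)
        if label == target_label:
            return total
    raise ValueError(f"no enclosing block named {target_label!r}")


def emit_cleanup(block_stack, target_label):
    """Return the cleanup instruction(s) to emit before the branch."""
    n = bytes_to_reclaim(block_stack, target_label)
    return [] if n == 0 else [f"81 0/subop/add %esp {n}/imm32"]


# One int in the outer block, two ints in the inner block.
blocks = [("outer", [4]), ("inner", [4, 4])]
```

With this layout, a break out of `inner` reclaims 8 bytes and a non-local break out of `outer` reclaims 12. Emitting the same cleanup before every conditional branch, local or not, matches the "easiest course of action" the message settles on.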
*	5997 - clean up after unconditional loops	Kartik Agaram	2020-02-09	1	-34/+122
\|	Turns out we can't handle them like conditional loops.
\|	This function to emit cleanup code for jumps is getting quite terrible. I don't yet know what subsidiary abstractions it needs.
*	5996	Kartik Agaram	2020-02-09	1	-0/+12
\|
*	5993 - support for unlabeled loop instructions	Kartik Agaram	2020-02-08	1	-3/+213
\|	Now that we have the infrastructure for emitting cleanup blocks, the labeled variants should be easy as well.