mu - Soul of a tiny new machine. More thorough tests → More comprehensible and rewrite-friendly software → More resilient society.

	Commit message (Collapse)	Author	Age	Files	Lines
*	6121	Kartik Agaram	2020-03-10	1	-3/+3
\|
*	6119	Kartik Agaram	2020-03-10	4	-9522/+10225
\|
*	6118 - support records on the stack	Kartik Agaram	2020-03-10	2	-12/+96
\|
*	6117 - fix offset computation	Kartik Agaram	2020-03-10	1	-13/+15
\| \| \| \| \| \|	It makes no sense to know where the next variable will start before I've seen it or how much space it needs. Things have only been working so far because all variables take 4 bytes.
*	6116 - stack locations now computed during codegen	Kartik Agaram	2020-03-10	2	-66/+91
\| \| \| \| \| \| \| \| \| \| \| \| \|	We can't do it during parsing time because we may not have all type definitions available yet. Mu supports using types before defining them. At first I thought I should do it in populate-mu-type-sizes (appropriately renamed). But there's enough complexity to tracking when stuff lands on the stack that it's easiest to do while emitting code. I don't think we need this information earlier in the compiler. If I'm right, it seems simpler to colocate the computation of state close to where it's used.
*	6115	Kartik Agaram	2020-03-10	2	-27/+0
\|
*	6114	Kartik Agaram	2020-03-10	1	-2/+2
\|
*	6113	Kartik Agaram	2020-03-10	2	-58/+41
\|
*	6112	Kartik Agaram	2020-03-08	2	-21/+112
\| \| \| \| \|	Move computation of offsets to record fields into the new phase as well. Now we should be robust to type definitions in any order.
*	6111	Kartik Agaram	2020-03-08	2	-5/+185
\| \| \| \| \| \| \| \| \| \|	Move out total-size computation from parsing to a separate phase. I don't have any new tests yet, but it's encouraging that existing tests continue to pass. This may be the first time I've ever written this much machine code (with mutual recursion!) and gotten it to work the first time.
*	6110	Kartik Agaram	2020-03-08	1	-2/+3
\|
*	6109	Kartik Agaram	2020-03-08	2	-2/+0
\|
*	6108	Kartik Agaram	2020-03-08	2	-13/+6
\|
*	6107	Kartik Agaram	2020-03-08	2	-2/+15
\| \| \| \| \|	Finally we're now able to track the index of a field in a record/struct/product type.
*	6106	Kartik Agaram	2020-03-08	2	-9/+6
\| \| \| \|	Free up eax using the newly available register.
*	6105	Kartik Agaram	2020-03-08	2	-6/+9
\| \| \| \|	Create space for another local.
*	6104	Kartik Agaram	2020-03-08	2	-7/+12
\| \| \| \| \|	parse-mu-types has a lot of local state. Move a local to the stack to free up a register.
*	6103	Kartik Agaram	2020-03-08	2	-3/+1
\|
*	6102	Kartik Agaram	2020-03-08	1	-26/+0
\|
*	6101	Kartik Agaram	2020-03-08	2	-45/+92
\| \| \| \| \| \| \| \| \|	Make room for additional information for each field in a record/product type. Fields can be used before they're declared, and we may not know the offsets they correspond to at that point. This is going to necessitate a lot of restructuring.
*	6100	Kartik Agaram	2020-03-08	1	-2/+9
\| \| \| \|	Fix a bug with a live register being clobbered.
*	6099	Kartik Agaram	2020-03-08	1	-38/+5
\|
*	6098	Kartik Agaram	2020-03-07	1	-2/+2
\| \| \| \| \|	It was premature to say user-defined record types and array types were done.
*	6097	Kartik Agaram	2020-03-07	2	-4/+1
\| \| \| \| \| \|	I thought I needed to support compute-offset with literal index, but in that case might as well just use an index literal directly. The 'index' instruction with literals already supports non-power-of-2 sizes.
*	6096	Kartik Agaram	2020-03-07	2	-3/+58
\| \| \| \|	A new test, and a new bugfix.
*	6095	Kartik Agaram	2020-03-07	2	-10/+10
\|
*	6094 - new 'compute-offset' instruction	Kartik Agaram	2020-03-07	15	-29/+186
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If indexing into a type with power-of-2-sized elements we can access them in one instruction: x/reg1: (addr int) <- index A/reg2: (addr array int), idx/reg3: int This translates to a single instruction because x86 instructions support an addressing mode with left-shifts. For non-powers-of-2, however, we need a multiply. To keep things type-safe, it is performed like this: x/reg1: (offset T) <- compute-offset A: (addr array T), idx: int y/reg2: (addr T) <- index A, x An offset is just an int that is guaranteed to be a multiple of size-of(T). Offsets can only be used in index instructions, and the types will eventually be required to line up. In the process, I have to expand Input-size because mu.subx is growing big.
*	6093	Kartik Agaram	2020-03-07	2	-111/+168
\| \| \| \|	Some much-needed reorganization.
*	6092	Kartik Agaram	2020-03-06	1	-10/+10
\|
*	6091	Kartik Agaram	2020-03-06	42	-5474/+5848
\|
*	6090 - new instruction: multiply by immediate	Kartik Agaram	2020-03-06	5	-3/+71
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a 3-operand instruction: r32 = rm32 * imm32 It looks like https://c9x.me/x86/html/file_module_x86_id_138.html has a bug, implying the same opcode supports a 2-operand version. I don't see that in the Intel manual pdf, or at alternative sites like https://www.felixcloutier.com/x86/imul Native runs seem to validate my understanding. In the process I also fixed a bug in the existing multiply instruction 0f af: the only flags it sets are OF and CF. The other existing multiply instruction f7 was doing things right.
*	6089	Kartik Agaram	2020-03-06	28	-70/+70
\|
*	6088 - start using setCC instructions	Kartik Agaram	2020-03-06	10	-26/+168
\|
*	6087	Kartik Agaram	2020-03-06	1	-7818/+7865
\|
*	6086 - `index` into arrays with a literal	Kartik Agaram	2020-03-06	2	-30/+114
\|
*	6085	Kartik Agaram	2020-03-06	15	-17/+78
\| \| \| \|	Support parsing ints from strings rather than slices.
*	6084	Kartik Agaram	2020-03-06	21	-9166/+9401
\|
*	6083	Kartik Agaram	2020-03-06	19	-65/+65
\|
*	6082 - bugfix in spilling register vars	Kartik Agaram	2020-03-06	2	-3/+263
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In the process I'm starting to realize that my approach to avoiding spills isn't ideal. It works for local variables but not to avoid spilling outputs. To correctly decide whether to spill to an output register or not, we really need to analyze when a variable is live. If we don't do that, we'll end up in one of two bad situations: a) Don't spill the outermost use of an output register (or just the outermost scope in a function). This is weird because it's hard to explain to the programmer why they can overwrite a local with an output above a '{' but not below. b) Disallow overwriting entirely. This is easier to communicate but quite inconvenient. It's nice to be able to use eax for some temporary purpose before overwriting it with the final result of a function. If we instead track liveness, things are convenient and also easier to explain. If a temporary is used after the output has been written that's an obvious problem: "you clobbered the output". (It seems more reasonable to disallow multiple live ranges for the output. Once an output is written it can only be shadowed in a nested block.) That's the bad news. Now for some good news: One lovely property Mu the language has at the moment is that live ranges are guaranteed to be linear segments of code. We don't need to analyze loop-carried dependences. This means that we can decide whether a variable is live purely by scanning later statements for its use. (Defining 'register use' is slightly non-trivial; primitives must somehow specify when they read their output register.) So we don't actually need to worry about a loop reading a register with one type and writing to another type at the end of an iteration. The only way that can happen is if the write at the end was to a local variable, and we're guaranteeing that local variables will be reclaimed at the end of the iteration. So, the sequence of tasks: a) compute register liveness b1) verify that all register variables used at any point in a program are always the topmost use of that register. b2) decide whether to spill/shadow, clobber or flag an error. There's still the open question of where to attach liveness state. It can't be on a var, because liveness varies by use of the var. It can't be on a statement because we may want to know the liveness of variables not referenced in a given statement. Conceptually we want a matrix of locals x stmts (flattened). But I think it's simpler than that. We just want to know liveness at the time of variable declarations. A new register variable can be in one of three states w.r.t. its previous definition: either it's shadowing it, or it can clobber it, or there's a conflict and we need to raise an error. I think we can compute this information for each variable definition by an analysis similar to existing ones, maintaining a stack of variable definitions. The major difference is that we don't pop variables when a block ends. Details to be worked out. But when we do I hope to get these pending tests passing.
*	6081 - ctags for .mu files	Kartik Agaram	2020-03-05	6	-14/+19
\|
*	6080	Kartik Agaram	2020-03-05	6	-9369/+9711
\|
*	6079 - optimize register spills	Kartik Agaram	2020-03-05	2	-23/+211
\| \| \| \| \|	The second var to the same register in a block doesn't need to spill. We're never going to restore the var it's shadowing.
*	6078 - highlight hex literals in Vim	Kartik Agaram	2020-03-02	3	-3/+3
\|
*	6077	Kartik Agaram	2020-03-02	5	-54/+55
\|
*	6076	Kartik Agaram	2020-03-02	1	-20/+21
\|
*	6075	Kartik Agaram	2020-03-02	1	-1/+2
\|
*	6074	Kartik Agaram	2020-02-29	2	-21/+25
\|
*	6073	Kartik Agaram	2020-02-29	2	-11/+28
\|
*	6072	Kartik Agaram	2020-02-29	2	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	I just did an experiment, and test_apps is spending over half its time (95 seconds that could be 40) redundantly clearing every stream before every test. And some of those streams are _large_; the translators for SubX and Mu use streams sizes pessimistically for the largest possible program they can handle (1MB per segment). A single test (the one in assort.subx) spends 24 seconds initializing 2 1MB-segments before processing a dozen lines of text in the blink of an eye. I'm not going to speed this up. But good to know.
*	6071 - array indexing for non-int power-of-2 types	Kartik Agaram	2020-02-29	2	-11/+163
\|