Performance | Typst | Page 4

sturdy sequoia Jan 24, 2024, 5:28 PM

#

which I guess could be nice enough:

#[instruction(output = Register | Local)]
pub struct Add {
    lhs: Register | Constant | Local | Global | Param | Capture,
    rhs: Register | Constant | Local | Global | Param | Capture,
}

glad urchin Jan 24, 2024, 5:28 PM

#

sturdy sequoia which I guess could be nice enough: ```rs #[instruction(output = Register | Loca...

That's a great improvement already

#

I'd go further and give a name to the Register | Local parameter

#

e.g. output=...

sturdy sequoia Jan 24, 2024, 5:29 PM

#

glad urchin e.g. output=...

I guess

glad urchin Jan 24, 2024, 5:29 PM

#

btw what do the field types become in the end?

sturdy sequoia Jan 24, 2024, 5:29 PM

#

glad urchin btw what do the field types become in the end?

the struct wouldn't exist as is 😂

glad urchin Jan 24, 2024, 5:30 PM

#

Huh...

#

Ok

#

Which fields would it have and with which types?

sturdy sequoia Jan 24, 2024, 5:30 PM

#

it would be more like

struct Add<'a>(&'a mut Executor);

glad urchin Jan 24, 2024, 5:30 PM

#

Ok

#

well, we did that in the past, so I guess we can do it again now 😂

left night Jan 24, 2024, 5:34 PM

#

hard to say. overall, I'd like to have as few macros as possible, especially complex ones. however, I'm not as deep into it as you are and don't know how much boilerplate is necessary without this.

sturdy sequoia Jan 24, 2024, 5:35 PM

#

It would also produce:

struct AddBuilder<'a>(&'a mut Compiler);

impl<'a> AddBuilder<'a> {
  pub fn lhs(mut self, lhs: impl Into<LhsArg>) -> Self {
      ...
  }
}

enum LhsArg {
  Const(Value),
  Register(Register),
  ...
}

impl Into<...> for LhsArg { ... }

#

So allowing me to do the following:

#

impl Compile for ast::Add<'_> {
  fn compile(&self, compiler: &Compiler) {
    let lhs = self.lhs.compile(compiler)?;
    let rhs = self.rhs.compile(compiler)?;
    compiler.add().lhs(lhs).rhs(rhs)
  }
}

glad urchin Jan 24, 2024, 5:37 PM

#

Tbh

#

I think you can do this with macro_rules

sturdy sequoia Jan 24, 2024, 5:37 PM

#

left night hard to say. overall, I'd like to have as few macros as possible, especially com...

There's unfortunately a lot of boilerplate atm

sturdy sequoia Jan 24, 2024, 5:38 PM

#

glad urchin I think you can do this with macro_rules

unfortunately I cannot easily create types with macro_rules 😐

#

at least not things like LhsArg, etc.

glad urchin Jan 24, 2024, 5:38 PM

#

You'd declare them explicitly
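A minimal sketch of that suggestion, assuming the argument enum's name is passed to the macro explicitly. The `arg_enum!` macro, the `describe` helper, and the `Register`/`Value` stand-ins are hypothetical and not taken from the VM branch:

```rust
// Hypothetical stand-ins for the VM's operand types.
#[derive(Debug, Clone, Copy, PartialEq)]
pub struct Register(pub u16);

#[derive(Debug, Clone, Copy, PartialEq)]
pub struct Value(pub i64);

// `macro_rules!` cannot derive a new identifier such as `LhsArg` from `lhs`,
// but it can define a type whose name is written out at the call site.
macro_rules! arg_enum {
    ($name:ident { $($variant:ident($inner:ty)),+ $(,)? }) => {
        #[derive(Debug, Clone, Copy, PartialEq)]
        pub enum $name {
            $($variant($inner),)+
        }

        // One `From` impl per variant, so builder methods can accept
        // `impl Into<$name>`.
        $(
            impl From<$inner> for $name {
                fn from(inner: $inner) -> Self {
                    $name::$variant(inner)
                }
            }
        )+
    };
}

arg_enum!(LhsArg {
    Const(Value),
    Register(Register),
});

// Hypothetical helper standing in for a builder method such as `lhs(...)`.
fn describe(arg: impl Into<LhsArg>) -> LhsArg {
    arg.into()
}

fn main() {
    println!("{:?}", describe(Register(3)));
    println!("{:?}", describe(Value(42)));
}
```

The trade-off is that every generated type name has to be spelled out by hand, which is the boilerplate a proc macro would avoid.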

left night Jan 24, 2024, 5:38 PM

#

is the code anywhere public?

sturdy sequoia Jan 24, 2024, 5:38 PM

#

left night is the code anywhere public?

not yet

#

it's too big of a mess 😂

left night Jan 24, 2024, 5:39 PM

#

do other VMs written in Rust use macros for similar things? and either way (yes/no), could we take inspiration from them?

sturdy sequoia Jan 24, 2024, 5:40 PM

#

left night do other VMs written in Rust use macros for similar things? and either way (yes/...

not that I've seen but they're usually fairly simple

#

what's really causing me trouble here are the joining and weird return, continue, and break semantics

#

Those are a pain point atm

#

This often leads to complex code. Other things, such as short-circuiting for `and` and `or`, are also a bit painful 😭

sturdy sequoia Jan 24, 2024, 6:22 PM

#

Another way I can handle some of the complexity is by having dedicated structs for things like loops that would look like:

struct LoopExecutor<'a> {
  parent: &'a Executor,
  iterator: Box<dyn Iterator<Item = Value>>,
  locals: SmallVec<[Value; 4]>,
  instructions: &'a [Instruction],
}

And then handle all of the complexity in a nicer way overall

#

I might just do that

#

Would likely be slower mind you

feral imp Jan 24, 2024, 6:40 PM

#

But maybe working with the type-system is nicer?

untold turret Jan 25, 2024, 12:53 AM

#

sturdy sequoia Another way I can handle some of the complexity is by having dedicated structs f...

A single executor would bring more compact code. for the dyn iterator, it could be replaced by a slice or enum.

#

~~I think a block abstraction is needed.~~ 🤔 I note that you have both iterator and instructions, but the iterator does not statically belong to a block. I think the iterator is the input of a block:

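struct Block {
  // Inline capacity of 4 mirrors `locals` in LoopExecutor above; it is an assumption.
  iterator: smallvec::SmallVec<[Value; 4]>,
}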

Also, make it use no Box and no dyn.
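One way to read the "slice or enum" suggestion is sketched below, assuming the set of iterable sources is closed. `LoopIter`, `collect_all`, and the simplified `Value` are hypothetical names for illustration only:

```rust
// Hypothetical stand-in for the VM's value type.
#[derive(Debug, Clone, PartialEq)]
pub enum Value {
    Int(i64),
    Str(String),
}

/// A closed set of iteration sources, dispatched with a `match` instead of
/// through a `Box<dyn Iterator>` vtable, so no heap allocation is needed.
pub enum LoopIter<'a> {
    /// Iterates over a borrowed slice of values (for example an array).
    Slice(std::slice::Iter<'a, Value>),
    /// Iterates over an integer range (for example `range(5)`).
    Range(std::ops::Range<i64>),
}

impl<'a> Iterator for LoopIter<'a> {
    type Item = Value;

    fn next(&mut self) -> Option<Value> {
        match self {
            LoopIter::Slice(iter) => iter.next().cloned(),
            LoopIter::Range(range) => range.next().map(Value::Int),
        }
    }
}

/// Drains the iterator into a vector, as a loop body would consume it.
fn collect_all(iter: LoopIter<'_>) -> Vec<Value> {
    iter.collect()
}

fn main() {
    let values = [Value::Str("a".into()), Value::Int(7)];
    println!("{:?}", collect_all(LoopIter::Slice(values.iter())));
    println!("{:?}", collect_all(LoopIter::Range(0..3)));
}
```

Each new kind of iterable needs its own variant, which is the cost of avoiding `dyn`.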

feral imp Jan 25, 2024, 6:18 AM

#

A serious question about performance. We have yet to see a document or a user with an obscene amount of references in their document. What aspect has an influence on this? Eval? Comemo? Something else?

sly pecan Jan 25, 2024, 6:28 AM

#

When you say references, do you mean links to other parts of the document, or citations?

feral imp Jan 25, 2024, 6:39 AM

#

sly pecan When you say references, do you mean links to other parts of the document, or ci...

Citations! Sorry 🙏

untold turret Jan 25, 2024, 6:47 AM

#

feral imp A serious question about performance. We have yet to see a document or a user wi...

Label queries were already optimized by dherse, but there should be room for further improvement.

feral imp Jan 25, 2024, 6:51 AM

#

untold turret Label queries were already optimized by dherse, but there should be room for further im...

Everything can always be improved. I was thinking if this was related to it. This being the VM work. I'm not requesting anything here.

untold turret Jan 25, 2024, 6:54 AM

#

feral imp Everything can always be improved. I was thinking if this was related to it. Thi...

From my knowledge, introspection or introspector handles queries, and this part isn't quite incremental. https://github.com/typst/typst/tree/1612913f8f195248059156a7ae1a08a31c7f5016/crates/typst/src/introspection

sturdy sequoia Jan 25, 2024, 6:54 AM

#

feral imp Everything can always be improved. I was thinking if this was related to it. Thi...

The VM won’t improve query performance outside of any filtering that you might be doing in your code, I am actually very satisfied with the performance of introspection at this point

feral imp Jan 25, 2024, 6:57 AM

#

Good! That's what I just wanted to hear. 😅

sturdy sequoia Jan 25, 2024, 5:43 PM

#

feral imp Good! That's what I just wanted to hear. 😅

I think there might be ways in the future to improve it further, but not without adding quite a bit of complexity, too much to justify it right now when eval has become such a big bottleneck

#

To be clear I'm not shitting on eval, it didn't use to be the bottleneck; it has become (at least one of) the bottlenecks thanks to the content rework and various other optimizations such as Deferred<T>

#

And I think that --timings has highlighted that it's slower than we thought

glossy shore Jan 26, 2024, 1:36 PM

#

Would hover tooltips work with the VM?

left night Jan 26, 2024, 1:49 PM

#

For reference: Hover tooltips are powered by these lines of code.

glossy shore Jan 26, 2024, 2:00 PM

#

oh cool

sturdy sequoia Jan 26, 2024, 2:08 PM

#

glossy shore oh cool

it should, right now it wouldn't work (because I just haven't set up the tracer)

#

But my goal is pretty much feature parity

#

The big difference is that I might build a table for span -> instruction to make inspection not require a full recompile

#

but that's less clear to me yet

left night Jan 26, 2024, 2:10 PM

#

sturdy sequoia The big difference is that I might build a table for `span -> instruction` to ma...

why would it require a full recompile?

#

and wdym with recompile? bytecode recompile?

sturdy sequoia Jan 26, 2024, 2:10 PM

#

left night why would it require a full recompile?

well if the introspection is done at compile time (when calling Compile for ast::Expr) then it would require recompiling

sturdy sequoia Jan 26, 2024, 2:11 PM

#

left night and wdym with recompile? bytecode recompile?

the bytecode yes

left night Jan 26, 2024, 2:11 PM

#

okay

sturdy sequoia Jan 26, 2024, 2:11 PM

#

left night okay

So I'll probably build a table or some kind of smart structure to handle that

#

Probably just HashMap<Span, usize>
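A rough sketch of such a table, assuming the `usize` is a byte offset into the bytecode. The `Span` here is a hypothetical stand-in for Typst's real span type, and `CompiledCode` is an invented name:

```rust
use std::collections::HashMap;

// Hypothetical stand-in for Typst's `Span`.
#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash)]
pub struct Span(pub u64);

/// Bytecode plus a side table from source spans to instruction offsets.
#[derive(Default)]
pub struct CompiledCode {
    pub bytes: Vec<u8>,
    pub span_to_offset: HashMap<Span, usize>,
}

impl CompiledCode {
    /// Appends an encoded instruction and records where it starts.
    /// If several instructions share a span, the last one wins.
    pub fn push(&mut self, span: Span, encoded: &[u8]) {
        self.span_to_offset.insert(span, self.bytes.len());
        self.bytes.extend_from_slice(encoded);
    }

    /// Looks up the instruction for an inspected span without recompiling.
    pub fn offset_of(&self, span: Span) -> Option<usize> {
        self.span_to_offset.get(&span).copied()
    }
}

fn main() {
    let mut code = CompiledCode::default();
    code.push(Span(10), &[0x01, 0, 1]);
    code.push(Span(20), &[0x00]);
    println!("{:?}", code.offset_of(Span(20)));
}
```

With a table like this, changing the inspected span only changes a lookup key, so the bytecode itself can stay cached.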

untold turret Jan 26, 2024, 2:11 PM

#

sturdy sequoia well if the introspection is done at compile time (when calling `Compile for ast...

Does it change existing code? If not, it is probably not a recompile.

sturdy sequoia Jan 26, 2024, 2:12 PM

#

untold turret Does it change existing code? If not, it is probably not a recompile.

it's not that it would change existing code

#

it's that it's easiest to do in the Compile for ast::Expr implementation

#

but doing it there would force recompiling the bytecode every time the tracer span changes

#

And doing it in the VM will require some kind of smarts

feral imp Jan 26, 2024, 10:48 PM

#

Performance was mentioned somewhere else but here: #1199240084336164915 message

sturdy sequoia Jan 27, 2024, 1:45 PM

#

Now that I have done my exam

#

It's time to get back to the VM

#

by...

#

completely re-writing it 😄

sly pecan Jan 27, 2024, 1:55 PM

#

sturdy sequoia completely re-writing it 😄

Why are you doing this to yourself

sturdy sequoia Jan 27, 2024, 2:04 PM

#

sly pecan Why are you doing this to yourself

I don't know 💀

sturdy sequoia Jan 27, 2024, 2:30 PM

#

What do y'all think of this one:

opcodes! {
    Nop = 0x0000,
    Add -> Writeable => {
        /// The left-hand side of the addition.
        lhs: Readable,
        /// The right-hand side of the addition.
        rhs: Readable,
    } = 0x0001,
}

#

It only creates the "readable" instructions: not the builders for the compiler, only the readers for the VM

glossy shore Jan 27, 2024, 2:34 PM

#

I'm confused

#

what does this expand into

feral imp Jan 27, 2024, 2:35 PM

#

sly pecan Why are you doing this to yourself

I wonder, how can you do this to yourself?
I want to learn this power.

sturdy sequoia Jan 27, 2024, 3:31 PM

#

glossy shore what does this expand into

It expands into a big enum from which opcodes are created and small structs, here is an example:

#

// Recursive expansion of opcodes! macro
// ======================================

#[doc = r" No operation."]
#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash, Pod, Zeroable)]
#[repr(C)]
pub struct Nop {}

#[doc = r" Adds two values together."]
#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash, Pod, Zeroable)]
#[repr(C)]
pub struct Add {
    #[doc = r" The left-hand side of the addition."]
    pub lhs: Readable,
    #[doc = r" The right-hand side of the addition."]
    pub rhs: Readable,
    #[doc = "The output of the instruction."]
    pub out: Writeable,
}
#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash)]
pub enum Opcode {
    #[doc = r" No operation."]
    Nop,
    #[doc = r" Adds two values together."]
    Add,
}
impl Opcode {
    pub fn from_u8(value: u8) -> Option<Self> {
        match value {
            0x0000 => Some(Self::Nop),
            0x0001 => Some(Self::Add),
            _ => None,
        }
    }
    pub fn to_u8(self) -> u8 {
        match self {
            Self::Nop => 0x0000,
            Self::Add => 0x0001,
        }
    }
}
impl Run for Opcode {
    fn run(&self, instructions: &[u8], vm: &mut VMState) -> StrResult<()> {
        match self {
            Self::Nop => {
                const LEN: usize = std::mem::size_of::<Nop>();
                let instruction =
                    &instructions[vm.instruction_pointer..vm.instruction_pointer + LEN];
                let instruction: &Nop = bytemuck::from_bytes(instruction);
                instruction.run(instructions, vm)?;
                vm.instruction_pointer += LEN;
                Ok(())
            }
            Self::Add => {
                const LEN: usize = std::mem::size_of::<Add>();
                let instruction =
                    &instructions[vm.instruction_pointer..vm.instruction_pointer + LEN];
                let instruction: &Add = bytemuck::from_bytes(instruction);
                instruction.run(instructions, vm)?;
                vm.instruction_pointer += LEN;
                Ok(())
            }
        }
    }
}

#

And then I manually impl the Run for each opcode:

#

impl Run for Add {
    fn run(
        &self,
        _: &[u8],
        vm: &mut VMState
    ) -> StrResult<()> {
        let lhs = vm.read(self.lhs)?;
        let rhs = vm.read(self.rhs)?;

        vm.write_one(self.out, ops::add(lhs, rhs)?)?;

        Ok(())
    }
}

#

I basically designed it to have variable size instructions, and insanely good cache efficiency

#

or at least as good as I can get it!
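To illustrate what variable-size instructions in a flat byte buffer can look like, here is a simplified sketch that uses only the standard library. It assumes a one-byte opcode followed by one-byte register operands; the real expansion above reads whole structs with `bytemuck` instead, and the `Vm` type and its register file are hypothetical:

```rust
/// Opcodes for the sketch; each instruction starts with one opcode byte.
const NOP: u8 = 0x00; // no operands, 1 byte in total
const ADD: u8 = 0x01; // lhs, rhs, out register indices, 4 bytes in total

pub struct Vm {
    pub registers: [i64; 4],
    pub instruction_pointer: usize,
}

impl Vm {
    /// Runs until the end of `code`, advancing by each instruction's own size.
    pub fn run(&mut self, code: &[u8]) -> Result<(), String> {
        while self.instruction_pointer < code.len() {
            let ip = self.instruction_pointer;
            match code[ip] {
                NOP => self.instruction_pointer += 1,
                ADD => {
                    let operands = code
                        .get(ip + 1..ip + 4)
                        .ok_or("truncated add instruction")?;
                    let [lhs, rhs, out] =
                        [operands[0], operands[1], operands[2]].map(usize::from);
                    let sum = self.read(lhs)? + self.read(rhs)?;
                    *self.registers.get_mut(out).ok_or("bad output register")? = sum;
                    self.instruction_pointer += 4;
                }
                other => return Err(format!("unknown opcode {other:#04x}")),
            }
        }
        Ok(())
    }

    /// Reads a register, failing on an out-of-range index.
    fn read(&self, index: usize) -> Result<i64, String> {
        self.registers
            .get(index)
            .copied()
            .ok_or_else(|| "bad input register".to_string())
    }
}

fn main() {
    let mut vm = Vm { registers: [2, 40, 0, 0], instruction_pointer: 0 };
    vm.run(&[NOP, ADD, 0, 1, 2, NOP]).unwrap();
    println!("{:?}", vm.registers);
}
```

Because short instructions take fewer bytes, more of them fit in a cache line than with a fixed-width encoding.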

#

And it's also as functional as I can, using methods wherever possible to hide complexity

tight glade Jan 27, 2024, 3:54 PM

#

Well this new macro convinces me! 🔥

sturdy sequoia Jan 27, 2024, 3:54 PM

#

tight glade Well this new macro convinces me! 🔥

❤️

#

The whole point is to make writing instructions as easy as possible

tight glade Jan 27, 2024, 3:54 PM

#

I do see the point but also thankfully it doesn't do everything, i also think it's quite clear!

sturdy sequoia Jan 27, 2024, 3:55 PM

#

especially in the future when other people are writing them

tight glade Jan 27, 2024, 3:55 PM

#

sturdy sequoia especially in the future when other people are writing them

Exactly 😁

glossy shore Jan 27, 2024, 3:56 PM

#

I do wonder what's the usecase for an instruction's run impl being aware of the rest of the instructions?

sturdy sequoia Jan 27, 2024, 3:57 PM

#

glossy shore I do wonder what's the usecase for an instruction's `run` impl being aware of th...

I plan on making any kind of calls or scopes be different VMs (VMs are basically free to create)

#

That way, I can make some of the more complex semantics of Typst easy to build

#

such as join on continue, join on break, etc.

#

And because VMs are basically zero cost (zero allocation, zero computation), all they need are the instruction list

glossy shore Jan 27, 2024, 4:12 PM

#

sturdy sequoia I plan on making any kind of calls or scopes be different VMs (VMs are basically...

So call function_value would fetch the function's value, which would then also need instructions to execute?

sturdy sequoia Jan 27, 2024, 4:17 PM

#

glossy shore So `call function_value` would fetch the function's value, which would then also...

no, a call would call a completely different set of instructions

#

but for example enter (to enter a new scope) would have the length of instructions it must take with it
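A sketch of how an `enter` that carries its own length could spawn a child scope over a borrowed slice. The `Instr` enum and `Scope` type are hypothetical simplifications: the real VM works on bytes and distinguishes loops, calls, and plain scopes, which this sketch does not:

```rust
#[derive(Debug, Clone, Copy)]
pub enum Instr {
    /// Appends a string to the current scope's joined output.
    Push(&'static str),
    /// Runs the next `len` instructions in a child scope, then joins its output.
    Enter { len: usize },
    /// Stops the current scope early.
    Break,
}

/// A scope is just a borrowed instruction slice plus its joined output,
/// so creating one copies no instructions.
pub struct Scope<'a> {
    instructions: &'a [Instr],
    joined: String,
}

impl<'a> Scope<'a> {
    pub fn new(instructions: &'a [Instr]) -> Self {
        Self { instructions, joined: String::new() }
    }

    /// Returns the joined output and whether a `Break` was hit.
    pub fn run(mut self) -> (String, bool) {
        let mut ip = 0;
        while ip < self.instructions.len() {
            match self.instructions[ip] {
                Instr::Push(text) => {
                    self.joined.push_str(text);
                    ip += 1;
                }
                Instr::Enter { len } => {
                    let end = (ip + 1 + len).min(self.instructions.len());
                    let body = &self.instructions[ip + 1..end];
                    let (output, broke) = Scope::new(body).run();
                    // Join on break: keep what the child produced so far.
                    self.joined.push_str(&output);
                    if broke {
                        return (self.joined, true);
                    }
                    ip = end;
                }
                Instr::Break => return (self.joined, true),
            }
        }
        (self.joined, false)
    }
}

fn main() {
    let program = [
        Instr::Push("A"),
        Instr::Enter { len: 2 },
        Instr::Push("B"),
        Instr::Break,
        Instr::Push("C"),
    ];
    println!("{:?}", Scope::new(&program).run());
}
```

The parent skips over the child's instructions by jumping to `end`, which is why the length has to be encoded in the `enter` instruction.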

glossy shore Jan 27, 2024, 4:18 PM

#

oh odd

#

why should the compiled bytecode be aware of scopes

sturdy sequoia Jan 27, 2024, 4:18 PM

#

It's really just to compensate for some weird semantics around return, break, and continue

glossy shore Jan 27, 2024, 4:19 PM

#

I really hope there's a better solution

sturdy sequoia Jan 27, 2024, 4:20 PM

#

glossy shore I really hope there's a better solution

I don't think there is, dealing with joins any other way is super error prone in my attempts

glossy shore Jan 27, 2024, 4:25 PM

#

I just don't like statically compiling only to use dynamic things

sturdy sequoia Jan 27, 2024, 4:28 PM

#

glossy shore I just don't like statically compiling only to use dynamic things

I understand that, but the joining semantic is what it is and removing it would be pretty bad 😐

sturdy sequoia Jan 27, 2024, 5:27 PM

#

BTW @glossy shore as I am writing the code for scoping, although I'm struggling a bit with lifetimes, it does lead to a very simple system which is easy to understand 🙂

sturdy sequoia Jan 27, 2024, 6:20 PM

#

@left night how open are you with a tiny itsy bitsy bit of unsafe? 😂

#

I swear it's the first time I've ever struggled with lifetimes this much

glossy shore Jan 27, 2024, 6:30 PM

#

I was gonna ask whether it's Rust lifetimes or Typst lifetimes

sly pecan Jan 27, 2024, 6:30 PM

#

hold-my-beer

glossy shore Jan 27, 2024, 6:31 PM

#

you can try using refcell and if it doesn't panic maybe it's fine

sturdy sequoia Jan 27, 2024, 6:31 PM

#

glossy shore you can try using refcell and if it doesn't panic maybe it's fine

I mean I have to find a solution, but this will work in the meantime 😂

tight glade Jan 27, 2024, 8:43 PM

#

I'm so curious to see what it looks like!

sturdy sequoia Jan 27, 2024, 8:59 PM

#

tight glade I'm so curious to see what it looks like!

Right now, a big advantage of the macros and the traits I have made is that it's significantly shorter

sturdy sequoia Jan 27, 2024, 10:45 PM

#

The new VM is almost fully implemented

#

and boi oh boi was it easier to implement

#

Even the more complex stuff just goes a lot more smoothly

sly pecan Jan 27, 2024, 10:52 PM

#

sturdy sequoia and boi oh boi was it easier to implement

Does it go vroom vroom though?

tight glade Jan 27, 2024, 11:04 PM

#

sturdy sequoia Even the more complex stuff just goes a lot more smoothly

nice!

tight glade Jan 27, 2024, 11:04 PM

#

sly pecan Does it go vroom vroom though?

fair question 😎

sturdy sequoia Jan 27, 2024, 11:17 PM

#

sly pecan Does it go vroom vroom though?

Your typst documents will compile so fast your fans won't have time to go vroom vroom 😦

cunning wadi Jan 27, 2024, 11:28 PM

#

I suggest a little bit of code at the end of typst's main function:

```rs
loop {
    std::thread::spawn(|| {
        let mut x = 0.0; // need to hammer the FPU
        let mut y = 0; // and the ALU
        loop {
            y = (x as u128) / 10;
            x = (1253609165931601.5 / y.sqrt()) as u128;
        }
    });
}
```

#

TODO: write stuff to disk, use GPU, heat up RAM

cunning wadi Jan 27, 2024, 11:31 PM

#

cunning wadi I suggest a little bit of code at the end of typst's main function ```rs loop { ...

I notice that this bit of code will simply produce NaNs

sturdy sequoia Jan 28, 2024, 12:08 AM

#

There is only one major thing missing: break, continue, and return

#

but thankfully it's relatively easy to do

sly pecan Jan 28, 2024, 12:18 AM

#

What about goto?

sturdy sequoia Jan 28, 2024, 12:19 AM

#

sly pecan What about goto?

it's all easy to do 😎

feral imp Jan 28, 2024, 1:09 AM

#

west light Jan 28, 2024, 2:58 AM

#

cunning wadi TODO: write stuff to disk, use GPU, heat up RAM

Now laptops double as electric heaters.

sturdy sequoia Jan 28, 2024, 1:53 PM

#

west light Now laptops double as electric heaters.

Fun fact, on a laptop, you might run out of bandwidth in the integrated NoC or ring-bus/whatever in your CPU before you manage to load everything all at once 😂

#

On a Desktop too BTW, with the first gen Epyc server CPUs, you could easily overload the infinity fabric and cause bottlenecks when using all of the PCIe lanes at once 😂

feral imp Jan 28, 2024, 1:55 PM

#

So.. is this what is happening with your impl right now or is it just an anecdote?

#

crashing the CPU would be a bit much for typst as a risk 😛

sturdy sequoia Jan 28, 2024, 1:58 PM

#

feral imp So.. is this what is happening with your impl right now or is it just an anecdote...

just an anecdote, overloading your CPU's bus is... hard

#

especially on a single thread 😂

sturdy sequoia Jan 28, 2024, 2:35 PM

#

Ok, only one thing missing and then I can test: including other files

#

Oops, I forgot I also need a compiler 😂

sturdy sequoia Jan 28, 2024, 3:42 PM

#

Ok, the VM is done (but untested) now it's time for the new compiler

#

But I'll probably take a long break now

tight glade Jan 28, 2024, 3:49 PM

#

Break good, it is Saturday ❤️

cunning wadi Jan 28, 2024, 3:56 PM

#

tight glade Break good, it is Saturday ❤️

Sunday*

tight glade Jan 28, 2024, 4:52 PM

#

It's Sunday already???😭😭😭

sturdy sequoia Jan 28, 2024, 5:33 PM

#

tight glade It's Sunday already???😭😭😭

ikr

#

Now I write the new compiler 😎

sly pecan Jan 28, 2024, 5:50 PM

#

sturdy sequoia Now I write the new compiler 😎

I thought long break meant a few weeks

atomic violet Jan 28, 2024, 5:55 PM

#

Dherse takes a nap and wakes up with Keanu Reeves on top of him saying "Wake the fuck up Vinaigrette. We have a compiler to write" and the heavy metal starts playing

feral imp Jan 28, 2024, 5:56 PM

#

atomic violet Dherse takes a nap and wakes up with Keanu Reeves on top of him saying "Wake the...

I've seen it...

sturdy sequoia Jan 28, 2024, 6:33 PM

#

atomic violet Dherse takes a nap and wakes up with Keanu Reeves on top of him saying "Wake the...

I was actually taking a spa 😄

#

The new compiler is 🔥 mind you

#

it's not done (obviously)

#

but it's taking shape and is much much nicer

sturdy sequoia Jan 28, 2024, 10:56 PM

#

This is the new functional instruction approach

#

I think it's overall nicer than the old one

tight glade Jan 28, 2024, 10:59 PM

#

sturdy sequoia ikr

I'm so excited 😁

tight glade Jan 28, 2024, 11:00 PM

#

atomic violet Dherse takes a nap and wakes up with Keanu Reeves on top of him saying "Wake the...

Accurate 😂

tight glade Jan 28, 2024, 11:00 PM

#

sturdy sequoia This is the new functional instruction approach

Looks pretty!

sly pecan Jan 29, 2024, 11:37 PM

#

@sturdy sequoia did you miss an optimization? https://github.com/typst/typst/pull/3297

GitHub

Remove an unnecessary clone in loop evaluation by Leedehai · Pull R...

This patch also renamed ForLoop::iter() to ForLoop::iterable() for clarity. The unnecessary clone might result from thinking "iter" as "iterator", which is normally cheap to clo...

#

😱

untold turret Jan 30, 2024, 12:19 AM

#

this is a clone of Expr<'a>, which is possibly very very cheap. It copies 24 bytes of data for each "loop in typst", and makes your CPU 0.0001% hotter.
I think it is more of a simple refactor than a performance fix.🐱

glad urchin Jan 30, 2024, 2:22 AM

#

life is too short to waste 0.0001 ms

#

sorry

#

LGTM

untold turret Jan 30, 2024, 2:40 AM

#

glad urchin life is too short to waste 0.0001 ms

Great perfectionism to performance.

glossy shore Jan 30, 2024, 7:41 AM

#

sturdy sequoia This is the new functional instruction approach

couldn't this be abstracted away just a bit with a self.instruction etc.?

sturdy sequoia Jan 30, 2024, 9:24 AM

#

glossy shore couldn't this be abstracted away just a bit with a `self.instruction` etc.?

Probably

glossy shore Jan 30, 2024, 12:41 PM

#

I wanna go over your changes, but it sounds daunting

sturdy sequoia Jan 30, 2024, 12:59 PM

#

glossy shore I wanna go over your changes, but it sounds daunting

I'll be publishing the new compiler soon

#

it's not tested yet

#

but the architecture is pretty much done imo

#

@left night is there a way to turn a Source into a Span?

#

(it's for timing annotations)

#

Other than doing source.root().span() that is

left night Jan 30, 2024, 1:01 PM

#

No, that wouldn't make sense. A span always identifies a syntax node.

sturdy sequoia Jan 30, 2024, 1:29 PM

#

For those keeping tabs, that's about 10k lines total now that both the compiler and VM are implemented (still untested mind you)

feral imp Jan 30, 2024, 2:36 PM

#

sturdy sequoia For those keeping tabs, that's about 10k lines total now that both the compiler a...

Would still be epic if you get 4x compilation time gain. I'm very excited.

#

But maybe that's the wrong attitude.. Because this is meant to make typst language faster, right? Like just overall.

sturdy sequoia Jan 30, 2024, 9:13 PM

#

The new VM can already do more than the old one :sunglassed_crying:

proven umbra Jan 30, 2024, 9:43 PM

#

sturdy sequoia The new VM can already do more than the old one <:sunglassed_crying:119748576618...

👍 For example?

sturdy sequoia Jan 30, 2024, 9:52 PM

#

proven umbra 👍 For example?

The old one being the one I was working on last week 😂

#

It has perfect handling of show set rules, of joining, etc. already

glad urchin Jan 30, 2024, 10:10 PM

#

sturdy sequoia It has perfect handling of show set rules, of joining, etc. already

have you rebased?

#

👀

sturdy sequoia Jan 30, 2024, 10:25 PM

#

glad urchin have you rebased?

not yet 😭

sturdy sequoia Jan 30, 2024, 10:26 PM

#

glad urchin have you rebased?

https://github.com/Dherse/typst/tree/bytecode-vm

#

You can find the latest code here

#

WARNING: THIS IS STILL VERY BUGGY

#

AND STILL PANICS A LOT

sturdy sequoia Jan 31, 2024, 12:31 AM

#

Ok, loops work correctly, so do conditions, making good progress

#

unfortunately I am out of time for this week 😦

glad urchin Jan 31, 2024, 12:32 AM

#

sturdy sequoia unfortunately I am out of time for this week 😦

i dont believe you

sturdy sequoia Jan 31, 2024, 12:35 AM

#

@left night I don't think the following makes sense:

#let identity(x) = x
#let out = for i in range(5) {
  "A"
  identity({
    "B"
    break
  })
  "C"
}

#test(out, "AB")

Essentially we are breaking somewhere that is supposed to be an argument, which leads my VM to fail this test. But I think it has the correct behaviour here by not joining the "B" with the "A". This is because the VM sees the block as preparation for an argument and doesn't join it on exit 😐

#

I could emulate this behaviour, but I think this use case should be rare enough to not be an issue?

glad urchin Jan 31, 2024, 12:37 AM

#

uhh

#

what is the current behavior?

sturdy sequoia Jan 31, 2024, 12:37 AM

#

it will join a and b and produce "AB"

glad urchin Jan 31, 2024, 12:37 AM

#

okay

#

i disagree with that

#

😂

sturdy sequoia Jan 31, 2024, 12:37 AM

#

I added the assertion

glad urchin Jan 31, 2024, 12:37 AM

#

wait

sturdy sequoia Jan 31, 2024, 12:37 AM

#

glad urchin i disagree with that

with the current behaviour?

glad urchin Jan 31, 2024, 12:37 AM

#

I missed the identity

#

lol

#

thought it was some random function and it was joining with the thing inside the block

sturdy sequoia Jan 31, 2024, 12:38 AM

#

if it was just a block, not an argument, it would work btw

sturdy sequoia Jan 31, 2024, 12:38 AM

#

glad urchin thought it was some random function and it was joining with the thing inside the...

well currently it kinda does 💀

#

but not in my VM

glad urchin Jan 31, 2024, 12:38 AM

#

sturdy sequoia well currently it kinda does 💀

it's joining with the output of identity

#

if i understood correctly

#

it breaks as soon as identity finishes

sturdy sequoia Jan 31, 2024, 12:39 AM

#

glad urchin if i understood correctly

yes because control flow can go "through" a function

#

which I really don't think should happen

glad urchin Jan 31, 2024, 12:39 AM

#

then i think we have to be careful here

#

cuz that seems to be pretty fundamental

#

other larger cases could subtly break

sturdy sequoia Jan 31, 2024, 12:39 AM

#

https://tenor.com/view/crazy-pills-like-gif-19621198

Tenor

sturdy sequoia Jan 31, 2024, 12:39 AM

#

glad urchin other larger cases could subtly break

true

#

But I still find this "transitive control flow" behaviour weird as heck

#

I can emulate it, don't get me wrong

#

but I kinda don't want to 😂

#

It feels so wrong

glad urchin Jan 31, 2024, 12:40 AM

#

i agree it's weird but there's probably some use case we're missing

sturdy sequoia Jan 31, 2024, 12:40 AM

#

probably

#

Hence the ping to @left night

glad urchin Jan 31, 2024, 12:40 AM

#

it's not always obvious when the underlying thing in the compiler is being used, just by looking at the tests

#

i guess you'd have to run them equipped with a debugger

sturdy sequoia Jan 31, 2024, 12:41 AM

#

glad urchin i guess you'd have to run them equipped with a debugger

-------------------------------
Enter { span: Span(925904528655), len: 327, scope: O0, flags: 6, out: Some(Some(R0)) } += 18
 - Enter(327)
Copy { span: Span(4166570378940), value: none, out: J } += 13
Instantiate { span: Span(6712807832736), closure: F0, out: R0 } += 26
Copy { span: Span(31943706238530), value: none, out: J } += 39
Args { span: Span(41723572822423), capacity: 1, out: R3 } += 54
PushArg { span: Span(41955048954585), value: C0, out: R3 } += 67
Call { span: Span(41318489591138), closure: A1, args: R3, flags: 0, out: R2 } += 83
Iter { span: Span(41318489591138), scope: O1, len: 152, iterable: R2, flags: 1, out: Some(Some(R1)) } += 103
 - Enter(152)
JumpLabel(0) += 0
   0 => Enter: Enter { len: 327, scope: O0, flags: 6, out: Some(Some(R0)) } <= 18
   0 => Copy: Copy { value: none, out: J } <= 13
  13 => Instantiate: Instantiate { closure: F0, out: R0 } <= 13
  26 => Copy: Copy { value: none, out: J } <= 13
  39 => Args: Args { capacity: 1, out: R3 } <= 15
  54 => PushArg: PushArg { value: C0, out: R3 } <= 13
  67 => Call: Call { closure: A1, args: R3, flags: 0, out: R2 } <= 16
  83 => Iter: Iter { scope: O1, len: 152, iterable: R2, flags: 1, out: Some(Some(R1)) } <= 20
   0 => Next: Next { out: R0 } <= 11
  11 => Enter: Enter { len: 110, scope: O2, flags: 2, out: Some(Some(J)) } <= 18
   0 => Copy: Copy { value: S0, out: J } <= 13
  13 => Args: Args { capacity: 1, out: R0 } <= 15
  28 => Enter: Enter { len: 22, scope: O3, flags: 2, out: Some(Some(R1)) } <= 18
   0 => Copy: Copy { value: S1, out: J } <= 13
  13 => Break: Break <= 9
  22 => Exit: "B" + 22
 - Writing to R1
 110 => Exit: "A" + 110
 - Writing to J
 152 => Exit: "A" + 152
 255 => Copy: Copy { value: C1, out: J } <= 13
 268 => Args: Args { capacity: 1, out: R2 } <= 15
 283 => Eq: Eq { lhs: R1, rhs: S3, out: R3 } <= 15
 298 => PushArg: PushArg { value: R3, out: R2 } <= 13
 311 => Call: Call { closure: A2, args: R2, flags: 0, out: J } <= 16

#

Oh I do

#

trust me

#

I do have a debugger

glad urchin Jan 31, 2024, 12:41 AM

#

i meant like in the old code

sturdy sequoia Jan 31, 2024, 12:41 AM

#

glad urchin i meant like in the old code

ah right

glad urchin Jan 31, 2024, 12:41 AM

#

to figure out where the transitive break happens

sturdy sequoia Jan 31, 2024, 12:41 AM

#

glad urchin to figure out where the transitive break happens

I know how it happens, I just hate it 😂

glad urchin Jan 31, 2024, 12:41 AM

#

though being able to debug new eval is also cool

sturdy sequoia Jan 31, 2024, 12:41 AM

#

It feels wrong

sturdy sequoia Jan 31, 2024, 12:42 AM

#

glad urchin though being able to debug new eval is also cool

I mean the output is cursed but it helps me debug the weird edge cases

glad urchin Jan 31, 2024, 12:42 AM

#

cant wait to have the equivalent of python's dump thing

sturdy sequoia Jan 31, 2024, 12:42 AM

#

glad urchin cant wait to have the equivalent of python's dump thing

🤔 ?

glad urchin Jan 31, 2024, 12:42 AM

#

you can extract a function's bytecode within python iirc

sturdy sequoia Jan 31, 2024, 12:42 AM

#

ah right

glad urchin Jan 31, 2024, 12:42 AM

#

https://docs.python.org/3/library/dis.html

Python documentation

dis — Disassembler for Python bytecode

Source code: Lib/dis.py The dis module supports the analysis of CPython bytecode by disassembling it. The CPython bytecode which this module takes as an input is defined in the file Include/opcode....

sturdy sequoia Jan 31, 2024, 12:43 AM

#

I mean the bytecode is just a big bunch of bytes

glad urchin Jan 31, 2024, 12:43 AM

#

>>> dis.dis(myfunc)
  2           0 RESUME                   0

  3           2 LOAD_GLOBAL              1 (NULL + len)
             12 LOAD_FAST                0 (alist)
             14 CALL                     1
             22 RETURN_VALUE

#

cant wait to do the same in Typst

#

👍

#

🚀

sturdy sequoia Jan 31, 2024, 12:43 AM

#

🚀

#

Anyway, gn ❤️

glad urchin Jan 31, 2024, 12:44 AM

#

gn

left night Jan 31, 2024, 7:42 AM

#

sturdy sequoia <@311948531835469827> I don't think the following makes sense: ```ts #let identi...

It’s essentially the same thing we discussed before. It’s breaking the inner block, but then still finishing up the expression. The semantics are fairly simple conceptually: Instead of “leave everything right now” it is “leave every code/content block early, before starting to evaluate the next expression in it.”

sturdy sequoia Jan 31, 2024, 7:49 AM

#

left night It’s essentially the same thing we discussed before. It’s breaking the inner blo...

Yes but in this specific case it feels weird to me that it goes through a function call, I’ll try to make that work when I have time to work on it again. Thankfully my new implementation should be able to do it more easily but it won’t be pretty

left night Jan 31, 2024, 7:59 AM

#

sturdy sequoia Yes but in this specific case it feels weird to me that it goes through a functi...

We did have that in previous examples, too, where it called the text function, just like it is calling identity here.

sturdy sequoia Jan 31, 2024, 7:59 AM

#

left night We did have that in previous examples, too, where it called the text function, j...

Hmm

#

I don’t know exactly how I’ll handle that but it’ll be tricky

left night Jan 31, 2024, 8:01 AM

#

Ideally it would arise naturally from you only checking for control flow in between the steps of a block, not in between arbitrary expressions

#

Same as in (break, 1, 2) which can join with other existing arrays.
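A minimal sketch of the "only check for control flow between the steps of a block" idea, assuming a toy tree-walking evaluator. The names `Expr`, `Value`, and `Eval` are hypothetical and not taken from Typst, and joining is simplified to collecting step values into an array. A `break` evaluates to `none` and sets a flag; arrays never look at the flag, blocks check it after each step.

```rust
// Hypothetical mini-language, only to illustrate the semantics discussed above.
#[derive(Debug, Clone, PartialEq)]
enum Value {
    None,
    Int(i64),
    Array(Vec<Value>),
}

enum Expr {
    Int(i64),
    Break,
    Array(Vec<Expr>),
    Block(Vec<Expr>),
}

#[derive(Default)]
struct Eval {
    // Set by `break`; an enclosing loop (not modelled here) would consume it.
    breaking: bool,
}

impl Eval {
    fn eval(&mut self, expr: &Expr) -> Value {
        match expr {
            Expr::Int(n) => Value::Int(*n),
            // `break` is not a real value: it yields `None` and records the flow.
            Expr::Break => {
                self.breaking = true;
                Value::None
            }
            // No flow check between array items: the expression is finished.
            Expr::Array(items) => Value::Array(items.iter().map(|e| self.eval(e)).collect()),
            // Flow is checked only between the steps of a block.
            Expr::Block(steps) => {
                let mut joined = Vec::new();
                for step in steps {
                    joined.push(self.eval(step));
                    if self.breaking {
                        break; // leave the block before the next step starts
                    }
                }
                Value::Array(joined)
            }
        }
    }
}

// Evaluates the block `{ (break, 1, 2); 99 }`.
fn demo() -> (Value, bool) {
    let block = Expr::Block(vec![
        Expr::Array(vec![Expr::Break, Expr::Int(1), Expr::Int(2)]),
        Expr::Int(99),
    ]);
    let mut eval = Eval::default();
    let value = eval.eval(&block);
    (value, eval.breaking)
}
```

In this sketch the array `(break, 1, 2)` is still built in full, and the block stops before `99` is evaluated.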

untold turret Jan 31, 2024, 8:08 AM

#

Should both fn(break) and (break,) be rejected by bytecode compiler?

#

#

🐈 what

#

🐈

untold turret Jan 31, 2024, 8:22 AM

#

left night It’s essentially the same thing we discussed before. It’s breaking the inner blo...

ok, I seem to understand what it says. But I don't know what I can do with these semantics.

left night Jan 31, 2024, 8:23 AM

#

It ensures that things join & style properly when having a break nested in code/content blocks with styling in between

#

httpswww.user.tu-berlin.delaurmaedjeprogrammable-markup-language-for-typesetting.pdf.png

#

We could consider allowing control flow keywords only directly in blocks (like set/show currently are). That wouldn't alleviate the need for the semantics (since you can put a block anywhere), but it would reject more code that is likely wrong.

untold turret Jan 31, 2024, 8:42 AM

#

left night We could consider allowing control flow keywords only directly in blocks (like s...

I tried to compare Typst with Rust, as they both treat blocks as expressions (which implies that if, while, and for are expressions too). But I soon found that Typst has join semantics, so we cannot directly borrow Rust's notions of benign and undefined behavior. To leave room for compiler optimizations, now or in the future, we may reject some strange constructs or mark them as having no guaranteed semantics.

#

🤔 break and continue should perhaps never be expressions, but special statements.

untold turret Jan 31, 2024, 9:15 AM

#

left night It’s essentially the same thing we discussed before. It’s breaking the inner blo...

I think I "re-"understand it. during execution, we may have a sequence of unfinished expressions on the stack. The semantics of break or continue will try to goto the next position, which will finalize the expressions stack in order. See:

#x(for {
  y(while {
    z({
      u(1 + v(2; 3, break))
    })
  })
})

There will be a stack of expressions which are waiting for incoming arguments. In the example, the stack is [x, y, z, u, v]. And a break in the expression will finalize v, u, z in order.

sturdy sequoia Jan 31, 2024, 9:26 AM

#

left night Ideally it would arise naturally from you only checking for control flow in betw...

Yes that was my initial approach, to add a Breakpoint opcode on where control flow is processed, or just do it always when coming out of a block

#

My problem is that break isn't a value so it really makes my life terrible

sturdy sequoia Jan 31, 2024, 9:27 AM

#

left night

This usecase makes sense to me, but the VM doesn't support it

#

My problem is that as far as the VM is concerned, an argument is evaluated before the call but "inline" (i.e. in the same scope)

#

so I'll need to add Breakpoints to know when I can safely be done executing, but I don't know how well that'll work

#

I guess that breakpoints can be added between each expr in a loop?

#

What do you think @left night

#

so it would look like

#while true {
  text(blue)[
    My text is blue.
    #break
    And not green.
  ]
  breakpoint() // not a real function, just to show the opcode
}

left night Jan 31, 2024, 9:29 AM

#

sturdy sequoia I guess that breakpoints can be added between each expr in a loop?

Control flow needs to be processed not only between loop iterations, but also between each individual expression in a block though. Otherwise "And not green" shows up.

sturdy sequoia Jan 31, 2024, 9:33 AM

#

left night Control flow needs to be processed not only between loop iterations, but also be...

so a break does the following:

In the loop: stop the loop at the next breakpoint
In a block in the loop: stop the current block, carry the control flow?

left night Jan 31, 2024, 9:38 AM

#

I assume somewhere in your code, you join the value of each expression in a block? That's where you also need to check for a flow.

sturdy sequoia Jan 31, 2024, 9:44 AM

#

left night I assume somewhere in your code, you join the value of each expression in a bloc...

not really because my code actually has a special purpose register which is the "join" register

#

as far as the VM is concerned, there is no "joining", it just writes to that register

#

and that register has no access to the other state

#

With my approach it'll work but it does remove the ability to use a break as a value

#

so you can break in a scope ([] or {} but not as a regular expression)

#

which imo is fine

#

We could always make break optionally take a value like return?

left night Jan 31, 2024, 10:08 AM

#

sturdy sequoia so you can break in a scope (`[]` or `{}` but not as a regular expression)

okay, so this basically? #1176509648707256370 message

untold turret Jan 31, 2024, 10:16 AM

#

@sturdy sequoia I have read your opcodes.rs and opcodes_raw.rs and have a question: would it be possible to eliminate the element instructions in opcodes_raw.rs by using function calls?
🤔 Why are some elements in the opcode table for construction while others are not? E.g. creating math.frac is an instruction but creating raw is not.

sturdy sequoia Jan 31, 2024, 10:21 AM

#

left night okay, so this basically? https://discord.com/channels/1054443721975922748/117650...

yes

sturdy sequoia Jan 31, 2024, 10:21 AM

#

untold turret @sturdy sequoia I have read your opcodes.rs and opcodes_raw.rs, there is q...

the markup-based elems are compiled to special instructions

#

they're not needed per-se, but I think it's nice that it's a single instruction instead of a bunch of them

untold turret Jan 31, 2024, 10:22 AM

#

sturdy sequoia the markup-based elems are compiled to special instructions

does it bring benefit by compiling to special instructions?

sturdy sequoia Jan 31, 2024, 10:23 AM

#

untold turret does it bring benefit by compiling to special instructions?

it should theoretically be faster at the cost of bigger code size

#

I'd also argue it's easier to maintain

#

but it's neither here nor there

#

but would be fine imo

untold turret Jan 31, 2024, 10:29 AM

#

sturdy sequoia I'd also argue it's easier to maintain

I'm thinking about the possibility of splitting out a typst-vm or typst-evaluator, which creates packed content by calling functions. From that perspective, it is better not to specialize element construction.

left night Jan 31, 2024, 10:29 AM

#

Isn't math.frac also markup-based in that sense?

sturdy sequoia Jan 31, 2024, 10:31 AM

#

@left night the idea of adding special flow instruction (previously called breakpoint) works 🎉
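A minimal sketch of how such a flow instruction could behave, assuming a toy bytecode loop with a single join register. `Op`, `run`, and the `exit` offset are hypothetical names for illustration, not the actual opcodes from the VM. `Break` only records that a break is pending; `Flow` is the safe point where the pending break is acted on.

```rust
#[derive(Debug, Clone, Copy)]
enum Op {
    /// Write a piece of text to the join register.
    Join(&'static str),
    /// Record that a `break` was hit; execution continues for now.
    Break,
    /// Safe point: if a break is pending, jump to `exit`.
    Flow { exit: usize },
}

fn run(code: &[Op]) -> String {
    let mut joined = String::new(); // stand-in for the "join" register
    let mut breaking = false;
    let mut pc = 0;
    while pc < code.len() {
        match code[pc] {
            Op::Join(text) => joined.push_str(text),
            Op::Break => breaking = true,
            Op::Flow { exit } => {
                if breaking {
                    pc = exit;
                    continue;
                }
            }
        }
        pc += 1;
    }
    joined
}

// Roughly the body of `[My text is blue. #break And not green.]`,
// with a flow check after every expression of the block.
fn demo() -> String {
    let exit = 5; // one past the last instruction
    let code = [
        Op::Join("My text is blue."),
        Op::Flow { exit },
        Op::Break,
        Op::Flow { exit },
        Op::Join(" And not green."),
    ];
    run(&code)
}
```

Because the check sits between expressions, the text after `#break` is never joined in this sketch.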

left night Jan 31, 2024, 10:31 AM

#

untold turret I'm thinking about possibility to split a typst-vm or typst-evaluator, which cre...

The VM will necessarily depend on the built-in types (e.g. Array and Dictionary) and those live in the same crate as the elements. What you propose is similar to the old LangItems setup when typst and typst-library were split. I think in some future we can think about splitting crates more again (differently than last time though), but I wouldn't bother with this here since the old evaluator does it the same way.

sturdy sequoia Jan 31, 2024, 10:32 AM

#

@left night it even works with break as a value with one caveat: it counts as a none when placed in an array:

#let x = for i in range(1, 5) {
  if i == 3 {
    (1, 2, break)
  } else {
    (1, 2, 3)
  }
}


#x

leads to:

#

is this acceptable?

#

I can probably fix that, but it would be... really hard

untold turret Jan 31, 2024, 10:37 AM

#

I witnessed typst getting merged with typst-library, as they usually reference each other.
It will definitely be split again in the future, since the typst crate is too big for a sensible crate size. We should find a good split point in any case, and I am considering an evaluator/VM.
Was typst purely an evaluator/VM?

sturdy sequoia Jan 31, 2024, 10:47 AM

#

untold turret I was witnessing typst gets merged with typst-library as they usually references...

That would also need some major changes to the compiler as I have written it with typst in mind

#

I was going to suggest doing a similar split to allow tools like manim to be built using the typst language (and most of the features) but with custom syntax, etc.

untold turret Jan 31, 2024, 10:48 AM

#

sturdy sequoia I was going to suggest doing a similar split to allow tools like manim to be bui...

what does it look like? a reference link or a simple example? I have not used manim but I have heard of it.

sturdy sequoia Jan 31, 2024, 10:51 AM

#

untold turret how does it look like? a reference link or a simple example? I have not used man...

Essentially it allows you to build animated slides with Python, which is used a lot by math content creators and the like. I think Typst is uniquely well suited for such a tool, but it would likely require some custom syntax, and a lot of additions to the current global scope

sturdy sequoia Jan 31, 2024, 11:07 AM

#

@left night I think it's quite interesting that because destructuring is compiled ahead of time, things like this are already supported without me even thinking about it: https://github.com/typst/typst/pull/3308

GitHub

Adjust for-loop's pattern matching rules by Leedehai · Pull Request...

Fixing #3275.

left night Jan 31, 2024, 11:08 AM

#

sturdy sequoia @left night it even works with break as a value with one caveat: it co...

That's how it is intended to work

sturdy sequoia Jan 31, 2024, 11:09 AM

#

left night That's how it is intended to work

Awesome 😄

#

Then it... just works™️

left night Jan 31, 2024, 11:09 AM

#

If I want to compare how my stuff compares to main/release without switching branches and stuff, I just open the web app 😉

sturdy sequoia Jan 31, 2024, 11:10 AM

#

left night If I want to compare how my stuff compares to main/release without switching bra...

I do too sometimes, but I should be studying

#

and I am too lazy to procrastinate any more 😂

left night Jan 31, 2024, 11:11 AM

#

untold turret I was witnessing typst gets merged with typst-library as they usually references...

I only sort of agree. I would also like to split it again in the future (just in a better way), but there is also a trend in Rust that some large projects that were previously split up into many small crates go back to fewer larger crates. One example is tokio where the core crate is larger than typst.

sturdy sequoia Jan 31, 2024, 11:18 AM

#

left night I only _sort of_ agree. I would also like to split it again in the future (just ...

I think perhaps making typst more "extendable" with the ability to have more dynamic stuff, and operator overloading (i.e defining addition for custom values, etc.) it could also work just fine

#

custom join too etc.

#

that way typst as a crate is more extendable

left night Jan 31, 2024, 11:21 AM

#

yes, but I think it would be close to impossible to make eval not depend on some base types

sturdy sequoia Jan 31, 2024, 11:21 AM

#

and I mean operator overloading from rust

sturdy sequoia Jan 31, 2024, 11:21 AM

#

left night yes, but I think it would be close to impossible to make eval not depend on some...

that I agree

#

Dict, Array, Value, Str, etc. have to be in for it to even work

left night Jan 31, 2024, 11:22 AM

#

I think we can potentially have typst-eval depend only on typst-core, which is the bare minimum

#

but then again typst-layout would require further base types like Length

#

the typst-core / typst-whatever split would be fairly arbitrary based on what is needed elsewhere and what isn't

#

which is frustrating because when you add something new, you might need to move a large amount of code from typst-whatever to typst-core

untold turret Jan 31, 2024, 11:23 AM

#

left night I only _sort of_ agree. I would also like to split it again in the future (just ...

My understanding is that we shouldn't split them early, since that causes a lot of additional, unnecessary code to tie the crates together again. If we find some golden point to split at, we would like to split there. For example, typst-syntax is nice as an individual crate and we've used it in typstfmt.
In short, those large projects split their code early at a bad point and found it unideal. They had to merge the crates back together to reduce the cost of writing additional code. They still don't have a golden split point, so they keep a large core crate.

left night Jan 31, 2024, 11:24 AM

#

I think for the current speed of iteration, it is good that there is just one main crate. If the major planned features are implemented, we can then think about what a golden split would be.

#

I also hate the fact that I need to anticipate what subcrate we might need and reserve them on crates.io to be sure that we can use the name in a future-proof way.

sturdy sequoia Jan 31, 2024, 11:25 AM

#

left night I also hate the fact that I need to anticipate what subcrate we might need and r...

Yeah but that's a whole other can of worms

left night Jan 31, 2024, 11:25 AM

#

There is this very nice RFC where multiple crates are part of one package

sturdy sequoia Jan 31, 2024, 11:27 AM

#

left night There is this very nice RFC where multiple crates are part of one package

Ah, that's nice

#

it's like namespaces, but compatible with the current impl 😂

tight glade Jan 31, 2024, 12:01 PM

#

sturdy sequoia I was going to suggest doing a similar split to allow tools like manim to be bui...

hot

left night Jan 31, 2024, 3:45 PM

#

@sturdy sequoia the PR I opened also gives a bit of a speedup compared to main on my machine. curious if you can reproduce that.

feral imp Jan 31, 2024, 3:51 PM

#

Masterproef evidence or it didn't happen @sturdy sequoia

sturdy sequoia Jan 31, 2024, 3:55 PM

#

left night @sturdy sequoia the PR I opened also gives a bit of a speedup compared to ...

I'll do that tomorrow 😉

sly pecan Jan 31, 2024, 5:52 PM

#

left night @sturdy sequoia the PR I opened also gives a bit of a speedup compared to ...

The show/set one?

left night Jan 31, 2024, 5:52 PM

#

sly pecan The show/set one?

Yes

sly pecan Jan 31, 2024, 6:00 PM

#

left night Yes

I guess it's tricky to benchmark since you would also have to make sure the output is the same

left night Jan 31, 2024, 6:02 PM

#

I haven't looked but I wouldn't expect it to be a lot different

#

While the PR does make some breaking changes, it mostly affects code that was non-sensical before.

sturdy sequoia Jan 31, 2024, 6:15 PM

#

@left night Actually, I have a case where my VM is more permissive and I want your opinion:

#

this is a test from the suite:

#

// Count labels.
#let label = <heya>
#let count = counter(label).display()
#let elem(it) = [#box(it) #label]

#elem[hey, there!] #count \
#elem[more here!] #count

#

I tried modifying it to:

#

// Count labels.
#let label = <heya>
#let count = counter(label).display()
#let elem(it) = {
  box(it)
  label
}

#elem[hey, there!] #count \
#elem[more here!] #count

#

On my VM, this works, on the webapp it's refused

#

Should it work or should it be refused? 🤔

#

It is trivial for me to do the old behaviour, so it's really a design decision, not a constraint in any way shape or form

#

(literally one line of code)

cunning wadi Jan 31, 2024, 6:19 PM

#

I kinda like it

sturdy sequoia Jan 31, 2024, 6:20 PM

#

Next point of contention, this code:

// Ref: true
// Test continue while destructuring.
// Should output "one = I \ two = II \ one = I".
#for num in (1, 2, 3, 1) {
  let (word, roman) = if num == 1 {
    ("one", "I")
  } else if num == 2 {
    ("two", "II")
  } else {
    continue
  }
  [#word = #roman \ ]
}

It works, except that my code still tries to destructure after calling continue since the next flow opcode is right after which makes this not work 🤔 I can probably special case it though

#

I can also make destructuring imply a flow to avoid issues

untold turret Jan 31, 2024, 6:21 PM

#

I remember that a label attaches to the preceding content automatically, but the documentation also notes that it doesn't work in "code mode".

#

I think it is a limitation, and your VM extends it.

sturdy sequoia Jan 31, 2024, 6:22 PM

#

Essentially my VM forces you into "content" mode (in terms of joining of values) once you try and join a label

cunning wadi Jan 31, 2024, 6:22 PM

#

sturdy sequoia Next point of contention, this code: ```ts // Ref: true // Test continue while d...

what do you mean by it works? that it does not throw an error due to not being able to destructure none?

sturdy sequoia Jan 31, 2024, 6:22 PM

#

I admit it might be a bit weird

sturdy sequoia Jan 31, 2024, 6:23 PM

#

cunning wadi what do you mean by it works? that it does not throw an error due to not being a...

It works in the sense that the control flow works, but the destructure fails

cunning wadi Jan 31, 2024, 6:23 PM

#

ah, I think that should be changed then

#

let (a, b) = if x.len() == 2 {
  x
} else {
  continue
}

#

I would expect such a thing to just work

sturdy sequoia Jan 31, 2024, 6:24 PM

#

cunning wadi ```rs let (a, b) = if x.len() == 2 { x } else { continue } ```

yeah I know, but it's because of the weird control flow scheme

#

it happens right after destructuring, not before

#

But as I said, I can just manually append flow opcodes where necessary

#

like before an assignment

left night Jan 31, 2024, 7:18 PM

#

sturdy sequoia Should it work or should it be refused? 🤔

It does seem reasonable, but I'm not 100% sure yet whether it will have unintended consequences and, in particular, whether it will be future compatible with content = value. Is it equivalent to having content + label join into labelled content?

left night Jan 31, 2024, 7:19 PM

#

sturdy sequoia yeah I know, but it's because of the weird control flow scheme

The current let binding code also has a special case to handle this, so I think it should be the same here.

sturdy sequoia Jan 31, 2024, 7:20 PM

#

left night It does seem reasonable, but I'm not 100% sure yet whether it will have unintend...

exactly, it basically joins label to the previous content

sturdy sequoia Jan 31, 2024, 7:20 PM

#

left night The current let binding code also has a special case to handle this, so I think ...

I did that too, I just generate a flow opcode before calling the destructure opcode
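A minimal sketch of why the ordering matters, assuming a toy instruction set for one loop iteration. `Op`, `Value`, and `run_iteration` are hypothetical names, not the VM's real opcodes. If the flow check comes after the destructure, the `continue` branch leaves `none` in the register and destructuring fails; with the flow check first, the iteration ends before destructuring is attempted.

```rust
#[derive(Debug, Clone, PartialEq)]
enum Value {
    None,
    Pair(i64, i64),
}

enum Op {
    /// Put a value into the scratch register.
    Load(Value),
    /// Mark a `continue` as pending; the expression yields `None`.
    Continue,
    /// Safe point: end this iteration if a `continue` is pending.
    Flow,
    /// `let (a, b) = <scratch>`; fails unless the register holds a pair.
    Destructure,
}

/// Runs one loop iteration. Returns the bound pair, `None` if the
/// iteration was skipped, or an error if destructuring failed.
fn run_iteration(code: &[Op]) -> Result<Option<(i64, i64)>, String> {
    let mut scratch = Value::None;
    let mut pending = false;
    let mut bound = None;
    for op in code {
        match op {
            Op::Load(value) => scratch = value.clone(),
            Op::Continue => {
                pending = true;
                scratch = Value::None;
            }
            Op::Flow => {
                if pending {
                    return Ok(None);
                }
            }
            Op::Destructure => match &scratch {
                Value::Pair(a, b) => bound = Some((*a, *b)),
                other => return Err(format!("cannot destructure {other:?}")),
            },
        }
    }
    Ok(bound)
}
```

With `[Continue, Destructure, Flow]` the sketch returns an error, while `[Continue, Flow, Destructure]` skips the iteration cleanly.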

left night Jan 31, 2024, 7:24 PM

#

sturdy sequoia exactly, it basically joins label to the previous content

okay. the problem is that when content = value, every element is its own type and to roughly retain the current way things work, the joining rules need to be adapted. basically any two values need to join into a sequence. any ++ label (using ++ for join here) could probably still do labelling as a special case, but I need to think more about this.

sturdy sequoia Jan 31, 2024, 7:25 PM

#

left night okay. the problem is that when content = value, every element is its own type an...

Yes, the reason I currently changed the behaviour to the same as main is that it's non-trivial to label sequence elements that are in the joining "value"

left night Jan 31, 2024, 7:26 PM

#

sturdy sequoia Yes, the reason I currently changed the behaviour to the same as main is that it...

do you mean that it would label the sequence rather than the last leaf?

#

or the other way around?

sturdy sequoia Jan 31, 2024, 7:26 PM

#

left night do you mean that it would label the sequence rather than the last leaf?

yes

#

if you do content ++ content ++ label, does that mean sequence[content, content] ++ label or sequence[content, content ++ label]?

#

I'd argue it's non-trivial

#

take the case #f() <this>, where f produces a sequence
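To make the two readings concrete, here is a minimal sketch assuming a toy `Content` type. `Content`, `label_whole`, and `label_last` are hypothetical and only model the question, not Typst's actual content representation.

```rust
#[derive(Debug, Clone, PartialEq)]
enum Content {
    Text(String),
    Sequence(Vec<Content>),
    Labelled(Box<Content>, String),
}

/// Reading 1: `sequence[content, content] ++ label` labels the whole sequence.
fn label_whole(content: Content, label: &str) -> Content {
    Content::Labelled(Box::new(content), label.to_string())
}

/// Reading 2: `sequence[content, content ++ label]` labels the last leaf.
fn label_last(content: Content, label: &str) -> Content {
    match content {
        Content::Sequence(mut items) => match items.pop() {
            Some(last) => {
                items.push(label_last(last, label));
                Content::Sequence(items)
            }
            // An empty sequence has no last leaf, so fall back to reading 1.
            None => label_whole(Content::Sequence(items), label),
        },
        other => label_whole(other, label),
    }
}

// What `#f() <this>` could mean when `f` returns a two-element sequence.
fn demo() -> (Content, Content) {
    let seq = Content::Sequence(vec![
        Content::Text("one".into()),
        Content::Text("two".into()),
    ]);
    (label_whole(seq.clone(), "this"), label_last(seq, "this"))
}
```

The two results differ in which element a later query for the label would return, which is the ambiguity raised above.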

left night Jan 31, 2024, 7:27 PM

#

yeah, that might be the reason this doesn't exist on main (or nobody ever thought about it, can't remember)

sturdy sequoia Jan 31, 2024, 7:28 PM

#

left night yeah, that _might_ be the reason this doesn't exist on main (or nobody ever thou...

yes I think it's better if that doesn't work as it's less error prone imo

#

and more consistent

left night Jan 31, 2024, 7:28 PM

#

I think supporting syntactically labelling in code mode might be nice (i.e. figure(..) <label>), but arbitrary labelling should perhaps be done via a function. Maybe the same should actually apply in content mode. It's more predictable that way.

#

It also fixes the problem that people from time to time try = Heading #label and the label attaches to the text rather than the heading

sturdy sequoia Jan 31, 2024, 7:29 PM

#

how about figure(..) <label> works, but

figure(..)
<label>

#

doesn't?

left night Jan 31, 2024, 7:29 PM

#

yeah, that's sort of what I meant

#

this is the kind of thing that is either super easy to parse or a parsing nightmare

#

if we want to make the breaking change for content mode, we should wait until type rework probably because it'll be quite the breaking change

#

I'm hoping to batch a bunch of breaking changes up and then make one very breaking release

sturdy sequoia Jan 31, 2024, 7:32 PM

#

left night I'm hoping to batch a bunch of breaking changes up and then make one very breaki...

For the VM I am trying not to break things and so far I am achieving it which makes me happy 😄

#

I mean, almost everything is broken

#

but what works isn't

#

😂

left night Jan 31, 2024, 7:32 PM

#

^^

sturdy sequoia Jan 31, 2024, 7:32 PM

#

Do you want to know the trickiest bug I have fixed so far?

#

I was debugging a test that would only fail when run as a test, not as a standalone document. It so happened that I was creating a local variable with the name of a closure (to allow recursion). The closure was supposed to be anonymous, but it was not, because I accidentally gave the compiler the module's name (counter). That meant that when it did counter(heading), it was re-invoking the module

#

💀

#

That was a very weird one to debug

left night Jan 31, 2024, 7:49 PM

#

oh no

sturdy sequoia Jan 31, 2024, 9:18 PM

#

left night oh no

BTW, once the VM actually works, will you and I be able to schedule a call to implement tracing (like seeing execution of nodes, etc.) because I have no idea how it all works and how to integrate it. I'm talking in 2-3 weeks most likely

left night Jan 31, 2024, 9:58 PM

#

sturdy sequoia BTW, once the VM actually works, will you and I be able to schedule a call to im...

sure, we can discuss it then! just for clarification: do you mean tracing as in hover tooltips?

sturdy sequoia Jan 31, 2024, 10:04 PM

#

left night sure, we can discuss it then! just for clarification: do you mean tracing as in ...

yes exactly

sturdy sequoia Feb 1, 2024, 9:55 PM

#

@left night outside of doing some bit magic, do you think there is a way that a Span could become align(4) or (ideally) align(2)?

#

😄

#

I could make it #[repr(packed)] but it's a bit unholy

left night Feb 1, 2024, 10:07 PM

#

sturdy sequoia @left night outside of doing some bit magic, do you think there is a w...

I think you could store it as [u8; 8] and boom, it's align(1). Casting between that and u64 shouldn't be a problem as long as you do it by value rather than by reference (i.e. via to_ne_bytes / from_ne_bytes)
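A minimal sketch of that byte-array approach, assuming a stand-in `Span` that simply wraps a `u64`. `PackedSpan` and `Instruction` are hypothetical names. The conversion goes by value through `u64::to_ne_bytes` and `u64::from_ne_bytes`, so no unaligned reference is ever created.

```rust
// Stand-in for the real span type; assumed here to wrap a plain u64.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct Span(u64);

/// Same 8 bytes, but with alignment 1 because it is just a byte array.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct PackedSpan([u8; 8]);

impl PackedSpan {
    fn pack(span: Span) -> Self {
        PackedSpan(span.0.to_ne_bytes())
    }

    fn unpack(self) -> Span {
        Span(u64::from_ne_bytes(self.0))
    }
}

/// Hypothetical instruction: one opcode byte followed by the packed span.
#[repr(C)]
struct Instruction {
    opcode: u8,
    span: PackedSpan,
}

/// Returns (alignment of PackedSpan, size of PackedSpan, size of Instruction).
fn layout() -> (usize, usize, usize) {
    (
        std::mem::align_of::<PackedSpan>(),
        std::mem::size_of::<PackedSpan>(),
        std::mem::size_of::<Instruction>(),
    )
}
```

With alignment 1, the span can sit directly after a one-byte opcode without padding, which is not the case for a field that needs 8-byte alignment.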

sly pecan Feb 2, 2024, 4:27 PM

#

@sturdy sequoia have you compiled your thesis yet?

feral imp Feb 2, 2024, 4:43 PM

#

sly pecan @sturdy sequoia have you compiled your thesis yet?

I can't wait either...😅 😅

sturdy sequoia Feb 2, 2024, 4:57 PM

#

sly pecan @sturdy sequoia have you compiled your thesis yet?

No, I have changed a few things I’ll try again today

sturdy sequoia Feb 2, 2024, 11:15 PM

#

Well, I had this theory that: fewer registers = faster VM

#

I was right sunglassed_crying

#

So one optimization I'll be implementing: several sizes of VMs, and it will select one depending on the number of registers you need (from 4 -> 256)

#

(so 4, 8, 16, 32, 48, 64, 96, 128, 256)

#

it's actually surprisingly easy to do with const generics and an enum 😉

feral imp Feb 2, 2024, 11:18 PM

#

sturdy sequoia it's actually surprisingly easy to do with const generics and an enum 😉

Phew. That rarely works, because const generics are limited.

#

But it's good that you can make do!

sturdy sequoia Feb 2, 2024, 11:21 PM

#

My idea is basically to have a Vm<const N: usize> and then an enum like:

enum VmSizes {
  S8(Vm<8>),
  S16(Vm<16>),
  ...
}

#

easy as it gets

cunning wadi Feb 2, 2024, 11:30 PM

#

sturdy sequoia My idea is basically to have a `Vm<const N: usize>` and then an enum like: ```rs...

wouldn't that just have the enum be the size of the largest vm?

#

or are the registers heap allocated?
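A small sketch of the size concern, assuming registers are stored inline and using a `u64` stand-in for the register type (the real register is larger). `Vm`, `VmSizes`, and `sizes` are hypothetical here. A Rust enum must be able to hold its largest variant, so the enum is at least as big as `Vm<256>` no matter which variant is active.

```rust
use std::mem::size_of;

// Stand-in register type; an assumption made only for this illustration.
type Register = u64;

#[allow(dead_code)]
struct Vm<const N: usize> {
    registers: [Register; N],
}

#[allow(dead_code)]
enum VmSizes {
    S8(Vm<8>),
    S16(Vm<16>),
    S256(Vm<256>),
}

/// Returns (size of Vm<8>, size of Vm<256>, size of the enum) in bytes.
fn sizes() -> (usize, usize, usize) {
    (size_of::<Vm<8>>(), size_of::<Vm<256>>(), size_of::<VmSizes>())
}
```

So an inline enum would not save memory for small VMs; each value of `VmSizes` reserves room for the 256-register case.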

sturdy sequoia Feb 2, 2024, 11:30 PM

#

cunning wadi or are the registers heap allocated?

hmmmm

#

I didn't think of that!

#

Probably would need to be a function instead

#

like

#

fn eval(...) {
  if min_reg <= 8 {
    Vm::<8>::eval(...)
  } else if ... {
    ...
  }
}

#

And hope the compiler is smart enough to figure it out

cunning wadi Feb 2, 2024, 11:32 PM

#

yeah that could work

sturdy sequoia Feb 2, 2024, 11:33 PM

#

As it turns out tablex needs more than 64 registers 💀

cunning wadi Feb 2, 2024, 11:35 PM

#

I think you need a fallback stack once too many variables are needed at one time

#

uh, actually not a stack

#

just a heap allocated (resizable) array

feral imp Feb 2, 2024, 11:36 PM

#

didn't laurmaedje have some funky pattern for this?

sturdy sequoia Feb 2, 2024, 11:36 PM

#

cunning wadi I think you need a fallback stack once too many variables are needed at one time

probably yeah

#

I mean there's a good chance that just using the heap would work

#

and having infinite registers

cunning wadi Feb 2, 2024, 11:37 PM

#

I was just writing that

#

but yeah

#

or another idea

#

you could have

```rs
trait Storage {
    fn read(&self, reg: Register) -> Value;
    // `&mut self`, since writing mutates the register storage
    fn write(&mut self, reg: Register, value: Value);
}

impl<const N: usize> Storage for [u8; N] { ... }
impl Storage for Box<[u8]> { ... }

struct Vm<S: Storage> {
    registers: S,
}
```

sturdy sequoia Feb 2, 2024, 11:39 PM

#

ah yes, we can go back to heap if need be

#

that's clever

#

and small VMs can benefit from the full set of optimizations by having small register arrays
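A runnable sketch of the Storage idea combined with the size-based dispatch function, under several assumptions: `Value` is a `u64` stand-in, `Register` is a plain index, storage holds values rather than raw bytes, and the thresholds 8 and 16 are arbitrary. None of this is the actual VM code.

```rust
// Stand-ins; the real Value and Register types are assumptions here.
type Value = u64;

#[derive(Clone, Copy)]
struct Register(usize);

trait Storage {
    fn read(&self, reg: Register) -> Value;
    fn write(&mut self, reg: Register, value: Value);
}

// Fixed-size storage, kept inline (no allocation) for small VMs.
impl<const N: usize> Storage for [Value; N] {
    fn read(&self, reg: Register) -> Value {
        self[reg.0]
    }
    fn write(&mut self, reg: Register, value: Value) {
        self[reg.0] = value;
    }
}

// Heap fallback for code that needs more registers.
impl Storage for Box<[Value]> {
    fn read(&self, reg: Register) -> Value {
        self[reg.0]
    }
    fn write(&mut self, reg: Register, value: Value) {
        self[reg.0] = value;
    }
}

struct Vm<S: Storage> {
    registers: S,
}

impl<S: Storage> Vm<S> {
    fn add(&mut self, lhs: Register, rhs: Register, out: Register) {
        let sum = self.registers.read(lhs) + self.registers.read(rhs);
        self.registers.write(out, sum);
    }
}

// A tiny fixed "program" (2 + 3), generic over the storage kind.
fn run<S: Storage>(storage: S) -> Value {
    let mut vm = Vm { registers: storage };
    vm.registers.write(Register(0), 2);
    vm.registers.write(Register(1), 3);
    vm.add(Register(0), Register(1), Register(2));
    vm.registers.read(Register(2))
}

// Picks the storage based on how many registers the code needs.
fn eval(needed: usize) -> Value {
    if needed <= 8 {
        run([0 as Value; 8])
    } else if needed <= 16 {
        run([0 as Value; 16])
    } else {
        run(vec![0 as Value; needed].into_boxed_slice())
    }
}
```

Each distinct `S` produces its own monomorphized copy of `run` and `Vm`, which is the code-size trade-off discussed in this thread.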

cunning wadi Feb 2, 2024, 11:39 PM

#

exactly

#

only downside might be code size

#

but I think it will be fine

sturdy sequoia Feb 2, 2024, 11:40 PM

#

I think the VM is already quite chonky 💀

#

well the compiler is the chonky bit

cunning wadi Feb 2, 2024, 11:40 PM

#

I mean code size of the compiled object

#

but yeah

sturdy sequoia Feb 2, 2024, 11:42 PM

#

yes but that's a one-time cost: when "entering" the VM

cunning wadi Feb 2, 2024, 11:44 PM

#

well, not quite
it's the download size of the wasm binary
it's the amount of icache that is taken up by that part of the vm
it's the initial memory accesses that might not be quite as efficient due to code not being as close together

#

but I expect it not to matter too much in the grand scheme of things

sturdy sequoia Feb 2, 2024, 11:44 PM

#

true, but executing code will likely still be slower 😐

sly pecan Feb 2, 2024, 11:49 PM

#

sturdy sequoia So one optimization I'll be implementing: several sizes of VMs, and it will sele...

You're never going to get to the point of actually compiling actual documents are you? 😂

sturdy sequoia Feb 2, 2024, 11:53 PM

#

sly pecan You're never going to get to the point of actually compiling actual documents ar...

I mean I'm trying angryeyes

untold turret Feb 2, 2024, 11:55 PM

#

sturdy sequoia My idea is basically to have a `Vm<const N: usize>` and then an enum like: ```rs...

why does it depend on a const generic size of registers?

sly pecan Feb 2, 2024, 11:56 PM

#

sturdy sequoia I mean I'm trying angryeyes

Maybe too hard!

untold turret Feb 2, 2024, 11:58 PM

#

It will generate 9 copies of the VM code if you use Vm<4,8,16,32,48,64,96,128,256>

sturdy sequoia Feb 2, 2024, 11:58 PM

#

untold turret why does it depend a const generic size of registers?

to optimize its size based on the actual needs

sly pecan Feb 2, 2024, 11:59 PM

#

Just be careful so you don't overwork yourself!

sturdy sequoia Feb 3, 2024, 12:00 AM

#

sly pecan Just be careful so you don't overwork yourself!

Yes!

untold turret Feb 3, 2024, 12:01 AM

#

sturdy sequoia to optimize its size based on the actual needs

How do we use the generic constant? If it is for a static-sized register array, there will be no difference from an Arc register array or even a Vec register array.

left night Feb 3, 2024, 8:41 AM

#

I also don't yet understand why you need to instantiate the Vm over and over again instead of just storing the registers in a Vec and only using a smaller amount of the Vec. I can't imagine that the one allocation will be the costly thing.
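A minimal sketch of the Vec-based alternative, assuming a `u64` stand-in for `Value`. `Registers` and `frame` are hypothetical names. One buffer is allocated once and reused; each run only uses the first `n` slots it needs.

```rust
// Stand-in value type; an assumption for this illustration.
type Value = u64;

struct Registers {
    buf: Vec<Value>,
}

impl Registers {
    fn new() -> Self {
        Self { buf: Vec::new() }
    }

    /// Returns a zeroed window of `n` registers, growing the buffer only
    /// when a run needs more registers than any run before it.
    fn frame(&mut self, n: usize) -> &mut [Value] {
        if self.buf.len() < n {
            self.buf.resize(n, 0);
        }
        let frame = &mut self.buf[..n];
        frame.fill(0);
        frame
    }

    fn capacity(&self) -> usize {
        self.buf.capacity()
    }
}
```

After the first large run, smaller runs reuse the same allocation, so the allocation cost is paid rarely rather than per VM instantiation.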

glossy shore Feb 3, 2024, 8:49 AM

#

left night

I don't see how this is different from 1 + break + 3

glossy shore Feb 3, 2024, 9:06 AM

#

sturdy sequoia Should it work or should it be refused? 🤔

#{
  [ one ]
  [ two ]
  label("hi")
}

is query(<hi>).first().elem [two] or [one two]

glossy shore Feb 3, 2024, 9:10 AM

#

cunning wadi ```rs let (a, b) = if x.len() == 2 { x } else { continue } ```

this is where a never type (and uh... value) might help

left night Feb 3, 2024, 9:13 AM

#

glossy shore this is where a `never` type (and uh... _value_) might help

a never value, huh ^^

left night Feb 3, 2024, 9:14 AM

#

glossy shore I don't see how this is different from `1 + break + 3`

what do you mean?

sturdy sequoia Feb 3, 2024, 9:18 AM

#

left night I also don't yet understand why you need to instantiate the Vm over and over aga...

Well that's the thing, I haven't tested yet, my idea with limiting the number of registers is to (hopefully) guarantee that they're in L1 cache

#

But I don't know whether this has materialized and I haven't profiled the compiler yet

#

All I know is that less registers translate to double the performance

left night Feb 3, 2024, 9:19 AM

#

But why can't a Vec of registers be in L1 cache?

sturdy sequoia Feb 3, 2024, 9:19 AM

#

left night But why can't a Vec of registers be in L1 cache?

it can, but since the array would have been on the stack the likelihood of it being in L1 cache should've been higher

glossy shore Feb 3, 2024, 9:20 AM

#

left night what do you mean?

The solution that was settled on seems reasonable to me, I've no more to add

glossy shore Feb 3, 2024, 9:20 AM

#

sturdy sequoia it can, but since the array would have been on the stack the *likelihood* of it ...

Oh that's smart

left night Feb 3, 2024, 9:20 AM

#

sturdy sequoia it can, but since the array would have been on the stack the *likelihood* of it ...

is that really so? if the vector is constantly being accessed and still not in cache, I would blame the CPU for being dumb.

sturdy sequoia Feb 3, 2024, 9:20 AM

#

left night is that really so? if the vector is constantly being accessed and still not in c...

fair

#

Anyway, I'll give it a try too, I'd just like the VM to work now 😭

glossy shore Feb 3, 2024, 9:21 AM

#

Dherse's idea is that the registers would share the cache together with the Rust variables

#

If I understand

sturdy sequoia Feb 3, 2024, 9:21 AM

#

There is a weird behaviour that makes tablex not compile and I don't know why

left night Feb 3, 2024, 9:21 AM

#

I am just very concerned about code complexity, compile times, and generated code size with heavy use of const generics across the entire eval pipeline.

sturdy sequoia Feb 3, 2024, 9:21 AM

#

glossy shore Dherse's idea is that the registers would share the cache together with the Rust...

I mean yes, but the registers are rust variables in a way 🤔

sturdy sequoia Feb 3, 2024, 9:22 AM

#

left night I am just very concerned about code complexity, compile times, and generated cod...

right now there are no const generics

#

so not much to worry about

#

yet

#

https://tenor.com/view/evil-laugh-gif-25608698


left night Feb 3, 2024, 9:22 AM

#

yeah, that's why I'm arguing heavily against it now

glossy shore Feb 3, 2024, 9:22 AM

#

sturdy sequoia I mean yes, but the registers are rust variables in a way 🤔

well but if one's on the heap and one's in the stack...

left night Feb 3, 2024, 9:22 AM

#

much of the values being operated on will also be in various places of the heap though

sturdy sequoia Feb 3, 2024, 9:23 AM

#

left night yeah, that's why I'm arguing heavily against it now

I still think that having const generics using @cunning wadi's Storage trait is good for small Vms (i.e that need like < 16 registers) because those are likely very short code, very short lived so not allocating in those ones is probably a good thing
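
A minimal sketch of what such a storage abstraction could look like. The Storage trait mentioned above is not shown in this log, so the trait, its methods, and both implementations here are hypothetical, and registers are plain u64 for brevity.

```rust
/// Hypothetical storage abstraction for a VM's register file.
trait Storage {
    fn get(&self, index: usize) -> u64;
    fn set(&mut self, index: usize, value: u64);
}

/// Fixed-size register file stored inline (no heap allocation).
struct Inline<const N: usize>([u64; N]);

/// Heap-allocated register file, sized at run time.
struct Heap(Vec<u64>);

impl<const N: usize> Storage for Inline<N> {
    fn get(&self, index: usize) -> u64 {
        self.0[index]
    }
    fn set(&mut self, index: usize, value: u64) {
        self.0[index] = value;
    }
}

impl Storage for Heap {
    fn get(&self, index: usize) -> u64 {
        self.0[index]
    }
    fn set(&mut self, index: usize, value: u64) {
        self.0[index] = value;
    }
}

/// Toy "VM" generic over its storage: adds registers 0 and 1 into register 2.
fn run_add<S: Storage>(regs: &mut S) -> u64 {
    let sum = regs.get(0) + regs.get(1);
    regs.set(2, sum);
    regs.get(2)
}
```

Note that run_add is compiled once per storage type and once per distinct N, which is the code-size and compile-time cost raised earlier in the thread.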

glossy shore Feb 3, 2024, 9:23 AM

#

how big is a register?

sturdy sequoia Feb 3, 2024, 9:23 AM

#

glossy shore how big is a register?

16 bytes iirc

left night Feb 3, 2024, 9:23 AM

#

Value is 32 bytes

sturdy sequoia Feb 3, 2024, 9:24 AM

#

left night Value is 32 bytes

really?

#

damn that's chonky

#

so registers are 32 bytes 😄

glossy shore Feb 3, 2024, 9:24 AM

#

chonk

left night Feb 3, 2024, 9:24 AM

#

but even with a fully reworked value representation, I don't see how to go much lower

#

to store an integer inline, we need 8 bytes. we need to store type information somewhere (most likely a pointer), that's another 8 bytes. and then we need some pointer to extra optional stuff like labels and locations. so that's 24 bytes min. if we want 16 bytes of inline capacity (which I think we want), we're back to 32 bytes.
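
The arithmetic above can be checked with a small layout sketch. The struct and field names are made up for illustration and are not Typst's actual Value; pointers are modelled as u64, i.e. a 64-bit target is assumed.

```rust
use std::mem::size_of;

/// Hypothetical minimal layout: 8 inline bytes plus two pointers.
#[allow(dead_code)]
#[repr(C)]
struct MinimalValue {
    inline: [u8; 8], // room for one integer
    type_info: u64,  // pointer to type information
    extra: u64,      // pointer to optional labels, locations, ...
}

/// Same layout with 16 bytes of inline capacity.
#[allow(dead_code)]
#[repr(C)]
struct SketchValue {
    inline: [u8; 16], // room for a short inline string
    type_info: u64,
    extra: u64,
}

/// Size in bytes of a register file holding `n` such values.
fn register_file_bytes(n: usize) -> usize {
    n * size_of::<SketchValue>()
}
```

Under these assumptions MinimalValue is 24 bytes and SketchValue is 32 bytes, so a file of 16 registers takes 512 bytes.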

glossy shore Feb 3, 2024, 9:25 AM

#

16 registers is half a meg

left night Feb 3, 2024, 9:26 AM

#

megabyte? half a kilobyte.

glossy shore Feb 3, 2024, 9:26 AM

#

yes lol

#

🤦‍♀️

glossy shore Feb 3, 2024, 9:27 AM

#

left night to store an integer inline, we need 8 bytes. we need to store type information s...

we could store just one pointer maybe

left night Feb 3, 2024, 9:28 AM

#

the type info pointer could indeed be moved into the extra optional stuff allocation. not sure whether that's worth it as it would add extra branching and potential indirection to all type checks. would need benchmarks.

#

but in the inline case, we still need at least one pointer in addition to the inline value

#

and 8 bytes of inline storage would not let a short string be inline, defeating the entire purpose of EcoString (which is 16 bytes)

glossy shore Feb 3, 2024, 11:58 AM

#

Technically we could optimise way further by exploiting pointer repr

#

achieving more than 24 bytes of inlined string

sturdy sequoia Feb 3, 2024, 2:25 PM

#

@left night @glad urchin @sly pecan @tight glade @feral imp I have made TREMENDOUS progress: IT COMPILES TABLEX

#

🚀 🚀 🚀 🚀

#

I ran the whole tablex-test.typ and it all seems correct to me

#

excited

atomic violet Feb 3, 2024, 2:28 PM

#

is it fast?

sturdy sequoia Feb 3, 2024, 2:28 PM

#

IT COMPILES MY THESIS

#

OMG

#

OMG

#

OMG

#

I DID IT

#

IT WORKS

#

🎉

#

🚀 🚀 🚀 🚀 🚀 🚀 🚀 🚀 🚀 🚀 🚀 🚀 🚀 🚀 🚀 🚀 🚀 🚀 🚀 🚀 🚀 🚀 🚀 🚀

sturdy sequoia Feb 3, 2024, 2:28 PM

#

atomic violet is it fast?

😶

#

I won't say 😂

sly pecan Feb 3, 2024, 2:28 PM

#

"no"

atomic violet Feb 3, 2024, 2:29 PM

#

lmao

#

I'm building it 👺

sturdy sequoia Feb 3, 2024, 2:29 PM

#

atomic violet I'm building it 👺

not yet, I haven't pushed

atomic violet Feb 3, 2024, 2:29 PM

#

oh

sturdy sequoia Feb 3, 2024, 2:30 PM

#

I just did 😎

#

BTW, it is really slow atm, don't judge me uwu

atomic violet Feb 3, 2024, 2:30 PM

#

ok goddammit do I cancel now?..

sturdy sequoia Feb 3, 2024, 2:30 PM

#

atomic violet ok goddammit do I cancel now?..

yes?

#

I'll try and figure out why it's so goddamn slow

atomic violet Feb 3, 2024, 2:32 PM

#

is compilation slow or the interpreter?

sturdy sequoia Feb 3, 2024, 2:33 PM

#

atomic violet is compilation slow or the interpreter?

it's the VM

#

but I know why

#

(I think at least :D)

#

Small functions are currently quite slow

feral imp Feb 3, 2024, 2:38 PM

#

Is it inlining?

#

I must admit I've less than a fig clue what you're doing...

tight glade Feb 3, 2024, 2:39 PM

#

Hourra! Sad its so slow tho 😂

untold turret Feb 3, 2024, 3:10 PM

#

science

sturdy sequoia Feb 3, 2024, 3:26 PM

#

feral imp I must admit I've less than a fig clue what you're doing...

that's actually not a bad idea? 🤔

sturdy sequoia Feb 3, 2024, 3:56 PM

#

Ok, so I did the change

#

and it's about 20x faster in debug

#

time to see in release 😄

#

It also simplified a ton of stuff which I like

sly pecan Feb 3, 2024, 4:03 PM

#

sturdy sequoia and it's about 20x faster in debug

I read this and thought you meant it was 20x faster in debug than release

sturdy sequoia Feb 3, 2024, 4:03 PM

#

sly pecan I read this and thought you meant it was 20x faster in debug than release

no 😂

sly pecan Feb 3, 2024, 4:03 PM

#

Wondering what kind of cursed stuff you were doing

sturdy sequoia Feb 3, 2024, 4:03 PM

#

sly pecan Wondering what kind of cursed stuff you were doing

I will not, in fact, answer that

feral imp Feb 3, 2024, 4:03 PM

#

sturdy sequoia I will not, in fact, answer that

Wise.

sly pecan Feb 3, 2024, 4:04 PM

#

sturdy sequoia I will not, in fact, answer that

What you say can and will be used against you.

sturdy sequoia Feb 3, 2024, 4:04 PM

#

sly pecan What you say can and will be used against you.

I plead the fifth?

#

Time of dynamic number of registers, maybe that'll work 💀

sly pecan Feb 3, 2024, 4:05 PM

#

I will neither confirm nor deny that I murdered that man

sturdy sequoia Feb 3, 2024, 4:05 PM

#

WHY IS IT SO SLOOOOOOOOOOW 😦

glad urchin Feb 3, 2024, 4:05 PM

#

sly pecan Wondering what kind of cursed stuff you were doing

unsafe { std::transmute((******arc).wrapping_offset(engine.among_us)) }

sly pecan Feb 3, 2024, 4:06 PM

#

I don't get that, but I'm sure it's super funny

glad urchin Feb 3, 2024, 4:06 PM

#

I don't think you're supposed to get it

sturdy sequoia Feb 3, 2024, 4:07 PM

#

glad urchin `unsafe { std::transmute((******arc).wrapping_offset(engine.among_us)) }`

https://tenor.com/view/sussy-among-us-sussy-amogus-among-us-sus-sussy-gif-26236076


feral imp Feb 3, 2024, 4:07 PM

#

It's fast in debug and slow in release?

atomic violet Feb 3, 2024, 4:12 PM

#

sturdy sequoia WHY IS IT SO SLOOOOOOOOOOW 😦

have you profiled it? what does it say?

sturdy sequoia Feb 3, 2024, 4:19 PM

#

atomic violet have you profiled it? what does it say?

I haven't yet

sturdy sequoia Feb 3, 2024, 4:19 PM

#

feral imp It's fast in debug and slow in release?

it's faster in debug than it used to be, but in release the difference is much lower

sturdy sequoia Feb 3, 2024, 4:34 PM

#

Ok, so it's much better

#

but still... not as good as I'd like 😦

feral imp Feb 3, 2024, 4:40 PM

#

Alright. Progress is progress. There is a lot of experimental / novel shit here, so do take time to just reflect and appreciate that it is growing organically.

sturdy sequoia Feb 3, 2024, 4:40 PM

#

feral imp Alright. Progress is progress. There is a lot of experimental / novel shit here,...

I'm sure it's just some dumb stuff that's slowing down everything

#

I am profiling atm

atomic violet Feb 3, 2024, 4:43 PM

#

I did perf. I am no expert in hashing, but goddam, that's a lot of hashing.

#

I can't find anything not about hashing, actually

sturdy sequoia Feb 3, 2024, 4:44 PM

#

ooooooooooooooof

#

ooooooooooooooooooooooooooffffffffffffffffffffffffffffffffffffffffffffff

#

I did a poopsy it seems

atomic violet Feb 3, 2024, 4:44 PM

#

aha nevermind

sturdy sequoia Feb 3, 2024, 4:59 PM

#

Ok, I just tripled performance

#

now it's... as fast as main

#

😭

atomic violet Feb 3, 2024, 4:59 PM

#

what went wrong?

sturdy sequoia Feb 3, 2024, 5:00 PM

#

atomic violet what went wrong?

too much hashing

#

I just need to figure out where the rest is angrythunk

#

Wait, I think I know!!!!

#

I did not, in fact, know

atomic violet Feb 3, 2024, 5:12 PM

#

I was going to say "... and you will first profile it to check if you are right or not?.." but decided not to...

sturdy sequoia Feb 3, 2024, 5:15 PM

#

atomic violet I was going to say "... and you will first profile it to check if you are right...

I mean, there is too much hashing

#

I'm trying to figure out why

ornate merlin Feb 3, 2024, 5:18 PM

#

sturdy sequoia too much hashing

Maybe you should switch from SipHash to another hasher?

For example hashbrown uses AHash as the default hasher, which is much faster than SipHash.

sturdy sequoia Feb 3, 2024, 5:19 PM

#

ornate merlin Maybe you should switch from SipHash to another hasher? For example hashbrown ...

We rely on the quality of SipHash to make the assumption that there are no collisions

ornate merlin Feb 3, 2024, 5:19 PM

#

sturdy sequoia too much hashing

You do not need HashDoS resistance inside compiler in any case

atomic violet Feb 3, 2024, 5:19 PM

#

we kind of do though...

#

I think hash collision can be easily abused to extract data from unrelated sources

#

so, say, a package would be able to access the cached read file

#

that's bad

#

plus even if all hashing suddenly became something like xor hashing, it would not have helped here

#

it's an algorithmic issue caused by inaccurate use of comemo (I assume)

atomic violet Feb 3, 2024, 5:22 PM

#

atomic violet I think hash collision can be easily abused to extract data from unrelated sourc...

actually, I bet you can do even worse, can't you?..

ornate merlin Feb 3, 2024, 5:22 PM

#

sturdy sequoia We rely on the quality of SipHash to make the assumption that there are no colli...

You do not need to worry about collisions. In any case AHash provides the same level of collisions as SipHash.

It just doesn't provide HashDoS resistance

atomic violet Feb 3, 2024, 5:23 PM

#

like what if you replay the changes to a different object?

#

can you break some of the safety guards somehow?

atomic violet Feb 3, 2024, 5:23 PM

#

ornate merlin You do not need to worry about collisions. In any case AHash provides the same ...

We do need to care about hash collisions. comemo relies on high quality hashing

ornate merlin Feb 3, 2024, 5:25 PM

#

atomic violet We do need to care about hash collisions. comemo relies on high quality hashing

High quality hashing and HashDoS resistance is not the same thing.

Actually, SipHash just recreates the hasher every time.

atomic violet Feb 3, 2024, 5:26 PM

#

I know it's not the same thing, but I don't understand what you are saying

ornate merlin Feb 3, 2024, 5:28 PM

#

atomic violet I know it's not the same thing, but I don't understand what you are saying

I mean that if you compare AHash and SipHash with regard to collision probability, they are pretty much the same.

atomic violet Feb 3, 2024, 5:29 PM

#

ah, so you are just proposing replacing siphash with ahash?

#

ahash is 64 bit though

#

https://github.com/typst/comemo/issues/3

feral imp Feb 3, 2024, 5:31 PM

#

?

#

look, isn't it just a matter of letting Dherse work at it a bit more, to figure out why there are lots of seemingly unnecessary hashings, and somewhere, sometime later it can be discussed whether to change the hashing?

atomic violet Feb 3, 2024, 5:32 PM

#

yeah ofc

#

switching hashes now won't give anything

#

current VM performance is a bug

ornate merlin Feb 3, 2024, 5:34 PM

#

atomic violet ah, so you are just proposing replacing siphash with ahash?

Yes. SipHash recreates the hasher every time you change the size of the hashmap, i.e. when the hashmap grows from 3 to 7 and so on.

AHash, in contrast, is created once, when you create the hashmap, and it stays the same for the life of the hashmap

ornate merlin Feb 3, 2024, 5:38 PM

#

atomic violet switching hashes now won't give anything

I haven't actually looked at the code, so maybe it can be improved. But AHash is really fast. It can be 10 times faster than SipHash, especially when you use simple keys like integers, etc

atomic violet Feb 3, 2024, 5:40 PM

#

Is there a 128-bit version of AHash? If not, it's basically useless.

#

64 bits are not enough to prevent collisions
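
To put rough numbers on that claim, here is a sketch of the standard birthday approximation. It assumes uniformly distributed hashes; the function name and the input sizes are illustrative, not measurements from comemo.

```rust
/// Approximate probability of at least one collision among `n` uniformly
/// random `bits`-bit hashes: p = 1 - exp(-n(n-1) / (2 * 2^bits)).
fn collision_probability(n: f64, bits: i32) -> f64 {
    let space = 2f64.powi(bits);
    let x = n * (n - 1.0) / (2.0 * space);
    // 1 - exp(-x), computed via exp_m1 so tiny probabilities don't round to 0.
    -(-x).exp_m1()
}
```

For 2^32 hashed inputs this gives a bit over 39% with 64-bit hashes and a value below 1e-19 with 128-bit hashes.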

sturdy sequoia Feb 3, 2024, 5:43 PM

#

ornate merlin You do not need to worry about collisions. In any case AHash provides the same ...

We'd need to talk about this with @left night

#

Hey, it's now faster than main 🎉

feral imp Feb 3, 2024, 5:50 PM

#

sturdy sequoia Hey, it's now faster than main 🎉

ornate merlin Feb 3, 2024, 6:09 PM

#

sturdy sequoia We'd need to talk about this with @left night

Of course😊. But you need to remember that even if the output hashes don't have any collisions, collisions can still occur when you calculate the remainder (u64 / buckets). And because the current Rust hashmap implementation uses sse2/neon instructions, you probably won't notice a difference on small tables (up to 48 elements, I think).

And AHash passes the full SMHasher test suite (https://github.com/tkaitchuck/aHash/blob/master/compare/readme.md#Quality), so its output is pretty good


atomic violet Feb 3, 2024, 6:11 PM

#

ornate merlin Of course😊. But you need to remember that even if the output hash doesn't have...

Hashes are not used for hash table directly, they are used for comparison

#

in other words, there is a check that hashes are equal, object equality is not checked

#

moreover, the "rehashing creates a new hasher" argument is not applicable here afaik, because the hasher for a comemo hash table is separate from a hasher for tracked objects

ornate merlin Feb 3, 2024, 6:13 PM

#

atomic violet in other words, there is a check that hashes are equal, object equality is not c...

Hm.. Actually you are wrong.

#

Hashbrown stores only the first 7 bits of the hash, and all other comparisons are done on real keys, not their hash values

atomic violet Feb 3, 2024, 6:15 PM

#

ornate merlin Hm.. Actually you are wrong.

I think there is a misunderstanding going on. I am not talking about hash tables in general.

Here is how comemo works: it has a big hash table for every memoized function call, keyed by the 128-bit hash of its arguments. Every time the function is called, it hashes its arguments (not quite, but let's pretend it does...) and retrieves the memoized result. The values themselves are not compared.
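
A toy version of that scheme, to make the "values are not compared" point concrete. This is not comemo's implementation: Memo, hash128, and the salting trick are hypothetical, and the 128-bit hash is faked by gluing two salted 64-bit std hashes together.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::HashMap;
use std::hash::{Hash, Hasher};

/// Stand-in for a real 128-bit hash: two salted 64-bit hashes combined.
fn hash128<T: Hash>(value: &T) -> u128 {
    let mut lo = DefaultHasher::new();
    0u8.hash(&mut lo);
    value.hash(&mut lo);
    let mut hi = DefaultHasher::new();
    1u8.hash(&mut hi);
    value.hash(&mut hi);
    ((hi.finish() as u128) << 64) | (lo.finish() as u128)
}

/// Cache keyed only by the argument hash; the arguments are never stored.
struct Memo<R> {
    cache: HashMap<u128, R>,
    misses: usize,
}

impl<R: Clone> Memo<R> {
    fn new() -> Self {
        Self { cache: HashMap::new(), misses: 0 }
    }

    fn call<A: Hash>(&mut self, args: &A, f: impl FnOnce(&A) -> R) -> R {
        let key = hash128(args);
        if let Some(hit) = self.cache.get(&key) {
            // A hit is trusted purely because the hashes match.
            return hit.clone();
        }
        self.misses += 1;
        let result = f(args);
        self.cache.insert(key, result.clone());
        result
    }
}
```

Because a hit never looks at the arguments again, a hash collision would silently return the wrong result instead of merely costing time, which is why hash width matters here.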

ornate merlin Feb 3, 2024, 6:16 PM

#

atomic violet I think there is a misunderstanding going on. I am not talking about hash tables...

Oh. My bad 😅. I thought we were talking about the hashmap implementation

atomic violet Feb 3, 2024, 6:17 PM

#

yeah, it's ok, everyone makes mistakes

#

that's why it needs 128bit hashes, 64 bit is just not enough, so a lot of hashers are basically out of question

atomic violet Feb 3, 2024, 6:37 PM

#

@sturdy sequoia once you resolve the hashing problem, can you try something? It's probably a bad idea (big 50/50... well... 5/90/5, where 90 is "does nothing"), but at least it's not too hard to check and if I am right I want to be remembered as the best software engineer to ever exist.

In vm/mod.rs, in enter_scope, try boxing the new VM before running vm.run(engine). I have a theory (not confirmed by anything, just an educated guess) that VM is too fat for the stack and it causes worse stack performance because hardware hardware blah blah blah frontend prefetchers L1 stack engine blah blah blah. Of course, preferably the same VM (and same registers) should be used everywhere, but I assume it's not an option.

That's what I was able to infer with my eyes, at least. enter_scope looks the sussiest.

sturdy sequoia Feb 3, 2024, 6:37 PM

#

atomic violet @sturdy sequoia once you resolve the hashing problem, can you try somethin...

Look into the latest commit 😉

#

I mean, VMState is still on the stack, but the registers are on the heap

#

I'll also try removing the stacker calls

#

as I have the suspicion that they're really bad 😂

atomic violet Feb 3, 2024, 6:39 PM

#

I have looked at stacker source code for a bit, it doesn't look that bad

#

it just maps more pages

atomic violet Feb 3, 2024, 6:39 PM

#

sturdy sequoia I mean, `VMState` is still on the stack, but the registers are on the heap

did it improve anything?

sturdy sequoia Feb 3, 2024, 6:40 PM

#

atomic violet did it improve anything?

yes, it immensely improved perfs

#

I also ditched the whole VM thing as it required some unsafe

#

so the VM is fully safe afaik

atomic violet Feb 3, 2024, 6:40 PM

#

omg I am a genious lol

sturdy sequoia Feb 3, 2024, 6:40 PM

#

And it reuses registers

#

for nested scopes

atomic violet Feb 3, 2024, 6:41 PM

#

yeah that's basically perfect

sturdy sequoia Feb 3, 2024, 6:41 PM

#

and registers are infinite now

atomic violet Feb 3, 2024, 6:41 PM

#

so... stack?

#

stack based VM ftw let's goo

sturdy sequoia Feb 3, 2024, 6:41 PM

#

kinda, but there is aggressive reuse of registers

#

I don't know if you saw but in the compiler there is a RAII register called RegisterGuard that frees registers for re-use as soon as possible 😎
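
For readers who have not seen the compiler, a sketch of the RAII idea. Only the name RegisterGuard comes from the message above; the allocator, the Registers handle, and the reuse policy are assumptions made for illustration.

```rust
use std::cell::RefCell;
use std::rc::Rc;

/// Hypothetical allocator state shared by all guards.
#[derive(Default)]
struct Allocator {
    free: Vec<usize>, // indices handed back by dropped guards
    next: usize,      // first index that was never handed out
}

/// Cheaply clonable handle to the allocator.
#[derive(Clone, Default)]
struct Registers(Rc<RefCell<Allocator>>);

/// RAII handle: the register is released when the guard is dropped.
struct RegisterGuard {
    index: usize,
    pool: Registers,
}

impl Registers {
    /// Reuses the most recently freed register, or takes a fresh one.
    fn allocate(&self) -> RegisterGuard {
        let mut alloc = self.0.borrow_mut();
        let index = match alloc.free.pop() {
            Some(index) => index,
            None => {
                let index = alloc.next;
                alloc.next += 1;
                index
            }
        };
        RegisterGuard { index, pool: self.clone() }
    }
}

impl Drop for RegisterGuard {
    fn drop(&mut self) {
        // Hand the index back so a later allocation can reuse it.
        self.pool.0.borrow_mut().free.push(self.index);
    }
}
```

With this shape, a temporary that goes out of scope frees its register immediately, so the total register count tracks the maximum number of live values rather than the number of expressions.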

atomic violet Feb 3, 2024, 6:42 PM

#

yeah I have not looked at the compiler, only at VM

#

I see 👁️👄👁️

sturdy sequoia Feb 3, 2024, 6:42 PM

#

Performance of eval has improved by 20% with the commit I'm about to push

#

eval = the VM in this scenario

#

my problem now is that function calls are... slooooooooooooooow

#

Much better than an hour ago

#

but too slow

#

BTW, I just pushed

atomic violet Feb 3, 2024, 6:44 PM

#

just function calls, or scopes in general?

sturdy sequoia Feb 3, 2024, 6:44 PM

#

atomic violet just function calls, or scopes in general?

calls now

#

scopes are fine since they no longer involve growing the stack, etc.

#

I KNOW HOW TO MAKE THE REGISTER TABLE SORT OF DYNAMICALLY SIZED

#

USE A SmallVec<[Value; 16]>

#

Makes small function calls cheaper by not allocating!

#

I AM A GENIUS

#

https://tenor.com/view/hahahaha-gif-27342590


#

(time to test it sunglassed_crying )

atomic violet Feb 3, 2024, 6:48 PM

#

if only there was a way to not reinitialize registers every function call...

#

hear me out...

#

ABI

#

(/s)

sturdy sequoia Feb 3, 2024, 6:48 PM

#

atomic violet ABI

ABI?

#

Wait no

#

no no nonononono

#

No

#

I objectify

cunning wadi Feb 3, 2024, 6:48 PM

#

atomic violet ABI

I like

left night Feb 3, 2024, 6:49 PM

#

sturdy sequoia USE A `SmallVec<[Value; 16]>`

I thought about that but expected that you would consider the access overhead too high.
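
A hand-rolled sketch of the small-vector idea and the access overhead in question. This is not the smallvec crate's API; RegisterTable is hypothetical, and Value is a u64 stand-in for the real 32-byte type.

```rust
const INLINE: usize = 16;

/// Stand-in for the real `Value` type.
type Value = u64;

/// Register table that stays inline for small functions and spills to the heap.
enum RegisterTable {
    /// Up to 16 registers stored inline: no heap allocation.
    Inline { slots: [Value; INLINE], len: usize },
    /// Larger tables live on the heap.
    Heap(Vec<Value>),
}

impl RegisterTable {
    fn with_len(len: usize) -> Self {
        if len <= INLINE {
            RegisterTable::Inline { slots: [0; INLINE], len }
        } else {
            RegisterTable::Heap(vec![0; len])
        }
    }

    /// Every register access has to go through this branch first.
    fn slots_mut(&mut self) -> &mut [Value] {
        match self {
            RegisterTable::Inline { slots, len } => &mut slots[..*len],
            RegisterTable::Heap(values) => values.as_mut_slice(),
        }
    }

    fn is_inline(&self) -> bool {
        matches!(self, RegisterTable::Inline { .. })
    }
}
```

The saved allocation applies once per call, while the inline-or-heap branch applies on each access, so whether it pays off depends on how many register accesses a typical call performs.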

sturdy sequoia Feb 3, 2024, 6:49 PM

#

left night I thought about that but expected that you would consider the access overhead to...

Yeah, there are literally zero gains

#

I have reached the point where eval is about 5x faster than main

#

But I was honestly expecting a 10x 😦

#

I guess eval is not as slow as I thought it was angryeyes

#

And the impact on my thesis (in spite of tablex) is surprisingly low

cunning wadi Feb 3, 2024, 6:56 PM

#

I assume that's because most of the time is spent calling short functions that aren't much more than glue code

#

and well, function calls aren't that fast it seems?

atomic violet Feb 3, 2024, 6:59 PM

#

~~inline them~~

sturdy sequoia Feb 3, 2024, 7:01 PM

#

cunning wadi and well, function calls aren't that fast it seems?

indeed

sturdy sequoia Feb 3, 2024, 7:01 PM

#

atomic violet ~~inline them~~

I am seriously considering it

#

it's almost trivial with the current design 🤔

#

I just don't know how I would deal with arguments (checking them)

#

but I could only do it when arguments are "simple"

#

(no spread, etc.)

#

Ok, I am now working on improving loop performance by removing allocations and dynamic dispatch

sturdy sequoia Feb 3, 2024, 8:08 PM

#

Fun fact, using #[repr(packed)] on opcodes does improve performance by a very measurable margin!
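
A small illustration of what #[repr(packed)] changes. The instruction layout is hypothetical (a one-byte opcode plus a 32-bit operand), not the VM's real opcode format.

```rust
use std::mem::size_of;

/// Default layout: padding keeps `operand` aligned to 4 bytes.
#[allow(dead_code)]
struct Padded {
    opcode: u8,
    operand: u32,
}

/// Packed layout: no padding, fields may be unaligned.
#[allow(dead_code)]
#[repr(packed)]
struct Packed {
    opcode: u8,
    operand: u32,
}

/// Bytes needed for `count` instructions of type `T` stored back to back.
fn stream_bytes<T>(count: usize) -> usize {
    count * size_of::<T>()
}
```

Here the padded form takes 8 bytes per instruction and the packed form 5, so the same instruction stream occupies less memory. Packed fields have to be copied out by value, since taking a reference to an unaligned field is rejected by the compiler.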

lunar kettle Feb 3, 2024, 8:31 PM

#

what's the current status on how much faster your thesis compiles? 👀

sly pecan Feb 3, 2024, 8:37 PM

#

lunar kettle what's the current status on how much faster your thesis compiles? 👀

1 millisecond

left night Feb 3, 2024, 8:51 PM

#

I think what would be very interesting just to get a feeling for what is and isn't possible here is to write a reasonably complex evaluation-heavy script in Typst and Python and compare speed. I'd be curious which orders of magnitude of difference we're talking about here.

sturdy sequoia Feb 3, 2024, 9:24 PM

#

lunar kettle what's the current status on how much faster your thesis compiles? 👀

3% iirc

sly pecan Feb 3, 2024, 9:27 PM

#

sturdy sequoia 3% iirc

https://tenor.com/view/jeremy-clarkson-speed-speed-and-power-gif-27212846


sturdy sequoia Feb 3, 2024, 10:39 PM

#

sly pecan https://tenor.com/view/jeremy-clarkson-speed-speed-and-power-gif-27212846

I'm honestly quite disappointed 'cause it means that (simple rule of three) eval was 3% of the total execution time

#

😭

feral imp Feb 3, 2024, 10:41 PM

#

Are you still working at it? Or do you think this current version does indeed reflect what's possible with a VM?

sturdy sequoia Feb 3, 2024, 10:41 PM

#

Wait my math is wrong

#

who cares, eval is not nearly as much as I wanted angryeyes
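
For reference, the share can be estimated with Amdahl's law instead of a rule of three. This sketch assumes the "5x faster eval" and "3% faster thesis" figures refer to the same workload, which the log does not confirm; the function name is made up.

```rust
/// Fraction of the original run time spent in a component, given that making
/// the component `speedup` times faster reduced total time by `saving`
/// (0.03 means 3%). From Amdahl's law: saving = fraction * (1 - 1/speedup).
fn component_fraction(saving: f64, speedup: f64) -> f64 {
    saving / (1.0 - 1.0 / speedup)
}
```

With a 3% saving and a 5x speedup this gives 0.0375, i.e. eval would have been about 3.75% of the total; exactly 3% would only follow if eval had become free.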

sturdy sequoia Feb 3, 2024, 10:41 PM

#

feral imp Are you still working at it? Or do you think this current version does indeed re...

There's definitely room for improvement

#

I think closure calls are wayyyyyyyy too expensive for what they do

sturdy sequoia Feb 3, 2024, 10:43 PM

#

left night I think what would be very interesting just to get a feeling for what is and isn...

atomic violet Feb 3, 2024, 10:48 PM

#

left night I think what would be very interesting just to get a feeling for what is and isn...

I can do this tomorrow

#

https://tenor.com/view/oogway-my-time-has-come-gif-8019684


sturdy sequoia Feb 3, 2024, 10:48 PM

#

I removed the eval module

#

and it gives an idea of just how MASSIVE this PR is

#

💀💀💀

feral imp Feb 3, 2024, 10:50 PM

#

It's a lot. But there is no way that improvement from now on won't entail massive efforts like this....

sturdy sequoia Feb 3, 2024, 11:07 PM

#

feral imp It's a lot. But there is no way that improvement from now on won't entail massiv...

To be fair, there's also the fact that my thesis is just... not that heavy in the evaluation

#

I thought it was bigger than that

sly pecan Feb 3, 2024, 11:12 PM

#

sturdy sequoia I'm honestly quite disappointed 'cause it means that (simple rule of three) eval...

Could be more in other documents?

feral imp Feb 3, 2024, 11:13 PM

#

sly pecan Could be more in other documents?

Cetz heavy docs must use eval...

sturdy sequoia Feb 3, 2024, 11:15 PM

#

sly pecan Could be more in other documents?

most likely

#

anything with CetZ as @feral imp said

untold turret Feb 4, 2024, 12:04 AM

#

@sturdy sequoia A quite important key for heavy computing: would you like to do scalar specialization? I mean you store floats in 32-byte registers but they could be only 8 bytes, and importantly you may be doing a Value::Float::add/mul/sub/div dispatch per instruction, which is very expensive.
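
A sketch of the contrast being described. The Value enum is heavily simplified and the specialized add_float instruction is hypothetical; how the VM would know that both operands are floats is exactly the open question raised in the replies.

```rust
/// Simplified dynamic value; the real `Value` has many more variants.
#[derive(Debug, Clone, PartialEq)]
enum Value {
    Int(i64),
    Float(f64),
    Str(String),
}

/// Generic add: inspects both tags on every single call.
fn add_dynamic(lhs: &Value, rhs: &Value) -> Result<Value, String> {
    match (lhs, rhs) {
        (Value::Int(a), Value::Int(b)) => Ok(Value::Int(a + b)),
        (Value::Float(a), Value::Float(b)) => Ok(Value::Float(a + b)),
        (Value::Int(a), Value::Float(b)) => Ok(Value::Float(*a as f64 + b)),
        (Value::Float(a), Value::Int(b)) => Ok(Value::Float(a + *b as f64)),
        (Value::Str(a), Value::Str(b)) => Ok(Value::Str(format!("{a}{b}"))),
        _ => Err("cannot add these values".to_string()),
    }
}

/// Specialized instruction: operands are already known to be floats and live
/// in a dense register file with 8 bytes per slot, so no tag checks happen.
fn add_float(regs: &mut [f64], dst: usize, lhs: usize, rhs: usize) {
    regs[dst] = regs[lhs] + regs[rhs];
}
```

The specialized path is only valid once something has established the operand types, for example a guard executed before entering a float-only loop.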

sturdy sequoia Feb 4, 2024, 12:05 AM

#

untold turret @sturdy sequoia A quite important key for heavy computing. Would you like ...

I don't have type information 😐

untold turret Feb 4, 2024, 12:05 AM

#

you can do it at runtime

#

since typst doesn't have runtime, all of them are at compile time.

sturdy sequoia Feb 4, 2024, 12:10 AM

#

untold turret since typst doesn't have runtime, all of them are at compile time.

hmm

#

I wonder if that would really help

untold turret Feb 4, 2024, 12:13 AM

#

I'm just pointing out why such a VM doesn't quite help the heavy cetz docs.

sturdy sequoia Feb 4, 2024, 12:13 AM

#

untold turret I'm just pointing out why such a VM doesn't quite help the heavy cetz docs.

maybe you're right, I just don't know how I'd do it 🤔

#

I had to manually rebase 47 times in a row for some reason

#

angryeyes

#

The VM was broken by that

#

😭

glad urchin Feb 4, 2024, 12:15 AM

#

did you not keep a backup branch?

sturdy sequoia Feb 4, 2024, 12:18 AM

#

glad urchin did you not keep a backup branch?

😐

proven umbra Feb 4, 2024, 12:18 AM

#

Git stores a rev pre rebase for you.

#

Search git reflog. All actions are logged there, and as long as you do not git gc, git keeps "backup" references for you.

untold turret Feb 4, 2024, 12:23 AM

#

sturdy sequoia maybe you're right, I just don't know how I'd do it 🤔

🤔 I think this is a JIT way to get native code performance in a VM, which is very hard. But there might be a way: we embed wasmtime and leverage it to do the JIT by emitting wasm code to wasmtime.

#

https://github.com/bytecodealliance/wasmtime/tree/main/cranelift

#

🐱 @sturdy sequoia Forget the special cases. VM in parallel with rayon should benefit to both local typst and web typst. That may also beat main typst.

sturdy sequoia Feb 4, 2024, 12:34 AM

#

@proven umbra it's funny but my VM is finding bugs in CetZ

#

With some variables being used that don't exist 😂

proven umbra Feb 4, 2024, 12:37 AM

#

😄 Oh…

#

You have lines/functions so I can look to fix that?

sturdy sequoia Feb 4, 2024, 12:39 AM

#

untold turret Feb 4, 2024, 12:41 AM

#

I believe align can quite benefit from jit.

sturdy sequoia Feb 4, 2024, 12:41 AM

#

untold turret I believe `align` can quite benefit from jit.

align?

#

Anyway, I'm off to bed

#

gn everybody ❤️

untold turret Feb 4, 2024, 12:41 AM

#

In the picture

sturdy sequoia Feb 4, 2024, 12:42 AM

#

ah right

untold turret Feb 4, 2024, 12:42 AM

#

gn

sturdy sequoia Feb 4, 2024, 12:42 AM

#

makes sense

atomic violet Feb 4, 2024, 1:01 PM

#

oh-oh @sturdy sequoia

#

hopefully it's reproducible

📎 raytracer.typ

#

oh wait, it's just stack overflow, false alarm

feral imp Feb 4, 2024, 1:06 PM

#

😛

#

Does it work on main?

atomic violet Feb 4, 2024, 1:06 PM

#

just need a recursion limit and it should all be ok
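
A minimal sketch of such a limit, assuming the VM tracks call depth itself. Vm, call, and the limit of 64 are hypothetical and not taken from the branch under discussion.

```rust
/// Hypothetical limit; the value is arbitrary.
const MAX_CALL_DEPTH: usize = 64;

struct Vm {
    depth: usize,
}

impl Vm {
    /// Runs `body` one call level deeper, or fails once the limit is reached.
    fn call<T>(
        &mut self,
        body: impl FnOnce(&mut Vm) -> Result<T, String>,
    ) -> Result<T, String> {
        if self.depth >= MAX_CALL_DEPTH {
            return Err("maximum function call depth exceeded".to_string());
        }
        self.depth += 1;
        let result = body(self);
        self.depth -= 1;
        result
    }
}

/// Unbounded recursion, as a runaway script would produce.
fn recurse_forever(vm: &mut Vm) -> Result<(), String> {
    vm.call(recurse_forever)
}
```

With the check in place, runaway recursion surfaces as an ordinary error value instead of overflowing the native stack.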

atomic violet Feb 4, 2024, 1:06 PM

#

feral imp Does it work on `main`?

yeah it does

atomic violet Feb 4, 2024, 5:34 PM

#

@sturdy sequoia ok a real bug this time, patterns without let don't compile

#let (x, y) = (1, 2)
#(y, x) = (x, y)

#

probably covered by tests but still want to make sure

atomic violet Feb 4, 2024, 7:02 PM

#

Oh no... typst is soo slooooowwww..... 😭😭😭😭😭😭

#

I have disabled memoization for closure::eval to make sure it won't just get OOM'ed..

#

and it crunches rays for like an hour and 20 minutes now

#

and python just does the whole thing in less than a minute

#

😭😭

#

Lemme check main

atomic violet Feb 4, 2024, 7:05 PM

#

atomic violet and it crunches rays for like an hour and 20 minutes now

oh it finished

feral imp Feb 4, 2024, 7:05 PM

#

main is fast?

atomic violet Feb 4, 2024, 7:06 PM

#

vm with no closure memoization is 1:22:xx

#

main will OOM

#

python is like... a minute... haven't measured yet, it's pretty fast

#

I will check for a smaller resolution now

#

📎 img2.json 📎 raytracer.py 📎 raytracer.typ

#

oh interesting

#

VM is slower, apparently

#

64.14 secs - VM, 34.24- main, and coughs 2.61 - python

sly pecan Feb 4, 2024, 7:11 PM

#

Python is a low bar too

atomic violet Feb 4, 2024, 7:11 PM

#

compiling VM again without my hacks to make sure it's not me

atomic violet Feb 4, 2024, 7:12 PM

#

sly pecan Python is a low bar too

well... tbf, I would be impressed if typst got to like 1/3 of python

#

because duh, everyone says python is slow, but it doesn't mean it's not optimized

#

it does have a good untyped VM

atomic violet Feb 4, 2024, 7:13 PM

#

atomic violet compiling VM again without my hacks to make sure it's not me

actually, my raytracer may rely on comemo in one place, so I assume it may not count...

#

but also...

#

python does not have comemo at all, so it's still bad

#

damn, my code is so shit I can't even fix it properly...

#

the render itself is pretty dope though

atomic violet Feb 4, 2024, 7:18 PM

#

atomic violet 64.14 secs - VM, 34.24- main, and *coughs* 2.61 - python

ok, VM 39.35

#

yeah, memoization helps in one place...

feral imp Feb 4, 2024, 7:18 PM

#

atomic violet the render itself is pretty dope though

God I love that you're here..

sturdy sequoia Feb 5, 2024, 9:02 AM

#

atomic violet main will OOM

really? 😱

sturdy sequoia Feb 5, 2024, 9:03 AM

#

atomic violet 64.14 secs - VM, 34.24- main, and *coughs* 2.61 - python

W-A-T????

#

😱

sturdy sequoia Feb 5, 2024, 9:03 AM

#

atomic violet yeah, memoization helps in one place...

where did you add memoization?

atomic violet Feb 5, 2024, 9:04 AM

#

sturdy sequoia where did you add memoization?

the evaluation of the closure

#

I previously removed it

#

it would not have finished otherwise

sturdy sequoia Feb 5, 2024, 9:04 AM

#

hmmm

#

And the VM is slower?

#

That's extremely disappointing

atomic violet Feb 5, 2024, 9:04 AM

#

atomic violet ok, VM 39.35

looks like it

#

but I have not profiled it

sturdy sequoia Feb 5, 2024, 9:05 AM

#

Guess I'll do just that

#

missa sad

#

@atomic violet are there lots of small closures?

#

Because I have the suspicion that freaking closures are too goddamn slow

atomic violet Feb 5, 2024, 9:06 AM

#

sturdy sequoia @atomic violet are there lots of small closures?

so, generally, I tried to minimize them

#

and I did pretend to be a guy who tries to optimize a lot in my code

#

(without measuring anything)

sturdy sequoia Feb 5, 2024, 9:07 AM

#

atomic violet (without measuring anything)

that's how we do 😎

atomic violet Feb 5, 2024, 9:07 AM

#

and there are a bunch of small functions, but I try to not call them in hot spots

#

there is a lot of spreading

#

and destructuring

#

that's basically the main way of moving data around there

#

and as a consequence a lot of functions have a lot of parameters

sturdy sequoia Feb 5, 2024, 9:07 AM

#

I have the suspicion that I implemented spreading 💩-ily while trying to make it faster 💀

sturdy sequoia Feb 5, 2024, 9:08 AM

#

atomic violet and destructuring

I'm unsure whether destructuring is any fast atm

atomic violet Feb 5, 2024, 9:13 AM

#

The hottest functions should be m33mul, intersect, cast-ray and cast-ray-through-transparent, and a few closures for rendering heart, typst guy, and tiled floor. Maybe it would be useful to run a sampling profiler on typst and on python to compare where the slowness is likely to occur

sturdy sequoia Feb 5, 2024, 9:45 AM

#

atomic violet The hottest functions should be `m33mul`, `intersect`, `cast-ray` and `cast-ray-...

I'll run it on my end and see what's going on

#

@left night I'm having a pretty big issue, for some reason, calling Route::contains causes a stack overflow 💀

#

Do you know what could be causing this?

#

#

Here is an example from a stack trace

#

it goes on and on and on and on

#

I think I figured it out!

#

The check for cyclic evaluation was inside of a memoized function instead of the top level and that was the issue, maybe? 🤔

#

Yes indeed, that was it; now it just panics 💀

#

I had forgotten to check for cyclic import which is now fixed

left night Feb 5, 2024, 10:20 AM

#

sturdy sequoia The check for cyclic evaluation was inside of a memoized function instead of the...

A check for cyclic evaluation inside of a memoized function should be fine. That's the whole point of the Route being Tracked. The check is recursive itself.
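For readers following along, the recursive check described here can be sketched roughly as below. This is a simplified stand-in and not Typst's actual `Route` (which is tracked by comemo): the `u32` file id, `root`, `extend`, and the `import` helper are all assumptions made up for illustration.

```rust
/// Hypothetical, simplified route: a linked chain of the files that are
/// currently being evaluated. Not Typst's real `Route` type.
pub struct Route<'a> {
    outer: Option<&'a Route<'a>>,
    id: u32, // stand-in for a real file id
}

impl<'a> Route<'a> {
    /// Start a chain at the main file.
    pub fn root(id: u32) -> Self {
        Route { outer: None, id }
    }

    /// Extend the chain with a file that is about to be evaluated.
    pub fn extend(&'a self, id: u32) -> Route<'a> {
        Route { outer: Some(self), id }
    }

    /// Recursive check: is `id` already somewhere on the chain?
    pub fn contains(&self, id: u32) -> bool {
        self.id == id || self.outer.map_or(false, |outer| outer.contains(id))
    }
}

/// Check before descending into the import. If the check is skipped at the
/// stage that actually recurses, a cyclic import keeps recursing until the
/// stack overflows.
pub fn import(route: &Route<'_>, target: u32) -> Result<(), String> {
    if route.contains(target) {
        return Err(format!("cyclic import of file {}", target));
    }
    let _inner = route.extend(target);
    // ... evaluate the imported file with `_inner` as its route ...
    Ok(())
}
```

The point of the sketch is only the ordering: the `contains` check has to run on every path that can recurse, not just in one stage.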

sturdy sequoia Feb 5, 2024, 10:20 AM

#

left night A check for cyclic evaluation inside of a memoized function should be fine. That...

yes but here it was actually recursively importing

#

but it wasn't actually detected

#

because it was only being done in the compilation stage

left night Feb 5, 2024, 10:21 AM

#

ah okay

sturdy sequoia Feb 5, 2024, 10:21 AM

#

it was a bit weird, but there are still a few things I haven't ported over

left night Feb 5, 2024, 10:21 AM

#

Regarding performance improvements, I wonder how much of the time is actually spent on walking the AST etc (where an instruction-based VM should be faster) and how much is spent on "runtime" things like ops::join, inside functions operating on Values etc.
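To make the distinction concrete, here is a toy sketch, not Typst's evaluator or VM: `Expr`, `Instr`, `eval`, `run`, and this `join` are hypothetical names. Both evaluators dispatch differently, but both end up in the same "runtime" function, so a faster dispatch loop only helps with the part of the time that is not spent there.

```rust
/// Shared "runtime" work: both evaluators end up calling this.
pub fn join(a: &str, b: &str) -> String {
    format!("{}{}", a, b)
}

/// Tiny AST for a tree-walking evaluator.
pub enum Expr {
    Str(String),
    Join(Box<Expr>, Box<Expr>),
}

/// Tree walk: recursion and pointer chasing around each `join`.
pub fn eval(expr: &Expr) -> String {
    match expr {
        Expr::Str(s) => s.clone(),
        Expr::Join(lhs, rhs) => join(&eval(lhs), &eval(rhs)),
    }
}

/// Flat, register-based instructions for the same kind of program.
pub enum Instr {
    Load { dst: usize, value: String },
    Join { dst: usize, lhs: usize, rhs: usize },
}

/// Instruction loop: linear dispatch, but the same `join` calls.
pub fn run(code: &[Instr], registers: usize, result: usize) -> String {
    let mut regs = vec![String::new(); registers];
    for instr in code {
        match instr {
            Instr::Load { dst, value } => regs[*dst] = value.clone(),
            Instr::Join { dst, lhs, rhs } => {
                regs[*dst] = join(&regs[*lhs], &regs[*rhs]);
            }
        }
    }
    // Move the result out of its register instead of cloning it.
    std::mem::take(&mut regs[result])
}
```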

sturdy sequoia Feb 5, 2024, 10:22 AM

#

left night Regarding performance improvements, I wonder how much of the time is actually sp...

that's probably the issue indeed

#

this is where having types would be useful because I could pre-allocate a lot

left night Feb 5, 2024, 10:23 AM

#

We should probably figure out a way to profile that more precisely

sturdy sequoia Feb 5, 2024, 10:23 AM

#

left night We should probably figure out a way to profile that more precisely

yes, but adding #[time] everywhere would be ungodly 😂

left night Feb 5, 2024, 10:23 AM

#

I fear that time would have too much overhead, too

sturdy sequoia Feb 5, 2024, 10:23 AM

#

@atomic violet on my end, your raytracer takes 18s on the VM, I'll try on main now

sturdy sequoia Feb 5, 2024, 10:23 AM

#

left night I fear that time would have too much overhead, too

yes, it's handy to get a rough idea, but not good for fine grained stuff

atomic violet Feb 5, 2024, 10:24 AM

#

sturdy sequoia yes, but adding `#[time]` everywhere would be ungodly 😂

Performance counters!

#

well, it's not that simple tbf 🤔

#

Maybe crunch it through callgrind? Count instructions instead of actual time?

left night Feb 5, 2024, 10:25 AM

#

You could have, just for some quick tests, a cheaper version of #[time], that measures just one thing at once and its global effect: Effectively, a global mutable static that is increased by the time spent in a single leaf operation every time it runs. And then compare that with the full time spent. This might work better than a flamegraph at showing the full effect of many small leaf calls.
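A minimal sketch of that idea, assuming an atomic counter in place of a literal `static mut`. None of these names exist in Typst: `LEAF_NANOS`, `LeafTimer`, `leaf_op`, and `measure` are hypothetical.

```rust
use std::sync::atomic::{AtomicU64, Ordering};
use std::time::Instant;

/// Total nanoseconds spent in the one leaf operation under investigation.
pub static LEAF_NANOS: AtomicU64 = AtomicU64::new(0);

/// Guard that adds its own lifetime to `LEAF_NANOS` when dropped.
pub struct LeafTimer(Instant);

impl LeafTimer {
    pub fn start() -> Self {
        LeafTimer(Instant::now())
    }
}

impl Drop for LeafTimer {
    fn drop(&mut self) {
        let nanos = self.0.elapsed().as_nanos() as u64;
        LEAF_NANOS.fetch_add(nanos, Ordering::Relaxed);
    }
}

/// Hypothetical leaf operation standing in for something like a join.
pub fn leaf_op(a: &str, b: &str) -> String {
    let _timer = LeafTimer::start();
    let mut out = String::with_capacity(a.len() + b.len());
    out.push_str(a);
    out.push_str(b);
    out
}

/// Runs the leaf `n` times and returns (leaf nanoseconds, total nanoseconds).
pub fn measure(n: usize) -> (u64, u64) {
    LEAF_NANOS.store(0, Ordering::Relaxed);
    let start = Instant::now();
    for _ in 0..n {
        std::hint::black_box(leaf_op("foo", "bar"));
    }
    let total = start.elapsed().as_nanos() as u64;
    (LEAF_NANOS.load(Ordering::Relaxed), total)
}
```

Comparing the two numbers returned by `measure` gives the share of wall time attributable to the leaf. Running the same workload with the timer line removed shows how much the instrumentation itself costs.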

#

I am not sure how much overhead Instant::now() incurs, but it would be easy to check whether this kind of instrumentation increases the runtime significantly.
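One quick way to check this is to time a tight loop of `Instant::now()` calls. The function name `instant_now_cost` is made up for this sketch, and the result will vary by platform and clock source.

```rust
use std::time::{Duration, Instant};

/// Rough average cost of a single `Instant::now()` call.
pub fn instant_now_cost(iterations: u32) -> Duration {
    assert!(iterations > 0, "need at least one iteration");
    let start = Instant::now();
    for _ in 0..iterations {
        // `black_box` keeps the optimizer from discarding the call.
        std::hint::black_box(Instant::now());
    }
    start.elapsed() / iterations
}
```

If the per-call cost is small compared to the leaf operation being timed, the instrumentation should not distort the measurement much.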

sturdy sequoia Feb 5, 2024, 10:26 AM

#

left night I am not sure how much overhead `Instant::now()` incurs, but it would be easy to...

Instant::now is super slow, but if we accept having a linux only version in the CLI (gated with a platform flag) we could do that super cheaply

#

linux and macos *

sturdy sequoia Feb 5, 2024, 10:27 AM

#

left night You could have, just for some quick tests, a cheaper version of `#[time]`, that ...

I agree, it has a specific name too in the nomenclature but I forgot what it is

#

and time could just generate a per-function counter

left night Feb 5, 2024, 10:27 AM

#

I was more thinking about annotating a function of interest with a special attribute just temporarily

#

Rather than actually committing that to the code base

sturdy sequoia Feb 5, 2024, 10:27 AM

#

ah right

left night Feb 5, 2024, 10:27 AM

#

So it doesn't matter whether it's linux/mac only

sturdy sequoia Feb 5, 2024, 10:28 AM

#

@atomic violet is that the expected result?

#

#

or is my VM broken 💀

left night Feb 5, 2024, 10:28 AM

#

left night I was more thinking about annotating a function of interest with a special attri...

At least for starters

atomic violet Feb 5, 2024, 10:28 AM

#

sturdy sequoia

looks good

left night Feb 5, 2024, 10:29 AM

#

poor Typst guy's eyes

sturdy sequoia Feb 5, 2024, 10:29 AM

#

left night At least for starters

I'll maybe try that

sturdy sequoia Feb 5, 2024, 10:29 AM

#

left night poor Typst guy's eyes

ikr

#

with main it runs in 9.7s versus 19s on the VM

#

😭

atomic violet Feb 5, 2024, 10:29 AM

#

atomic violet

you are running this right?

sturdy sequoia Feb 5, 2024, 10:30 AM

#

atomic violet you are running this right?

~~yes~~

#

no

#

I'm running the gist

atomic violet Feb 5, 2024, 10:30 AM

#

oh, yeah

#

gist is a bit different, but should be fine

atomic violet Feb 5, 2024, 10:30 AM

#

atomic violet you are running this right?

I made a bug in that one

sturdy sequoia Feb 5, 2024, 10:30 AM

#

What hints to me that the VM is doing too much work is the size of the timings files

#

with main they're about half as big

#

but the VM shouldn't be doing any more timed operations

#

🤔

#

which makes me think it's somehow running the code... twice?

atomic violet Feb 5, 2024, 10:32 AM

#

here is the equivalent unbugged python version

📎 raytracer.py

#

just in case

left night Feb 5, 2024, 10:32 AM

#

atomic violet 64.14 secs - VM, 34.24- main, and *coughs* 2.61 - python

so was this the final verdict? Python is 13x faster?

atomic violet Feb 5, 2024, 10:33 AM

#

Let's see on that version

left night Feb 5, 2024, 10:33 AM

#

because, if yes, I would be pretty happy with that

atomic violet Feb 5, 2024, 10:35 AM

#

2.14 python, 29.84 VM, 24.75 - ~~main~~ 0.10

#

I am a dum dum

#

I was calling 0.10 "main" this whole time 🤦‍♂️

atomic violet Feb 5, 2024, 10:35 AM

#

left night so was this the final verdict? Python is 13x faster?

so yeah, about that

feral imp Feb 5, 2024, 10:36 AM

#

How about main main?

atomic violet Feb 5, 2024, 10:37 AM

#

one sec lemme build it

sturdy sequoia Feb 5, 2024, 10:39 AM

#

That's main

#

That's the VM

#

what the heck is going on at the end

atomic violet Feb 5, 2024, 10:40 AM

#

it's chillin' after a hard day of work

#

it's fine, give it a break

sturdy sequoia Feb 5, 2024, 10:41 AM

#

as far as I can tell it's this code

#

I guess looping is very slow?

atomic violet Feb 5, 2024, 10:42 AM

#

It may be joining?..

sturdy sequoia Feb 5, 2024, 10:43 AM

#

hmmm

atomic violet Feb 5, 2024, 10:43 AM

#

because that's the only place where content is joined iirc

sturdy sequoia Feb 5, 2024, 10:43 AM

#

that makes sense

atomic violet Feb 5, 2024, 10:43 AM

#

atomic violet 2.14 python, 29.84 VM, 24.75 - ~~main~~ 0.10

main main is 16.74 👀

#

oh wait

#

usr time: main - 14.83 vs 0.10 - 16.87

#

wth was it doing for 7 seconds?...

left night Feb 5, 2024, 10:46 AM

#

atomic violet main main is 16.74 👀

so Python is just 8x faster? 👀

atomic violet Feb 5, 2024, 10:46 AM

#

seems like it

feral imp Feb 5, 2024, 10:47 AM

#

left night so Python is just 8x faster? 👀

if that's true then typst is doing pretty good.
I love how the comparison is to latex, but this is a true behemoth to measure up to.

atomic violet Feb 5, 2024, 10:48 AM

#

I would need a proper measuring procedure to name a standard deviation ofc, but the order seems to be about ~6-9x
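A small sketch of what such a procedure would compute from repeated runs. The helper names `mean_and_std_dev` and `slowdown` are assumptions for illustration, and no extra timings are implied beyond the single runs quoted in this thread.

```rust
/// Mean and sample standard deviation of a set of timings (in seconds).
/// Returns `None` when there are fewer than two samples.
pub fn mean_and_std_dev(samples: &[f64]) -> Option<(f64, f64)> {
    if samples.len() < 2 {
        return None;
    }
    let n = samples.len() as f64;
    let mean = samples.iter().sum::<f64>() / n;
    // Sample variance: divide by n - 1 rather than n.
    let variance = samples
        .iter()
        .map(|x| (x - mean).powi(2))
        .sum::<f64>()
        / (n - 1.0);
    Some((mean, variance.sqrt()))
}

/// How many times slower `slow` is than `fast`.
pub fn slowdown(slow: f64, fast: f64) -> f64 {
    slow / fast
}
```

With the single runs quoted above, `slowdown(16.74, 2.14)` comes to roughly 7.8, which sits inside the ~6-9x range.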

atomic violet Feb 5, 2024, 10:48 AM

#

feral imp if that's _true_ then typst is doing pretty good. I love how the comparison is t...

I am NOT rewriting that shit to latex

left night Feb 5, 2024, 10:49 AM

#

atomic violet I would need a proper measuring procedure to name a standard deviation ofc, but ...

that's honestly much faster than I would have expected

atomic violet Feb 5, 2024, 10:49 AM

#

What does 0.10 spend 7 seconds of system time doing? Writing a pdf?

atomic violet Feb 5, 2024, 10:50 AM

#

atomic violet What does 0.10 spend 7 seconds of system time doing? Writing a pdf?

and it's so consistent as well: 7.09 - 7.5 seconds

sturdy sequoia Feb 5, 2024, 10:50 AM

#

@atomic violet so I think you're probably right, it's probably joining that's so goddamn slow

#

https://tenor.com/view/why-huh-but-why-gif-13199396


sturdy sequoia Feb 5, 2024, 11:10 AM

#

Ok I do think that looping was abnormally slow

#

I think I just fixed it

#

time to see

lunar kettle Feb 5, 2024, 11:17 AM

#

so?

#

You can’t just leave us hanging like that!

sturdy sequoia Feb 5, 2024, 11:26 AM

#

lunar kettle You can’t just leave us hanging like that!

it improved by like... 100ms

#

😄

sly pecan Feb 5, 2024, 11:27 AM

#

left night so Python is just 8x faster? 👀

is the python code doing the exact same thing though?

atomic violet Feb 5, 2024, 11:27 AM

#

sly pecan is the python code doing the exact same thing though?

the result is identical

#

it should do the exact same thing

sly pecan Feb 5, 2024, 11:28 AM

#

I'm not talking about the result

atomic violet Feb 5, 2024, 11:28 AM

#

unless github copilot translated it wrong and I missed it

sturdy sequoia Feb 5, 2024, 11:28 AM

#

Ok so it's not joining that is slow

#

Then I have no fucking idea what is slow

#

😭

atomic violet Feb 5, 2024, 11:28 AM

#

and considering that even unused variables got translated, it should be the same

sturdy sequoia Feb 5, 2024, 11:28 AM

#

it must be function calls?

sly pecan Feb 5, 2024, 11:28 AM

#

sturdy sequoia Then I have no fucking idea what is slow

Take a break my dude. You can come back to it later

sturdy sequoia Feb 5, 2024, 11:29 AM

#

sly pecan Take a break my dude. You can come back to it later

https://tenor.com/view/star-wars-kylo-ren-i-know-what-i-have-to-do-i-know-what-i-must-do-adam-driver-gif-26011680


lunar kettle Feb 5, 2024, 11:29 AM

#

Just to make sure you are running in release mode right? 😂

sturdy sequoia Feb 5, 2024, 11:29 AM

#

lunar kettle Just to make sure you are running in release mode right? 😂

angrythunk

#

OF COURSE

atomic violet Feb 5, 2024, 11:30 AM

#

what's release mode?

#

(jk)

sturdy sequoia Feb 5, 2024, 12:01 PM

#

So, I have narrowed it down to function calls

#

Now what I don't know is if it's native function calls

#

(i.e in the global scope)

#

or VM-ed function calls

#

which I assume are at fault

#

angryeyes

#

wait no

#

function calls are fast

sturdy sequoia Feb 5, 2024, 12:43 PM

#

Ok so somehow reading values from the VM causes hashing

#

I have no idea why

#

So, as it turns out, it appears that accessing global values is the slow bit

lunar kettle Feb 5, 2024, 12:47 PM

#

is it easy to fix?

sturdy sequoia Feb 5, 2024, 12:48 PM

#

I don't know yet

sturdy sequoia Feb 5, 2024, 1:25 PM

#

For some reason it hashes like crazy

#

but even my attempts at improving it don't help

#

angryeyes

#

ok so accesses are indeed the slow part

#

for some reason

#

angryeyes

atomic violet Feb 5, 2024, 1:43 PM

#

let me callgrind it

#

btw, I don't know how comemo works, but if there is a function like

#let f(big-arr) = {
  big-arr.at(0) = big-arr.at(1)
  big-arr
}

how is it memoized exactly? What are the constraints here?

sturdy sequoia Feb 5, 2024, 1:45 PM

#

I mean, I did identify the slow part of the code

#

I just... don't know why it's slow

#

it's access.rs in the typst::vm

#

I did add a typst_macros::time to it and it's indeed extremely slow

#

but I really don't know why

#

I've tried to bring in some optimizations to no avail

#

I think I'll need to rework the entire Access system because it's just bad

#

but I don't know how other VMs handle mutable field access

#

if anybody knows I'm all ears

left night Feb 5, 2024, 1:47 PM

#

atomic violet btw, I don't know how comemo works, but if there is a function like ``` #let f(b...

just naive caching with copy-on-write. so it will do full clones here because there are multiple references.

#

it would be great if it could somehow optimize that of course, but it doesn't atm
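The copy-on-write behavior can be illustrated with the standard library. This sketch uses `Arc<Vec<i64>>` as a stand-in for a reference-counted array and does not model Typst's array type or the comemo cache; `write_first` is a hypothetical helper.

```rust
use std::sync::Arc;

/// Mimics `big-arr.at(0) = big-arr.at(1)` on a copy-on-write array.
/// Returns whether the write had to clone the backing storage.
pub fn write_first(arr: &mut Arc<Vec<i64>>) -> bool {
    let before = Arc::as_ptr(arr);
    // `make_mut` mutates in place when this is the only reference and
    // clones the whole vector when another reference exists.
    let inner = Arc::make_mut(arr);
    inner[0] = inner[1];
    before != Arc::as_ptr(arr)
}
```

When a second handle to the same vector is alive (for example one held by a cache or by the caller), `write_first` returns `true` because the entire vector was copied for a one-element write. With a unique handle it returns `false`.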

atomic violet Feb 5, 2024, 1:48 PM

#

yeah it will be hard to optimize

#

I think either comemo just doesn't memoize this, or it has some sort of "reversible constraints" or something...

atomic violet Feb 5, 2024, 2:08 PM

#

to optimize you need to profile, to profile you need debug info, to have debug info you need to not optimize

#

literally the worst part of software engineering

#

feral imp Feb 5, 2024, 2:10 PM

#

While I welcome memes, it seems like a great indicator that you guys should take a break. If you continue, I'd highly advise outlining low hanging fruit (even if there isn't one!) and going for that. Memes are a pathway to burnout 💀

#

But I'm just a goose anyways...

atomic violet Feb 5, 2024, 2:12 PM

#

dherse should, I have not done anything yet lol, I am just trying to find suspicious hashes but the <cycle 3> ruins my attempts 😂

sturdy sequoia Feb 5, 2024, 2:21 PM

#

atomic violet dherse should, I have not done anything yet lol, I am just trying to find sus...

it's done by accessing fields

#

you can find that in access.rs, somewhere in there there's a get_field or something being called

#

that's what's doing it

#

afaik

atomic violet Feb 5, 2024, 2:22 PM

#

until I see that in my call graph, I am not turning back

#

👺

atomic violet Feb 5, 2024, 2:58 PM

#

@sturdy sequoia What is foundations::Func::method?

#

it looks like it prehashes hella lot

#

and it is invoked in Access::read

#

🤔

sturdy sequoia Feb 5, 2024, 3:00 PM

#

It creates a method from a func and a value

#

it's a dirty hack I created to make method access work

#

💀

atomic violet Feb 5, 2024, 3:00 PM

#

Does every array.at(..) prehash the array? Is this necessary?

#

because it sure looks like it
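To show why that would hurt, here is a sketch of an eager hashing wrapper. It is not Typst's or comemo's implementation, and whether `Func::method` really does this is exactly the open question above: `Prehashed` here is a hypothetical minimal version, and `Counted`, `ELEMENTS_HASHED`, and `at_via_wrapper` exist only to make the cost visible.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};
use std::sync::atomic::{AtomicUsize, Ordering};

/// Counts how many elements have been fed to a hasher so far.
pub static ELEMENTS_HASHED: AtomicUsize = AtomicUsize::new(0);

/// Array element whose `Hash` impl records every call.
pub struct Counted(pub i64);

impl Hash for Counted {
    fn hash<H: Hasher>(&self, state: &mut H) {
        ELEMENTS_HASHED.fetch_add(1, Ordering::Relaxed);
        self.0.hash(state);
    }
}

/// Hypothetical eager wrapper: hashes the value once, on construction.
pub struct Prehashed<T> {
    pub hash: u64,
    pub value: T,
}

impl<T: Hash> Prehashed<T> {
    pub fn new(value: T) -> Self {
        let mut hasher = DefaultHasher::new();
        value.hash(&mut hasher); // walks the whole value
        Prehashed { hash: hasher.finish(), value }
    }
}

/// Reads one element, but wraps the receiver first, the way a bound
/// method object might.
pub fn at_via_wrapper(arr: &[Counted], index: usize) -> i64 {
    let bound = Prehashed::new(arr);
    bound.value[index].0
}
```

In this sketch every call to `at_via_wrapper` hashes all elements of the array, so a single-element read costs time proportional to the array length.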

#

yeah wait lol what

#Performance