std::mem::size_of weird behaviour on mac m1 chipset | Rust Programming Language Community | Page 1

signal osprey Dec 13, 2023, 4:08 PM

#

Hey 👋

When I run the following code on my mac I get: "size of InnerNode: 96", while the actual size of that struct should be 88

use std::mem::size_of;

pub struct InnerNode {
    pub tag: u8,
    pub padding: [u8; 3],
    pub prefix_len: u32,
    pub key: u128,
    pub children: [u32; 2],
    pub child_earliest_expiry: [u64; 2],
    pub reserved: [u8; 40],
}

fn main() {
    println!("size of InnerNode: {}", size_of::<InnerNode>());
}

If I run the same program on the rust playground I get the correct size - https://play.rust-lang.org/?version=stable&mode=debug&edition=2021&gist=df6a72a6f427b5cbd60b79ac935424b7

What gives? Is this an expected behaviour?

Rust Playground

A browser interface to the Rust compiler to experiment with the language

late bramble Dec 13, 2023, 4:22 PM

#

is your mac an M1 by any chance? I can totally imagine that ARM has different alignment requirements to x86.
in this case, I suspect it comes down to the alignment of the u128.

signal osprey Dec 13, 2023, 4:27 PM

#

Yup it's an M1 - so this type of behaviour is expected then?

late bramble Dec 13, 2023, 4:29 PM

#

at least it's not unexpected

#

Some examples here:
x86_64: 8 byte alignment
https://godbolt.org/z/P1MT8G59b

ARM64: 16 byte alignment
https://godbolt.org/z/ncW9YfGnv

Risc-V (rv64gc): 16 byte alignment
https://godbolt.org/z/bsM1e5of4

#

If anything, 88 bytes is the unexpected value.

signal osprey Dec 13, 2023, 4:53 PM

#

thanks! that was helpful

#

do you have any tips while coding libs that need to support both? let's say a lib like openbook-v2, asserts the size of specific structs, the size of those structs is "hardcoded" into a const

#

what would be your approach to support both scenarios?

#

being more straightforward - stuff like this - https://github.com/openbook-dex/openbook-v2/blob/master/programs/openbook-v2/src/state/orderbook/nodes.rs#L101

GitHub

openbook-v2/programs/openbook-v2/src/state/orderbook/nodes.rs at ma...

Contribute to openbook-dex/openbook-v2 development by creating an account on GitHub.

late bramble Dec 13, 2023, 5:02 PM

#

In general, you shouldn't really care. Hardcoding the size of as struct like you just linked to seems really bad

#

it should instead be

const NODE_SIZE: usize = std::mem::size_of::<Node>();

#

which also makes the asserts pointless

#

oh it compares the size of two different structs? that's very strange

vital oriole Dec 13, 2023, 5:04 PM

#

the only reason I can see you'd care about the size of the struct if you're doing shenanigans with writing the struct to network/disk

#

in which case.. please be careful

#

if you are doing this, one way to solve it would be to also force align 1

#

or to use the types from something like zerocopy which are already align 1

late bramble Dec 13, 2023, 5:07 PM

#

vital oriole if you are doing this, one way to solve it would be to also force align 1

You can't reduce alignment

vital oriole Dec 13, 2023, 5:07 PM

#

you can

late bramble Dec 13, 2023, 5:07 PM

#

For align, if the specified alignment is less than the alignment of the type without the align modifier, then the alignment is unaffected.

vital oriole Dec 13, 2023, 5:07 PM

#

repr(packed)

late bramble Dec 13, 2023, 5:07 PM

#

packed(1) true

vital oriole Dec 13, 2023, 5:09 PM

#

interesting that u128 has align 16 on m1

#

wonder why

late bramble Dec 13, 2023, 5:10 PM

#

But even if you use both repr(C) and pepr(packed), you still shouldn't hardcode the sizes

#

E.g. usize could still be different between platforms

late bramble Dec 13, 2023, 5:10 PM

#

vital oriole interesting that u128 has align 16 on m1

I mean, it's the natural alignment for a u128

vital oriole Dec 13, 2023, 5:10 PM

#

not really

late bramble Dec 13, 2023, 5:11 PM

#

x86 is the strange one here

vital oriole Dec 13, 2023, 5:11 PM

#

it's still doing 2 ops on u64 for add for instance

late bramble Dec 13, 2023, 5:11 PM

#

i imagine there could easily be 128 bit mem loads that are aligned

vital oriole Dec 13, 2023, 5:12 PM

#

alignment is needed because cpus do loads and stores to aligned addresses because registers are that size

#

if you don't have 128bit registers and instructions working on 128bit

#

you don't need align 16

late bramble Dec 13, 2023, 5:13 PM

#

https://github.com/rust-lang/rust/issues/54341

GitHub

i128 / u128 are not compatible with C's definition. · Issue #54341 ...

While fixing various bindgen bugs related to long double, int128, (rust-lang/rust-bindgen#1370, etc). I realized that the following Rust program, in my x86_64 Linux machine: #[repr(C)] struct Foo {...

#

on topic

vital oriole Dec 13, 2023, 5:14 PM

#

for instance

#

u64 on i686 has align 4

#

because it doesn't need more

vital oriole Dec 13, 2023, 5:16 PM

#

late bramble https://github.com/rust-lang/rust/issues/54341

llvm 18 seems to have a fix eyesFinite

late bramble Dec 13, 2023, 5:16 PM

#

on a cpu architecture level, you probably want your alignment to reflect your memory interface, not your data type

vital oriole Dec 13, 2023, 5:16 PM

#

that's just wasted memory

late bramble Dec 13, 2023, 5:16 PM

#

if you have a 32 bit memory interface, any alignment greater than 32 bits doesn't make sense

vital oriole Dec 13, 2023, 5:16 PM

#

what's a 32 bit memory interface

late bramble Dec 13, 2023, 5:17 PM

#

if you have a 64 bit memory interface, you want your u64 aligned to 8 bytes for maximum performance

late bramble Dec 13, 2023, 5:17 PM

#

vital oriole what's a 32 bit memory interface

a memory interface that can read 4 bytes/32 bits at once

#

(this is about cpu internal architecture, whatever that is)

vital oriole Dec 13, 2023, 5:17 PM

#

well

#

virtually all cpus these days can read 512 bits at a time

#

align 64 wouldn't be so great

late bramble Dec 13, 2023, 5:18 PM

#

yeah in which case it makes sense to require u128 to be aligned to 16 bytes

#

because then it's always within a single read

#

instead of possibly two separate reads

vital oriole Dec 13, 2023, 5:19 PM

#

that's a decent point

#

that's still assuming you're not using the data before and after the u128

#

which you should probably do

#

in which case the read from 2 cache lines is almost free on amd64

#std::mem::size_of weird behaviour on mac m1 chipset