Fixing UB in random crates | Rust Programming Language Community | Page 2

haughty mica Oct 11, 2022, 2:37 PM

#

That is correct

#

I would fix this by not using Box

#

I'm trying to campaign for fixing this by removing the LLVM attributes from Box, but it's hard to know if I'm making my voice heard or being annoying

#

At least Ralf expects me to say the line now

wraith wind Oct 11, 2022, 2:40 PM

#

What would be a good replacement for box here? Rc, using alloc?

haughty mica Oct 11, 2022, 2:40 PM

#

Rolling your own Box so that it doesn't have the attributes

#

Basically: https://docs.rs/aliasable/latest/aliasable/boxed/struct.AliasableBox.html

aliasable::boxed::AliasableBox - Rust

API documentation for the Rust AliasableBox struct in crate aliasable.

#

But that crate also has a few silly issues someone should fix

#

They are minor

wraith wind Oct 11, 2022, 2:42 PM

#

Would a PR that pulled in this crate as a dependency be reasonable, or would it be better to just reuse the relevant parts of it?

haughty mica Oct 11, 2022, 2:43 PM

#

That's a question for the maintainer.

#

I often find it hard to predict what tolerance people have for these patches and also for more dependencies

tender nimbus Oct 11, 2022, 3:14 PM

#

I have said already that this lint is good ferrisSob

tender nimbus Oct 11, 2022, 3:15 PM

#

haughty mica I'm trying to campaign for fixing this by removing the LLVM attributes from Box,...

Luckily there's a ton of activity on the ucg issue ferrisClueless ferrisForgor

wraith wind Oct 11, 2022, 3:19 PM

#

Swapping in that crate gives me a weird error

        match self.map.remove(k) {
            None => None,
            // old_node was a Box<..> and is now AliasableBox<..>
            Some(mut old_node) => {
                unsafe { 
                    ptr::drop_in_place(old_node.key.as_mut_ptr());
                }
                let node_ptr: *mut LruEntry<K, V> = &mut *old_node;
                self.detach(node_ptr);
                unsafe { Some(old_node.val.assume_init()) }
            }
        }
    }

error[E0507]: cannot move out of dereference of `AliasableBox<LruEntry<K, V>>`
   --> src/lib.rs:704:31
    |
704 |                 unsafe { Some(old_node.val.assume_init()) }
    |                               ^^^^^^^^^^^^ ------------- value moved due to this method call
    |                               |
    |                               move occurs because value has type `MaybeUninit<V>`, which does not implement the `Copy` trait

I'm not quite seeing why this works with box and not aliasablebox?

haughty mica Oct 11, 2022, 3:22 PM

#

Box magic

#

😩

wraith wind Oct 11, 2022, 3:23 PM

#

I guess I can turn it back into a regular box here ferrisThonk

haughty mica Oct 11, 2022, 3:23 PM

#

Perhaps

#

But I think this is DerefMove

#

Not that this helps you make the code compile, it just explains why you have the error without stdlib Box

tender nimbus Oct 11, 2022, 3:54 PM

#

wraith wind Swapping in that crate gives me a weird error ```rs match self.map.remov...

ferrisForgor

#

time to write it all by hand using raw pointers and making the code 5x worse

#

i love box

wraith wind Oct 11, 2022, 3:55 PM

#

I fixed that bit and then got another SB error with the invalidation not being local so I will simply give up for now ferrisClueless

tender nimbus Oct 11, 2022, 3:56 PM

#

ferrisClueless

thorny venture Oct 11, 2022, 5:29 PM

#

Saethlin would be proud of me. I optimised some code today to work well in debug

#

It did require a lot of effort though. I also had to avoid the desire to reach for unsafe

haughty mica Oct 11, 2022, 5:59 PM

#

Susge

#

I do approve but still

thorny venture Oct 11, 2022, 6:08 PM

#

I had to do a str utf8 validation 😦

tender nimbus Oct 11, 2022, 6:09 PM

#

good

thorny venture Oct 11, 2022, 6:09 PM

#

I wonder if it optimises away in release

tender nimbus Oct 11, 2022, 6:10 PM

#

I don't think utf8 validation will ever optimize away will it

thorny venture Oct 11, 2022, 6:17 PM

#

?godbolt

pub fn foo() -> &'static str {
    static B: &[u8] = b"hello";
    std::str::from_utf8(B).unwrap()
}

small apexBOT Oct 11, 2022, 6:17 PM

#

core::ptr::drop_in_place<core::str::error::Utf8Error>:
        ret

example::foo:
        sub     rsp, 56
        lea     rsi, [rip + .L__unnamed_1]
        lea     rdi, [rsp + 8]
        mov     edx, 5
        call    qword ptr [rip + core::str::converts::from_utf8@GOTPCREL]
        cmp     qword ptr [rsp + 8], 0
        jne     .LBB1_1
        mov     rax, qword ptr [rsp + 16]
        mov     rdx, qword ptr [rsp + 24]
        add     rsp, 56
        ret
.LBB1_1:
        movups  xmm0, xmmword ptr [rsp + 16]
        movaps  xmmword ptr [rsp + 32], xmm0
        lea     rdi, [rip + .L__unnamed_2]
        lea     rcx, [rip + .L__unnamed_3]
        lea     r8, [rip + .L__unnamed_4]
        lea     rdx, [rsp + 32]
        mov     esi, 43
        call    qword ptr [rip + core::result::unwrap_failed@GOTPCREL]
        ud2

.L__unnamed_2:
        .ascii  "called `Result::unwrap()` on an `Err` value"

.L__unnamed_3:
        .quad   core::ptr::drop_in_place<core::str::error::Utf8Error>
        .asciz  "\020\000\000\000\000\000\000\000\b\000\000\000\000\000\000"
        .quad   <core::str::error::Utf8Error as core::fmt::Debug>::fmt

.L__unnamed_5:
        .ascii  "/app/example.rs"

.L__unnamed_4:
        .quad   .L__unnamed_5
        .asciz  "\017\000\000\000\000\000\000\000\003\000\000\000\034\000\000"

.L__unnamed_1:
        .ascii  "hello"

thorny venture Oct 11, 2022, 6:17 PM

#

Rip 😦

ruby jacinth Oct 11, 2022, 6:27 PM

#

thorny venture ?godbolt ```rust pub fn foo() -> &'static str { static B: &[u8] = b"hello"; ...

you can do that if const is OK

#

?godbolt ```rust
pub fn foo() -> &'static str {
const B: &[u8] = b"hello";
const A: &str = match std::str::from_utf8(B) {
Ok(s) => s,
Err(_) => panic!(),
};
A
}

small apexBOT Oct 11, 2022, 6:27 PM

#

example::foo:
        lea     rax, [rip + .L__unnamed_1]
        mov     edx, 5
        ret

.L__unnamed_1:
        .ascii  "hello"

thorny venture Oct 11, 2022, 6:29 PM

#

Nah non const 😦

ruby jacinth Oct 11, 2022, 6:30 PM

#

why'd you not use const for this

thorny venture Oct 11, 2022, 6:32 PM

#

My use case is more more complex

#

It was a minimum example

#

You might as well ask why I didn't use a string instead of raw bytes

haughty mica Oct 11, 2022, 7:23 PM

#

Do you need all of UTF8

#

If it's an ASCII literal you may be able to wrap in an ASCII fast path and get the optimization

thorny venture Oct 11, 2022, 7:56 PM

#

I need a string output. I'm working on ascii bytes already

#

But I'm not reaching for unsafe because its work

tender nimbus Oct 11, 2022, 7:58 PM

#

ferrisRelieved

haughty mica Oct 11, 2022, 8:07 PM

#

Actually I wonder what happens if you crank the inline threshold

thorny venture Oct 11, 2022, 8:08 PM

#

Yeah I was thinking of writing a macro crate that does it inline

#

To force inline across crates

haughty mica Oct 11, 2022, 10:05 PM

#

haughty mica Actually I wonder what happens if you crank the inline threshold

You can get the innards to inline, but LLVM doesn't erase the code. Bummer.

thorny venture Oct 12, 2022, 5:07 AM

#

I wonder if a from_ascii method could do better

thorny venture Oct 12, 2022, 6:16 AM

#

?godbolt

pub fn foo() -> &'static str {
    static B: &[u8] = b"hello";
    from_ascii(B).unwrap()
}

fn from_ascii(bytes: &[u8]) -> Result<&str, ()> {
    if bytes.iter().copied().all(|b| b.is_ascii()) {
        unsafe { Ok(std::str::from_utf8_unchecked(bytes)) }
    } else {
        Err(())
    }
}

small apexBOT Oct 12, 2022, 6:16 AM

#

example::foo:
        lea     rax, [rip + .L__unnamed_1]
        mov     edx, 5
        ret

.L__unnamed_1:
        .ascii  "hello"

thorny venture Oct 12, 2022, 6:17 AM

#

oh shit it does

tender nimbus Oct 12, 2022, 6:18 AM

#

ferrisFlushed

haughty mica Oct 12, 2022, 1:26 PM

#

As predicted

haughty mica Oct 16, 2022, 2:12 AM

#

pyo3 wat r u doin https://asan.saethlin.dev/ub?crate=pyo3&version=0.17.2

#

Oh it's a classico

#

https://github.com/PyO3/pyo3/pull/2687

GitHub

Avoid calling slice::from_raw_parts with a null pointer by saethlin...

slice::from_raw_parts requires that the pointer not be null, even if the length is zero. This is because the pointer is converted into a reference, and references must not be null. This is detected...

ruby jacinth Oct 16, 2022, 9:11 AM

#

ferrisPensive

haughty mica Oct 17, 2022, 4:42 AM

#

I just learned that cargo-careful has been eating my attempt to run on ASan

tender nimbus Oct 17, 2022, 6:35 AM

#

ferrisballSweat

haughty mica Oct 17, 2022, 7:09 PM

#

Not sure if this is a false positive https://asan.saethlin.dev/logs/criterion/0.4.0.html

tender nimbus Oct 17, 2022, 7:12 PM

#

ferrisConcern

haughty mica Oct 17, 2022, 7:38 PM

#

This is the read_volatile implementation of black_box

tough leaf Oct 17, 2022, 7:40 PM

#

pub fn black_box<T>(dummy: T) -> T {
    unsafe {
        let ret = std::ptr::read_volatile(&dummy);
        std::mem::forget(dummy);
        ret
    }
}

yeah tbh i don't see how that can segfault, if this is the code being ran

tender nimbus Oct 17, 2022, 7:43 PM

#

but say T = Box<_>, now you do have sb issues

#

this should be using ManuallyDrop

or the real black box once we finally manage to stabilize that gopherballSweat

#

#[stable(feature = "bench_black_box", since = "CURRENT_RUSTC_VERSION")]

oh nice

#

we did

#

lol

haughty mica Oct 17, 2022, 9:24 PM

#

There is no segfault

#

ASan just thinks this is like a wild pointer access

#

I should minimize this

haughty mica Oct 17, 2022, 10:41 PM

#

I can't even reproduce this. Fuck.

#

I'm sure it's some kind of ASan bug

#

Another crate with the same issue: https://asan.saethlin.dev/logs/benchmarking/0.4.11.html

neon tiger Oct 17, 2022, 11:04 PM

#

haughty mica I'm trying to campaign for fixing this by removing the LLVM attributes from Box,...

FWIW I'm with you on this

haughty mica Oct 17, 2022, 11:35 PM

#

know this is some spicy low-level code but a stack buffer overflow seems a bit much https://asan.saethlin.dev/logs/trapframe/0.9.0.html

wraith wind Oct 21, 2022, 6:47 PM

#

https://github.com/vorner/arc-swap/pull/80#issuecomment-1287312728

bumping the MSRV will only cause them staying at the previous version of the code which has the same problem. So bumping it solves nothing.

tender nimbus Oct 21, 2022, 6:48 PM

#

but still unsound on 1.31
unsound with a stacked borrows violation?

#

or unsound how?

#

if you know that you're only compiling against old versions, soundness suddenly matters a lot less

wraith wind Oct 21, 2022, 6:49 PM

#

Writing through a pointer derived from &*ref with no interior mutability

tender nimbus Oct 21, 2022, 6:49 PM

#

hmm

#

i mean

#

it will probably be fine

#

maybe a few noalias/readonly shenanigans but no one found a "miscompilation" so far so it's gonna be fine

wraith wind Oct 21, 2022, 6:51 PM

#

I know that the read is just a bitwise copy and into_raw currently calls the forget. In the current version, these are enough. Nevertheless, this is with the knowledge of internal implementation
given that into_raw is documented as 'Consumes the Rc, returning the wrapped pointer.', can my implementation here be relied on?

#

fn as_ptr(me: &Rc<T>) -> *mut T {
    let ptr = Rc::into_raw(unsafe { std::ptr::read(me) } );
    ptr as *mut T
}
``` for context

wraith wind Oct 21, 2022, 6:53 PM

#

wraith wind <https://github.com/vorner/arc-swap/pull/80#issuecomment-1287312728> > bumping ...

I'm trying to persuade them to bump so I can use the definitely-documented-as-working-this-way method, and this seems like a weird argument for not doing that?

#

Since either way people on 1.31 are going to be using technically unsafe code which can't be made safe

tender nimbus Oct 21, 2022, 7:07 PM

#

just keep on unsound on the old version

wraith wind Oct 21, 2022, 7:09 PM

#

Well I think the options are to

bump MSRV, allowing use of the documented and safe way of doing it but blocking 1.31
use this method, which is definitely safe now, possibly safe in future, and definitely UB on 1.31

#

Not sure which to argue for

tender nimbus Oct 21, 2022, 7:11 PM

#

the one with less arguing

wraith wind Oct 21, 2022, 7:13 PM

#

I can't really tell what the author wants here

haughty mica Oct 21, 2022, 7:16 PM

#

This is why Zurr suggested doing version detection

#

If someone really wants to stay on an old an possibly-broken compiler, fine. But you can add version detection to ship a fix to users of a newer toolchain

tender nimbus Oct 21, 2022, 7:22 PM

#

yeah that's the best way probably (and i say this as someone who hates version detection)

wraith wind Oct 21, 2022, 7:26 PM

#

Alright, I'll go with that ferristhumbsup

haughty mica Oct 21, 2022, 7:33 PM

#

But also, there's no sense in annoying a maintainer

wraith wind Oct 21, 2022, 7:35 PM

#

They've been fairly positive in other discussions, just difficult to communicate over Github i guess

west violet Oct 22, 2022, 12:12 AM

#

Kinda wish we had native version detection instead of the somewhat ad-how stuff we’ve got

haughty mica Oct 22, 2022, 12:35 AM

#

Someone should RustSec this: https://asan.saethlin.dev/logs/grin_secp256k1zkp/0.7.11.html
(stack use after scope and heap buffer overflow)

#

The stack use after scope is typical cryptography code UB

#

The heap buffer overflow I do not recognize

tender nimbus Oct 22, 2022, 7:32 AM

#

typical cryptography ub

tender nimbus Oct 22, 2022, 7:33 AM

#

haughty mica Someone should RustSec this: https://asan.saethlin.dev/logs/grin_secp256k1zkp/0....

@strange egret do you want to rustsec it

strange egret Oct 22, 2022, 10:56 AM

#

tender nimbus <@107209884545609728> do you want to rustsec it

I'm happy to review and merge it if someone makes the PR. But the first step is always to notify the maintainer.

tame jewel Oct 22, 2022, 11:00 AM

#

haughty mica Someone should RustSec this: https://asan.saethlin.dev/logs/grin_secp256k1zkp/0....

It also has a ton of the typical: let mut ret: [$ty; $len] = mem::MaybeUninit::uninit().assume_init() ($ty is u8).
Sometimes the deprecation of mem::uninitialized seems not to have provided sufficient nudging to doing the right thing

tender nimbus Oct 22, 2022, 11:00 AM

#

ferrisClueless

#

wtf, this crate is an outdated fork of something from rust-bitcoin wtf

#

and the bitcoin one passes asan

#

rare coiner W

#

(well, this fork is also made by blockchain people for a cryptography which seems to be mainly focused on privacy?)

tough leaf Oct 22, 2022, 11:06 AM

#

wait is this crate literally just a fork that didn't change anything

tender nimbus Oct 22, 2022, 11:07 AM

#

This branch is 376 commits behind rust-bitcoin:master.

#

I have no idea wtf they're doing

strange egret Oct 22, 2022, 11:21 AM

#

Okay yeah then someone should probably rustsec it ferrisSweat

#

I'd merge the advisory

tender nimbus Oct 22, 2022, 11:23 AM

#

common coiner L

tough leaf Oct 22, 2022, 11:32 AM

#

lmao

glass chasm Oct 23, 2022, 2:28 AM

#

So much UB

#

Seriously, why are they doing this

haughty mica Oct 23, 2022, 2:42 AM

#

tender nimbus common coiner L

^

thorny venture Oct 23, 2022, 7:09 AM

#

So it seems like quite a few things might break soon ferrisBehehe

#

?eval 1

small apexBOT Oct 23, 2022, 7:10 AM

#

warning: the following packages contain code that will be rejected by a future version of Rust: traitobject v0.1.0
note: to see what the problems were, use the option `--future-incompat-report`, or run `cargo report future-incompatibilities --id 3`
     Running `target/debug/playground`

1

thorny venture Oct 23, 2022, 7:10 AM

#

traitobject is a dependency of unsafe-any which is a dependency of typemap. All lovely UB crates, but still have high download numbers

tender nimbus Oct 23, 2022, 7:47 AM

#

waiit which fcw is tha

#

It transmutes fat pointers

#

https://github.com/rust-lang/rust/pull/102635 uhm

#

lmao it'a an FCP now because the way the lint is implemented will break ferrisBut

haughty mica Oct 24, 2022, 4:44 AM

#

There is some really spicy stuff far down on the list on https://asan.saethlin.dev/ub

#

Such as an ODR violation: https://asan.saethlin.dev/ub?crate=whosly&version=0.1.5

#

I feel like that one is on me but I don't know how

#

Not sure if this is a UB in the Rust library or the C library: https://asan.saethlin.dev/ub?crate=clickhouse-driver-lz4&version=0.1.0

tender nimbus Oct 24, 2022, 4:53 AM

#

haughty mica Not sure if this is a UB in the Rust library or the C library: https://asan.saet...

average memcpy use

west violet Oct 26, 2022, 3:14 AM

#

@haughty mica https://github.com/sslab-gatech/Rudra/blob/master/rudra-sosp21.pdf

GitHub

Rudra/rudra-sosp21.pdf at master · sslab-gatech/Rudra

Rust Memory Safety & Undefined Behavior Detection. Contribute to sslab-gatech/Rudra development by creating an account on GitHub.

#

I know you’ve probably seen it but it’d be interesting to see if parts of it can be integrated into miri

haughty mica Oct 26, 2022, 3:17 AM

#

I think Rudra is just a linter

#

Also only 264 bugs kekw

west violet Oct 26, 2022, 3:17 AM

#

It’s a dataflow analysis thingy

haughty mica Oct 26, 2022, 3:17 AM

#

Ah ah yeah

west violet Oct 26, 2022, 3:18 AM

#

So it’s not really going for "all the ub", specifically some incorrect unsafe code

haughty mica Oct 26, 2022, 3:18 AM

#

I'm not sure why it would be integrated into Miri

west violet Oct 26, 2022, 3:19 AM

#

Well probably not integrated per say, more like miri could implement that algo

#

Or something along those lines

haughty mica Oct 26, 2022, 3:35 AM

#

Miri is an interpreter

#

Rudra is static analysis

haughty mica Oct 27, 2022, 9:00 PM

#

What is happening here?? https://asan.saethlin.dev/ub?crate=gltfgen&version=0.6.1

#

Custom test framework maybe?

ebon orbit Nov 3, 2022, 2:50 PM

#

owo i forgor about this

haughty mica Nov 3, 2022, 8:56 PM

#

uwu

regal galleon Nov 8, 2022, 4:13 PM

#

if someone is using unsafe { mem::unitialized() } for a [u64; N] i can just replace that with [0; N] right?

wide acorn Nov 8, 2022, 4:14 PM

#

Might want to profile it

#

And if it's noticeably slower, you can try to use MaybeUninit

regal galleon Nov 8, 2022, 4:16 PM

#

ty

regal galleon Nov 8, 2022, 4:38 PM

#

wait nvm i wasn't understanding the code, i'll just use a maybeuninit array

thorny venture Nov 8, 2022, 5:30 PM

#

regal galleon if someone is using `unsafe { mem::unitialized() }` for a `[u64; N]` i can just ...

That is correct yes, we 'fixed' uninitialized because it was guaranteed UB, and the fix was always producing an init value (notably, not always a valid value).

However, yeah, use MaybeUninit instead

tender nimbus Nov 8, 2022, 6:07 PM

#

sometimes a zeroed array will be just as fast

#

or something like array::from_fn could be used

ruby jacinth Nov 8, 2022, 6:48 PM

#

thorny venture That is correct yes, we 'fixed' uninitialized because it was guaranteed UB, and ...

That's an implementation detail ferrischu

thorny venture Nov 8, 2022, 6:49 PM

#

Yeah but performance wise they would be correct

#

Performance is never a documented behaviour ferrisBut

tough leaf Nov 8, 2022, 6:52 PM

#

wide acorn And if it's noticeably slower, you can try to use `MaybeUninit`

mem::uninitialized will be exactly as fast as [0; N] nowadays

#

because it is
well, 1-filled

#

but still
it does init the array :)

#

but yeah, [0; N] is fine

wraith wind Nov 8, 2022, 7:15 PM

#

thorny venture Performance is never a documented behaviour <:ferrisBut:536631961523978240>

Add sleep(100000) to all deprecated methods as a non breaking change to really get people to stop using it

#

ferrisGlasses

haughty mica Nov 8, 2022, 7:20 PM

#

wide acorn Might want to profile it

This is no longer a relevant concern, because mem::uninitialized is not actually uninitialized.

#

More fool me for not scrolling

wide acorn Nov 8, 2022, 7:22 PM

#

Well, presumably you'd have to profile it on an older version of Rust

#

or perhaps profile it against MaybeUninit::uninit().assume_init()

tender nimbus Nov 8, 2022, 7:22 PM

#

haughty mica This is no longer a relevant concern, because `mem::uninitialized` is not actual...

yeah but you'd probably want to restore the original performance if it exists
but yes you need to profile against MU

haughty mica Nov 8, 2022, 7:22 PM

#

Eh

#

My point is just that a PR which removes the unsafe entirely is not a perf regression

#

So it would be illogical for a maintainer to object saying that the PR regresses perf

wide acorn Nov 8, 2022, 7:30 PM

#

Well, it would be the Rust version update that regressed perf

#

So the comparison would really have to be against the prior status quo

west violet Nov 8, 2022, 9:23 PM

#

wraith wind Add sleep(100000) to all deprecated methods as a non breaking change to really g...

This is unironically the real solution to this problem

wide acorn Nov 8, 2022, 10:23 PM

#

Just increase the delay exponentially each Rust version

#

The main problem is, that would completely break our backcompat promises if we did it for every deprecated function

#

e.g., I worked with an old crate a while back that ran just fine but spat out a million warnings for r#try!()

tough leaf Nov 8, 2022, 10:29 PM

#

wide acorn The main problem is, that would completely break our backcompat promises if we d...

mem::uninit (and to a lesser extent mem::zeroed) are special because they're UB in the way basically everyone uses them

#

so treating them specially here makes sense

haughty mica Nov 8, 2022, 10:50 PM

#

wide acorn Well, it would be the Rust version update that regressed perf

You can't control the Rust version that your users are using

wide acorn Nov 8, 2022, 11:03 PM

#

haughty mica You can't control the Rust version that your users are using

Sure, but I think it's somewhat deceptive to say "you did this unsafe operation that had good perf and you thought was correct, now it has bad perf and the safe operation is equivalently slow, therefore the safe operation is just as good"

#

That is, when they wrote mem::uninitialized(), they intended to get an uninitialized array in O(1) time

haughty mica Nov 8, 2022, 11:05 PM

#

I submit to you that if they haven't changed the code away from mem::uninitialized in the meantime, they probably don't care enough about performance to merit the unsafe

wide acorn Nov 8, 2022, 11:05 PM

#

So just because we changed it on our end because it didn't fit with our model, doesn't mean that now the safe O(n) operation is as good as the original O(1) operation

haughty mica Nov 8, 2022, 11:06 PM

#

I agree that there is some violation of programmer intent, but there is more to the situation

wide acorn Nov 8, 2022, 11:06 PM

#

haughty mica I submit to you that if they haven't changed the code away from `mem::uninitiali...

People write codebases without keeping them regularly up to date with the latest warnings, that's just the way of the world

haughty mica Nov 8, 2022, 11:07 PM

#

Oh lmao

#

0x1-filling is a 1.64 feature

#

Ignore me

#

My messages will apply in a few months

#

Once it is stable-4 everyone who cares deeply about perf should have noticed

regal galleon Nov 9, 2022, 4:58 AM

#

in the threads too????

tender nimbus Nov 9, 2022, 5:45 AM

#

tough leaf mem::uninit (and to a lesser extent mem::zeroed) are special because they're UB ...

except nowadays old crates using it are pretty fine and it doesn't matter that much

tough leaf Nov 9, 2022, 10:06 AM

#

with the filling? yeah

without? i'd be worried about noundef once we emit that

tender nimbus Nov 9, 2022, 10:16 AM

#

tough leaf with the filling? yeah without? i'd be worried about `noundef` once we emit tha...

yes but we have filling do the latter question doesn't matter ferrisOwO

wide acorn Nov 9, 2022, 12:45 PM

#

https://github.com/rust-lang/rust/pull/100423

#

^ this would make the profiling much more annoying if it goes through

#

And crate authors that don't care about UB will probably just replace MaybeUninit::uninit().assume_init() with {let x=MaybeUninit::uninit();x.assume_init()}

tough leaf Nov 9, 2022, 12:51 PM

#

wide acorn ^ this would make the profiling much more annoying if it goes through

profiling?

#

it won't change correct code, only insert panics into UB functions

tender nimbus Nov 9, 2022, 12:56 PM

#

wide acorn ^ this would make the profiling much more annoying if it goes through

hm?

#

how would this impact profiling????

#

the extra function call will be yeeted away by the optimizer

wide acorn Nov 9, 2022, 12:59 PM

#

tender nimbus how would this impact profiling????

If you want to quickly evaluate how long zero-filling takes without loads of assume_init() churn

tender nimbus Nov 9, 2022, 12:59 PM

#

so you want to quickly write some UB to find out whether using MU properly would make it faster?

wide acorn Nov 9, 2022, 12:59 PM

#

wide acorn or perhaps profile it against `MaybeUninit::uninit().assume_init()`

like my earlier suggestion

wide acorn Nov 9, 2022, 12:59 PM

#

tender nimbus so you want to quickly write some UB to find out whether using MU properly would...

exactly

#

It's lying to LLVM, but as long as you don't leave it in prod (and don't try to read the memory before initializing it) it's not gonna be the end of the world

tender nimbus Nov 9, 2022, 1:01 PM

#

You can complain on the pr

#

I don't really care about this PR

wide acorn Nov 9, 2022, 1:03 PM

#

eh, it's not the strongest argument against it

#

"it won't change correct code, only insert panics into UB functions" is the obvious rebuttal

ruby jacinth Nov 9, 2022, 3:57 PM

#

wide acorn And crate authors that don't care about UB will probably just replace `MaybeUnin...

Who cares about people that just pull out another footgun when you take theirs away lol

#

At some point people are beyond help

wide acorn Nov 9, 2022, 4:01 PM

#

ruby jacinth Who cares about people that just pull out another footgun when you take theirs a...

Sure, and I think MaybeUninit::uninit().assume_init() is an obvious-enough footgun by itself that we shouldn't go beyond linting it

tough leaf Nov 9, 2022, 4:04 PM

#

wide acorn Sure, and I think `MaybeUninit::uninit().assume_init()` is an obvious-enough foo...

counterpoint: it's the correct way to make a [MaybeUninit<String>; 4]

wide acorn Nov 9, 2022, 4:04 PM

#

Yes, and in that case you know what you're doing

#

In every other case you're knowingly (afaik) using a footgun

tender nimbus Nov 9, 2022, 4:23 PM

#

maybe an extra bonk will get them to use it properly

#

but also I'm not sure whether it will

tame jewel Nov 9, 2022, 5:05 PM

#

wide acorn https://github.com/rust-lang/rust/pull/100423

Imo, we should change the deprecation status of mem::uninitialized() if this rewrite is the only consequence users take after the deprecation.
Panicking here feels very much like playing whack-a-mole with bad code, the proper fix would be ensuring that the correct use is easier to write than the incorrect use.

tame jewel Nov 9, 2022, 5:06 PM

#

tough leaf counterpoint: it's the correct way to make a `[MaybeUninit<String>; 4]`

The correct way of that was mem::uninitialized(). That remains correct, only deprecated without technical need for it.

#

Not necessarily undeprecated the method, MaybeUninit is potentially more useful, but there clearly is use for more actionable advice on how to change common code pattern that use it into correct use of MaybeUninit. Potentially something that involves some form of (miri-based) checks if new use is an actual improvement.

tender nimbus Nov 9, 2022, 5:19 PM

#

tame jewel Imo, we should change the deprecation status of `mem::uninitialized()` if this r...

the correct code is very easy to write don't worry

tender nimbus Nov 9, 2022, 5:20 PM

#

tame jewel The correct way of that was `mem::uninitialized()`. That remains correct, only d...

no it does not remain correct, it's not the correct way to do that anymore at all
in fact it won't even leave it uninit as mem::uninitialized just fills all bytes with 1 nowadays

tough leaf Nov 9, 2022, 5:20 PM

#

tender nimbus no it does not remain correct, it's not the correct way to do that anymore at al...

mem::uninit is a good way of making a [MaybeUninit<String>; 4]

#

edited

#

or, would be

#

honestly i think some form of safe transmute would be useful here

tame jewel Nov 9, 2022, 5:21 PM

#

tender nimbus no it does not remain correct, it's not the correct way to do that anymore at al...

filling with 1's is still correct.

#

It's just more obviously wrong for types that are not uninit.

tender nimbus Nov 9, 2022, 5:22 PM

#

tame jewel filling with 1's is still correct.

but you want uninit memory for a reason ferrisBut

#

if you don't want uninit memory just initialize it lol

#

using mem::uninitialized is ALWAYS wrong

tough leaf Nov 9, 2022, 5:23 PM

#

tender nimbus using mem::uninitialized is ALWAYS wrong

no?

tame jewel Nov 9, 2022, 5:23 PM

#

It's not always incorrect though.

tough leaf Nov 9, 2022, 5:23 PM

#

well, pre-mitigation

tender nimbus Nov 9, 2022, 5:23 PM

#

tough leaf well, pre-mitigation

I am talking about post mitigation

tame jewel Nov 9, 2022, 5:23 PM

#

All the lints and all the deprecation make it appear like it is always unsound, and that MaybeUninit::assume_init is less unsound.
Neither of these is the case.

tender nimbus Nov 9, 2022, 5:23 PM

#

tame jewel It's not always incorrect though.

It never does what you want which I would classify as incorrect

tough leaf Nov 9, 2022, 5:24 PM

#

yes because we changed it

tender nimbus Nov 9, 2022, 5:24 PM

#

tame jewel All the lints and all the deprecation make it appear like it is always unsound, ...

We have a lint for obviously invalid MaybeUninit::assume_init usage

tame jewel Nov 9, 2022, 5:24 PM

#

What I want is not to have to choose any specific value.

#

Which is exactly what it still does, the specified effects of both ways are exactly the same.

tender nimbus Nov 9, 2022, 5:24 PM

#

I don't quite follow this discussion actually

#

Never use mem::uninitialized, always use MaybeUninit in the correct way which is not hard

tame jewel Nov 9, 2022, 5:25 PM

#

tender nimbus Never use mem::uninitialized, always use MaybeUninit in the correct way which is...

Quite apparently it is hard. There's lots of MaybeUninit::uninit().assume_init()

tender nimbus Nov 9, 2022, 5:26 PM

#

tame jewel Quite apparently it _is_ hard. There's lots of `MaybeUninit::uninit().assume_ini...

No it's not
Those people are just too lazy to migrate
We literally cannot make it easier

#

(apart from making the creation of uninit arrays nicer which is being worked on)

tame jewel Nov 9, 2022, 5:30 PM

#

tender nimbus No it's not Those people are just too lazy to migrate We literally cannot make i...

That's my hypothesis, it didn't make the tasks that people want to achieve signficantly easier nor harder.
So they aren't switching because they see no point, or are switching to the exactly still unsound pattern which motivated 100423
Somewhere in documentation, lint, and information available the guide to a reasonable use of MaybeUninit is lost on developers.

tender nimbus Nov 9, 2022, 5:30 PM

#

mem::uninitialized is a little easier to use than MU
But we cannot make MU quite as easy that's the whole point
It's not bad

tame jewel Nov 9, 2022, 5:31 PM

#

On a separate note, I contend that saying 'mem::uninitialized is always incorrect' doesn't make it any easier.
Because it fails to explain the concept of MU, the correct explanation is inconsistent with the messaging.

#

The correct semantics of MU are recognizing that that the defined semantics of both methods are the same.
Initializing with 0x1 is just the compiler's freedom to choose / do anything undefined semantics.

#

And it's not complete enough because a fairly relevant point is that different reads may ''return'' different values.

#

The choice of filling with 0x1 could easily be confused with the statement that uninitialized is freeze.
Which it isn't.

#

Evidence: https://dtolnay.github.io/noisy-clippy/uninit_assumed_init.html#local
There's a grep for ignoring uninit_assumed_init. You're welcome to take a look and judge if they are sound.

There's gems such as these (dup-crypto):

        let mut tmp: [u8; 4] = MaybeUninit::uninit().assume_init();
        ptr::copy_nonoverlapping(input.get_unchecked(0), tmp.as_mut_ptr(), 4);
        u32::from_le_bytes(tmp)

That seems to have an obvious underlying usability issue.

tender nimbus Nov 9, 2022, 5:55 PM

#

tame jewel Evidence: https://dtolnay.github.io/noisy-clippy/uninit_assumed_init.html#local ...

all of them are unsound

#

if they were sound the lint wouldn't fire

#

to fix these you'd do:
Change the type from an array of things to an array of MU<thing>
Slightly change the code accessing it

#

we literally cannot make it easier apart from the initialization of uninit arrays which isn't too bad what do you want

tame jewel Nov 9, 2022, 6:36 PM

#

I want for people to write sound code. The full context is this:

fn read_u32_le(input: &[u8]) -> u32 {
    assert_eq!(input.len(), 4);
    unsafe {
        let mut tmp: [u8; 4] = MaybeUninit::uninit().assume_init();
        ptr::copy_nonoverlapping(input.get_unchecked(0), tmp.as_mut_ptr(), 4);
        u32::from_le_bytes(tmp)
    }
}

You wouldn't even have to cast anything, but for some reason the try_into().unwrap() was less preferred to the author than a literal unsafe block.
I'm not a behavioral scientist but some line of reasoning caused this and I'd really like to know what.
There's some underlying misinformation; and maybe the lint could be more specific in resolving this misinformation and demonstrating alternatives; and maybe that would increase adoption of appropriate uses; especially when the author is in the process of transitioning away from mem::uninitialized().

wide acorn Nov 9, 2022, 6:39 PM

#

The .try_into().unwrap() conversion isn't immediately obvious

#

You'd have to think to look all the way down in the TryFrom impls to find it

#

That's one of my main annoyances with rustdoc, sometimes critical functionality can get hidden among the endless boilerplate impls

tender nimbus Nov 9, 2022, 7:00 PM

#

i don't like From that much

#

because of it

tender nimbus Nov 9, 2022, 7:01 PM

#

tame jewel I want for people to write sound code. The full context is this: ```rust fn rea...

but btw, to make this sound using MU (you should of course be using try_into) ```diff

let mut tmp: [u8; 4] = MaybeUninit::uninit().assume_init();

let mut tmp: [MaybeUninit<u8>; 4] = [MaybeUninit::uninit(); 4];

```
    u32::from_le_bytes(tmp)
```

    u32::from_le_bytes(tmp.assume_init())

it's really easy, isn't it?

#

(also that get_unchecked hurts lol)

tame jewel Nov 9, 2022, 7:02 PM

#

tender nimbus (also that `get_unchecked` hurts lol)

yeah, it's wrong too due to provenance..

tender nimbus Nov 9, 2022, 7:03 PM

#

nah it's not ub just extremely unnecessary

tame jewel Nov 9, 2022, 7:04 PM

#

tender nimbus but btw, to make this sound using MU (you should of course be using `try_into`) ...

That does seem appropriate. Is there some analysis that would allow adding this as a lint, can it be generalizable somewhat readily?

tender nimbus Nov 9, 2022, 7:04 PM

#

wait no it is found when references shrink provenance

tender nimbus Nov 9, 2022, 7:05 PM

#

tame jewel That does seem appropriate. Is there some analysis that would allow adding this ...

i guess we could? but also it's often a little more than that (for example you'd need to change a field type as well)

tame jewel Nov 9, 2022, 7:06 PM

#

Maybe if the only use of the uninitialized assignment is a move, it would be possible to recognize when the assume_init() could be delayed until the move?

#

Scratch 'only use', that's nonsense. Let's say, the last or something

haughty mica Nov 9, 2022, 11:50 PM

#

The fact that the author of this is using get_unchecked(0) instead of as_ptr() makes me think it is not worth reading into the thought behind this code because it clearly wasn't reviewed carefully anyway

#

I see all of your objections to this work and idk. They don't bother me? We are ahead of the curve. We are trying to get people to patch their code before the release where a new compiler turns a previously-working program into a pile of vulnerabilities or a crashing mess. Of course people are going to be upset or skeptical. There are still a lot of people who are butthurt about the UB in their C which is actively exploited by existing compilers.

haughty mica Nov 9, 2022, 11:58 PM

#

tender nimbus wait no it is found when references shrink provenance

This specific case will probably be fine actually, I think.

#

It isn't fine for SB but it should be fine in the next model I think

tender nimbus Nov 10, 2022, 6:27 AM

#

yeah that's what I was trying to say with words that don't make sense

thorny venture Nov 11, 2022, 9:03 PM

#

@haughty mica why do I see you in a twitter space about crypto ferrisPensive

#

Fix more UB ferrisPlead

haughty mica Nov 11, 2022, 9:05 PM

#

Molly White is in here

#

Ergo, not that kind of space

#

You should join for the schadenfreude

thorny venture Nov 11, 2022, 9:05 PM

#

Ok you are forgiven. Web3 is going great ferrisUwU

haughty mica Nov 11, 2022, 9:15 PM

#

Hot damn the cope in here is unreal

thorny venture Nov 11, 2022, 9:21 PM

#

I do love a bit of copium

tender nimbus Nov 11, 2022, 9:31 PM

#

lmao

#

i love crypto (bros loosing their shit)

neon prism Nov 11, 2022, 9:33 PM

#

this reads like a song title

tender nimbus Nov 11, 2022, 9:39 PM

#

hi pen ping

neon prism Nov 11, 2022, 9:39 PM

#

at your service

thorny venture Nov 11, 2022, 9:39 PM

#

neon prism this reads like a song title

to the tune of jingle bells

Crypto bros
Crypto bros
Mining all the way
Oh what fun it is to ride
This line up all day, hey!

Crypto bros
Crypto bros
Exchange went bust today
Oh what fun it is to see
No more tokens today, hey!

thorny venture Nov 14, 2022, 2:38 PM

#

I have UB and idk why ferrisPensive

#

https://github.com/conradludgate/futures-buffered/actions/runs/3462352787/jobs/5781129414

Essentially I have ArcSlice which has a pointer to ArcSliceInner. This contains ArcSliceInnerMeta and [ArcSlotInner].

ArcSlot is supposed to have the same ownership over the entire arc as ArcSlice, but points to a single ArcSlotInner only.

I can track back the ArcSlotInner pointer to a ArcSliceInnerMeta pointer fine with no UB according to miri, but I can't turn it into a full on ArcSliceInner without triggering the stacked borrow error

dry frost Nov 14, 2022, 3:29 PM

#

thorny venture <https://github.com/conradludgate/futures-buffered/actions/runs/3462352787/jobs/...

https://github.com/conradludgate/futures-buffered/blob/aac79ec3a6d0f8f8943a87bcfb487ee171c3c441/src/arc_slice.rs#L243

- let ptr = Self::meta_raw(ptr);
+ let ptr = Self::meta_raw_ptr(ptr);

thorny venture Nov 14, 2022, 3:30 PM

#

Oh 👀

dry frost Nov 14, 2022, 3:31 PM

#

Having raw for a non-fully-raw function might have been the footgun that lead to this 😄

#

meta_raw
meta_rawer
meta_rawest_of_them_raw_things

#

Also, there could be a lint for things such as let len = *core::ptr::addr_of!(ptr.len); when ptr is a & reference

thorny venture Nov 14, 2022, 3:34 PM

#

Yep, that fixed it...

#

Very facepalm moment

thorny venture Nov 14, 2022, 3:35 PM

#

dry frost Also, there could be a lint for things such as `let len = *core::ptr::addr_of!(p...

I originally had it as (*ptr).len and then the lint complained so I thought "sure I guess"

ebon orbit Nov 14, 2022, 3:35 PM

#

hey, at least you fixed it

#

ferrisOwO

thorny venture Nov 14, 2022, 3:36 PM

#

ty yandros ferrisUwU

thorny venture Nov 14, 2022, 3:56 PM

#

Now I just have to find the source of the regression

#

oh wait shit - it's a benchmark issue 😛

tender nimbus Nov 14, 2022, 4:11 PM

#

the usual

thorny venture Nov 14, 2022, 4:13 PM

#

I swapped the order of the benchmarks and they flipped their speed

#

I have an interesting deadlock now though 😦

tender nimbus Nov 14, 2022, 4:20 PM

#

lol

thorny venture Nov 14, 2022, 4:25 PM

#

It's very sus too

#

It only happens on concurrency > 128

#

See how the last one is 128... ferrisSus

    #[tokio::test]
    async fn join_all_large() {
        let mut start = Instant::now();
        for i in 1..256 {
            crate::join_all((0..i).map(|i| async move {
                tokio::time::sleep(Duration::from_micros(1)).await;
                i
            }))
            .await;
            println!("join {i} in {:?}", start.elapsed());
            start = Instant::now();
        }
    }

dry frost Nov 14, 2022, 4:33 PM

#

Do you think it could be a tokio-specific issue?

thorny venture Nov 14, 2022, 4:33 PM

#

potentially. But it doesn't happen with the futures version of join_all/FuturesUnordered

dry frost Nov 14, 2022, 4:35 PM

#

if waker.is_the_one_from_futures() && concurrency <= 128 {
    work_properly()
}

thorny venture Nov 14, 2022, 4:36 PM

#

😮

#

it's a conspiracy

thorny venture Nov 14, 2022, 5:09 PM

#

Ok - this time it got further than 128 but I found the symptom that seems to cause the busy looping. It's happily polling along (the bounded.rs:183 is the poll call) and then it got unlucky and got stuck in a poll-wake cycle - now I'm not sure why it doesn't progress in this case

thorny venture Nov 15, 2022, 8:44 AM

#

dry frost Do you think it could be a `tokio`-specific issue?

I think it is actually

#

https://toot.conrad.cafe/@conrad/109346904207092238

haughty mica Nov 15, 2022, 3:23 PM

#

I really hate the use of the term "toot"

#

Makes me think this is somehow a fart joke

ebon orbit Nov 15, 2022, 3:24 PM

#

ferrisBut

thorny venture Nov 15, 2022, 3:46 PM

#

haughty mica I really hate the use of the term "toot"

I'm a frequent tooter

dry frost Nov 15, 2022, 4:36 PM

#

do toot ferrisWhen

ebon orbit Nov 23, 2022, 12:45 PM

#

who is the OP of this thread

#

oh it's nils

#

nvm

tender nimbus Nov 23, 2022, 12:52 PM

#

ferrisOwO

thorny venture Nov 23, 2022, 1:45 PM

#

Owo

#

Nils loves UB

ebon orbit Nov 23, 2022, 3:16 PM

#

ferrisballSweat

ruby jacinth Nov 23, 2022, 3:50 PM

#

ferrischu

tender nimbus Nov 23, 2022, 4:19 PM

#

ferrisBorrowCheck

lilac otter Nov 29, 2022, 2:49 PM

#

sorry for the maybe super silly question, but I am struggling to find the link to the website that reports on the different bugs that miri is detecting in recent crate compiles.. is it somewhere higher up in the chat history?

ruby jacinth Nov 29, 2022, 2:54 PM

#

lilac otter sorry for the maybe super silly question, but I am struggling to find the link t...

that'd be saethlin's thing right

#

ah https://miri.saethlin.dev/

lilac otter Nov 29, 2022, 2:55 PM

#

ah yess, exactly, that is it! thank youu ! 😄

stable arrow Nov 29, 2022, 4:08 PM

#

it's also in the pinned messages in this thread! but discord doesn't make it easy to remember pinned messages exist…

calm inlet Nov 29, 2022, 5:39 PM

#

There are pinned messages? Where?

tender nimbus Nov 29, 2022, 5:43 PM

#

calm inlet Nov 29, 2022, 5:44 PM

#

Oh! Good hiding place

#

I'll have forgotten again by this time tomorrow

lilac otter Nov 30, 2022, 3:16 PM

#

very good hiding place! i'll try to remember ! 😄

lilac otter Nov 30, 2022, 3:34 PM

#

hey guys,i had quick question! i made a pull request to fix some UB for the lru-rs crate, yet I ended up going maybe a bit overboard in the process haha! i am still quite new to this, so i was wondering of anyone had time to review and give me advice for improvement/point out where i went wrong with things! https://github.com/jeromefroe/lru-rs/pull/161

GitHub

Change LruCache.map to hold a pointer, rather than owned `LruEntry`...

This is meant as more of rough proposal, and I do not expect this to be merged in its current form. The aim of this pull request is to showcase a potential fix to an miri error that is inherent to ...

#

the issue was the invalidation of a tag through a Unique retag, if that helps?

tender nimbus Nov 30, 2022, 4:01 PM

#

This doesn't look like a hue drastic change to me

#

This is just a classic replacement of box with a pointer

#

btw you should use NonNull instead of *mut

lilac otter Nov 30, 2022, 5:58 PM

#

oh, i see! i was not to sure! does the reasoning for it make sense?

#

also, what is the usual premise for using NonNull rather than *mut in these cases?

haughty mica Nov 30, 2022, 6:00 PM

#

Box can't be null, it has a niche. So Option<Box> is the same size as Box, but Option<*mut T> is the size of two pointers, because *mut T can be null.

lilac otter Nov 30, 2022, 6:05 PM

#

ahh, that makes a lot of sense! in my mind i was confusing ‘NonNull’ with ‘MaybeUninit’ which made it even more more confusing haha! does the niche optimization also apply for ‘MaybeUninit’ or is this also allowed to be ‘null’ and this not have a niche?

tough leaf Nov 30, 2022, 6:07 PM

#

MaybeUninit has no niches

#

even if the inner type does

#

because that would be a validity invariant on a type that explicitly has no validity invariants

#

i.e. what would Option<MaybeUninit<NonNull<T>>> do if you have Some(MaybeUninit::zeroed())

lilac otter Nov 30, 2022, 6:09 PM

#

that is fair! is that part of MaybeUninit’s contract, that is has absolutely no validity invariants?

stable arrow Nov 30, 2022, 6:12 PM

#

it has to be, since the point is that it can contain entirely uninitialized memory …
huh https://doc.rust-lang.org/stable/std/mem/union.MaybeUninit.html doesn't really explain that.

wide acorn Dec 16, 2022, 8:10 PM

#

Argh, I hate protectors

#

https://github.com/Logicalshift/desync/issues/10

GitHub

Theoretical UB between `Scheduler::sync_background()` and `Job::run...

I've been testing this crate some more in Miri, and I've found an intermittent issue where it reports UB. /* Patch the desync repo: --- a/src/scheduler/desync_scheduler.rs +++ b/s...

#

You aren't allowed to tell another thread to invalidate one of your &mut function arguments, even if you never use that reference again

haughty mica Dec 16, 2022, 9:26 PM

#

Is it really theoretical UB

#

Is this code pattern giving LLVM license to do an oopsie because of dereferenceable

wide acorn Dec 16, 2022, 9:28 PM

#

Technically, but there's no real reason for LLVM to dereference it again, since the reference is lexically never accessed again

#

This code takes a value out of the reference, then the other thread observes that the value is taken

#

In fact, LLVM would have to be extremely smart to know that accessing the value again wouldn't cause a data race with the other thread

haughty mica Dec 16, 2022, 9:38 PM

#

That's not how optimizers work

#

They specifically do not have to reason about "will this cause a data race"

wide acorn Dec 16, 2022, 9:39 PM

#

They do if they want the assembly to make any sense, data races come from the processor after all

haughty mica Dec 16, 2022, 9:40 PM

#

No, they don't

#

Optimizations don't reason about other threads

#

This is the fundamental reason that data races are UB

#

Any function can be optimized to hell and back without wondering about "what if another thread observes this"

#

And optimizations at the level of LLVM IR don't reason about assembly or processors

wide acorn Dec 16, 2022, 9:42 PM

#

haughty mica Optimizations don't reason about other threads

They have to assume that possibly-aliased memory may be modified on a synchronization operation

#

(but I'll grant that it's a bigger problem for noalias &mut)

haughty mica Dec 16, 2022, 9:42 PM

#

No, they don't

#

You're confusing the rules about reordering with reasoning about multiple threads

wide acorn Dec 16, 2022, 9:46 PM

#

Are they not inextricably interlinked?

#

Reasoning about multiple threads is needed to justify the reordering rules, is it not?

#

I'm talking about this effect:

#

?godbolt

use std::cell::Cell;
use std::sync::atomic::{self, Ordering};

pub fn example1(cell: &Cell<i32>) -> i32 {
    cell.set(42);
    cell.get()
}

pub fn example2(cell: &Cell<i32>) -> i32 {
    cell.set(42);
    atomic::fence(Ordering::AcqRel);
    cell.get()
}

small apexBOT Dec 16, 2022, 9:47 PM

#

example::example1:
        mov     dword ptr [rdi], 42
        mov     eax, 42
        ret

example::example2:
        mov     dword ptr [rdi], 42
        mov     eax, dword ptr [rdi]
        ret

wide acorn Dec 16, 2022, 9:49 PM

#

In example2(), the optimizer is forced to dereference dword ptr [rdi] again, because the value could have changed

#

I don't really care about the exact rule

#

but the fact that a synchronization operation gives other threads free reign to do unsynchronized accesses

#

Unless the current thread does its own unsynchronized accesses following the operation, like in this example

#

but in something like

pub fn example3(cell: &Cell<i32>) {
    cell.set(42);
    atomic::fence(Ordering::AcqRel);
}

#

The value behind cell can be modified arbitrarily between the fence and the function returning

#

So the optimizer can't insert a read from cell during that time and expect to get a meaningful value

#

And to return to the &mut case, consider

pub fn example4(r: &mut i32) {
    GLOBAL_CHANNEL.send(r as *mut i32).unwrap();
    atomic::fence(Ordering::AcqRel);
}

#

Since r has escaped, its value can be modified between the fence and the function returning

#

So once again, the optimizer can't expect to get a meaningful value from reading it after the fence

#

The only thing it knows is that it's dereferenceable, since that's what we promise for &mut parameters

#

Call it reordering rules or whatever you want, it doesn't change the fact that there's not much that LLVM can usefully perform with an access after the fence

tender nimbus Jan 3, 2023, 1:35 PM

#

lmao wee_alloc triggered a debug assertion

#

(https://www.reddit.com/r/rust/comments/1027a6q/why_does_regexregexbuilder_not_work_with_wasm_and)

#

least bad allocator crate

wide acorn Jan 3, 2023, 3:36 PM

#

lmao, allocators

#

https://github.com/rust-lang/rust/issues/101899, https://github.com/rust-lang/rust/issues/101899#issuecomment-1256851139

#

alloc::alloc() is an automatic footgun for the average person

#

The average person doesn't actually read preconditions, they just imagine what the preconditions probably say

haughty mica Jan 3, 2023, 4:05 PM

#

You didn't report any of those?

wide acorn Jan 3, 2023, 4:49 PM

#

Haven't gotten around to it yet

#

Now that the new Layout restriction is stabilized, they can't deflect with "But my libc always refuses such allocations!"

#

(Although I still maintain that your libc is dumb if it doesn't refuse malloc()s larger than PTRDIFF_MAX)

#

Oh, and I should really post a PR fixing GlobalAlloc's preconditions, unless we want to invoke stdlib privilege to let it create overlarge Layouts

haughty mica Jan 3, 2023, 5:03 PM

#

Many of those are libraries

#

So it doesn't matter what libc the author uses, they need to work with any global allocator the user installs

wide acorn Jan 3, 2023, 5:07 PM

#

Yeah, I know, it's just that I was trying to judge just how much of an immovable rock we'd have to move to require GlobalAlloc::alloc() to always fail past isize::MAX

#

As it turns out, the name of the immovable rock is likely "the BSDs"

hybrid forge Jan 3, 2023, 7:05 PM

#

from cxx code-source: https://docs.rs/crate/cxx/1.0.85/source/src/rust_string.rs

// ABI compatible with C++ rust::String (not necessarily alloc::string::String).
#[repr(C)]
pub struct RustString {
    repr: [MaybeUninit<usize>; mem::size_of::<String>() / mem::size_of::<usize>()],
}

impl RustString {
    pub fn from(s: String) -> Self {
        unsafe { mem::transmute::<String, RustString>(s) }
    }
…}

(Mobile sorry can’t find the backsticks.)

I’m confused, doesn’t the code and comment contradict themselves?

wraith wind Jan 3, 2023, 7:06 PM

#

If you're using iOS, you can long-press single-quote to use backticks

hybrid forge Jan 3, 2023, 7:07 PM

#

Oh nice

thorny venture Jan 3, 2023, 7:08 PM

#

Uhh

tender nimbus Jan 3, 2023, 7:08 PM

#

hybrid forge from cxx code-source: https://docs.rs/crate/cxx/1.0.85/source/src/rust_string.rs...

ABI compatible
this means for example the way it's passed as function arguments

#

for example there's no guarantee that String isn't just passed in 3 registers on days where rustc feels like it
and passed in 2 + a usize on the stack on other days

#

but this struct will always be passed the same (at least on extern "C" fn)

wide acorn Jan 3, 2023, 7:11 PM

#

I hope they verify somewhere that size_of::<string::String>() is a multiple of size_of::<usize>()

hybrid forge Jan 3, 2023, 7:11 PM

#

tender nimbus for example there's no guarantee that `String` isn't just passed in 3 registers ...

But how does transmute work in both days then? I thought it was only reading memory with different type, which for me is the same as ABI

thorny venture Jan 3, 2023, 7:11 PM

#

wide acorn I hope they verify somewhere that `size_of::<string::String>()` is a multiple of...

I think it's guaranteed that String is 2 usize + ptr

#

At least, vec is?

tender nimbus Jan 3, 2023, 7:11 PM

#

thorny venture I think it's guaranteed that String is 2 usize + ptr

this cannot be an ABI/layout guarantee

thorny venture Jan 3, 2023, 7:11 PM

#

Not a layout guarantee

hybrid forge Jan 3, 2023, 7:12 PM

#

gtg but will read your explanations later

thorny venture Jan 3, 2023, 7:12 PM

#

But any such padding will align it so that all the usizes will be usize aligned

wide acorn Jan 3, 2023, 7:12 PM

#

True, but if it's guaranteed to store the length and capacity as a usize, then the alignment will do the rest of the work

thorny venture Jan 3, 2023, 7:12 PM

#

So it must be a multiple if usize len?

tender nimbus Jan 3, 2023, 7:12 PM

#

hybrid forge But how does transmute work in both days then? I thought it was only reading mem...

transmute is basically equal to writing the struct to memory and then reading the other struct from memory

#

here, only layout matters

#

which will be equal

wide acorn Jan 3, 2023, 7:13 PM

#

That's assuming that align_of::<usize>() == size_of::<usize>(), which isn't entirely guaranteed

thorny venture Jan 3, 2023, 7:15 PM

#

wide acorn That's assuming that `align_of::<usize>() == size_of::<usize>()`, which isn't en...

Alignment 1 usize when ferrisClueless

hybrid forge Jan 3, 2023, 10:41 PM

#

wide acorn I hope they verify somewhere that `size_of::<string::String>()` is a multiple of...

It's not really an issue since it would be caught during compilation if it failed? Meaning the String and Rustring types didn't have the same size

wide acorn Jan 4, 2023, 12:21 AM

#

hybrid forge It's not really an issue since it would be caught during compilation if it faile...

Right, duh

#

crisis averted

#

Forgot that transmute protects against that

wide acorn Feb 14, 2023, 2:47 AM

#

Yay, I found unwind safety bug #320498530495783489057: https://gitlab.com/tspiteri/rug/-/issues/47

GitLab

Rational::mutate_numer_denom() is not unwind-safe (#47) · Issues · ...

After executing the user-provided closure, Rational::mutate_numer_denom() calls xmpq::canonicalize(self) to canonicalize the resulting value. However, if the closure unwinds, then self is never canonicalized, and its non-canonical value can...

tender nimbus Feb 14, 2023, 6:52 PM

#

ferrisOwO

wide acorn Feb 14, 2023, 10:13 PM

#

We really do need an "unsafe Rust tips and tricks" document

#

So that people don't about unwinding existing, or forget to check for isize::MAX before making an allocation

#

(The latter is now handled by Layout, but a whole lot of crates call Layout::from_size_align_unchecked())

tender nimbus Feb 20, 2023, 6:23 PM

#

https://github.com/kornelski/rust-rgb/pull/56

GitHub

ARGB/ABGR pixels are 4 bytes each, not 2 by saethlin · Pull Request...

https://miri.saethlin.dev/logs/rgb/0.8.35.html

#

what

#

😵‍💫 rgb is such a cursed crate

haughty mica Feb 20, 2023, 6:26 PM

#

Why do you say that?

tender nimbus Feb 20, 2023, 6:28 PM

#

i think it has had soundness issues before hasnt it

haughty mica Feb 20, 2023, 7:13 PM

#

Hoyl

#

It's just... Sloppy and has poor tests

#

https://github.com/kornelski/rust-rgb/pull/51

GitHub

Go through `as_mut_ptr` instead of `as_ptr` to mutate by Nilstrieb ...

Going through as_ptr does not give write provenance
since it creates an intermediary shared reference.

#

https://github.com/kornelski/rust-rgb/pull/33

GitHub

Fix the number of elements in BGRA structure by rucoder · Pull Requ...

#

Anyway, it should pass Miri finally so obviously that makes it good

tender nimbus Feb 20, 2023, 8:48 PM

#

haughty mica https://github.com/kornelski/rust-rgb/pull/51

i knew there was something ferrisBut

tame jewel Feb 20, 2023, 9:09 PM

#

tender nimbus i think it has had soundness issues before hasnt it

some, you could say.

#

still better than using *const char

tender nimbus Feb 20, 2023, 9:11 PM

#

with that you at least know what you're getting

tender nimbus Feb 26, 2023, 1:15 PM

#

https://crates.io/crates/crop new rope crate with unsafe code just dropped
i have run miri on some of the tests and it looks fine
but it would be cool if someone added all the necessary cfgs to make all tests work and then PRed miri ci

#

it uses str_indices which has a SIMD feature that doesn't disable all SIMD so you need to use an obscure target like aarch64 where it doesnt support SIMD

#

or make a PR to str_indices that properly gates all SIMD behind the feature

grim copper Feb 26, 2023, 9:27 PM

#

argh, why do all the rope crates restrict to utf8

#

I want a binary rope!

haughty mica Feb 26, 2023, 9:54 PM

#

I feel like that makes things easier?

tough leaf Feb 26, 2023, 10:25 PM

#

i don't know how easy it'd be to build a UTF-8 enforcing rope over a Rope<u8>

#

but a Rope<T> is a type that should exist

grim copper Feb 26, 2023, 10:38 PM

#

tough leaf but a `Rope<T>` *is* a type that should exist

agreed

thorny venture Feb 27, 2023, 7:21 AM

#

stack: Vec<Vec<Arc<Node<FANOUT, L>>>>
Crikey

grim copper Feb 27, 2023, 7:24 AM

#

we've found the antiperf

tough leaf Mar 5, 2023, 7:18 PM

#

oh dear
captnproto has some UB in it :(

#

ptr::copy_nonoverlapping precondition

#

at the very least

#

ah
they fail miri

haughty mica Mar 5, 2023, 7:33 PM

#

O u c h

tender nimbus Mar 5, 2023, 7:34 PM

#

rip

#

the ship has sunk with the captn

tough leaf Mar 5, 2023, 7:35 PM

#

is there any non-self describing binary file format that's not cursed

#

flatbuffers has the rust code say "yeah you need FFI into the C++ code in order to call the buffer verifier"
captnproto has this (but granted it's probably not exploitable or anything, just funky)

#

rkyv was also weird

#

fuck it, using bincode ferrisBut

tender nimbus Mar 5, 2023, 7:40 PM

#

simply fix it

tough leaf Mar 5, 2023, 7:40 PM

#

i'll fix the test failures that assume that vecs are aligned

#

the underlying allocation

#

overaligned i mean

#

(they have a Vec<u8> which no reasonable allocator would misalign. however.)

haughty mica Mar 5, 2023, 7:43 PM

#

tough leaf is there *any* non-self describing binary file format that's not cursed

Well, IMO a binary format that isn't self-describing is already cursed

grim copper Mar 5, 2023, 9:25 PM

#

haughty mica Well, IMO a binary format that isn't self-describing is already cursed

Self describing formats should really be more cursed imo, but they don’t usually have expectations of high perf so there’s more checks in practice

#

But there’s always things like .net object deserialisation which is a CVE hellscape

tender nimbus Mar 5, 2023, 9:26 PM

#

🥒:D

haughty mica Mar 5, 2023, 9:38 PM

#

grim copper Self describing formats should really be more cursed imo, but they don’t usually...

No I mean the lack of a description itself is cursed

#

Because they inevitably end up on disk without the description and then it's like what are we doing here

grim copper Mar 5, 2023, 9:39 PM

#

Oh yeah… if you’re putting it on disk then that’s much more cursed

grim copper Mar 5, 2023, 9:40 PM

#

tender nimbus 🥒:D

True also

haughty mica Mar 5, 2023, 9:56 PM

#

Most things eventually escape to disk

#

"it's just a wire format"
So what do I do with this core dump from the deserializer

#

The deserializer cannot be trusted as a description in this case; if it were trustworthy it wouldn't have dumped core

#

In addition, if the format doesn't describe itself exactly, you run into compatibility problems that have led to the erosion of protobuf

tough leaf Mar 5, 2023, 10:07 PM

#

have each message be prefixed with a plaintext URL on where to find the schema

#

(and internally publish and archive all schemas used, even temporary ones for dev work)

wide acorn Mar 8, 2023, 6:01 PM

#

Yay, I found an unsoundness in hashbrown now: https://github.com/rust-lang/hashbrown/issues/412

GitHub

`RawIter::{reflect_insert, reflect_remove}()` can unsoundly compare...

If the &Bucket<T> comes from a different RawTable<T, A> than the RawIter<T> comes from, then these methods sometimes call offset_from() between the...

#

(That's what happens when you wrap your function in a huge unsafe {} block without thinking about it)

haughty mica Mar 8, 2023, 8:57 PM

#

Nice.

#

Well at least nobody is using this API right?

wide acorn Mar 8, 2023, 10:15 PM

#

I love contrived trait impls

#

https://github.com/RustCrypto/traits/issues/1275

GitHub

`StreamCipherCoreWrapper::get_pos()` with a block size of 0 can rea...

The safety comment in the function says "pos is set only to values smaller than block size", but if the block size of the StreamCipherCore is equal to 0 (in a contrived scenario),...

wide acorn Mar 8, 2023, 10:54 PM

#

And it looks like they duplicated their code: https://github.com/RustCrypto/utils/issues/843

GitHub

block-buffer: `BlockBuffer::get_pos()` with a block size of 0 can r...

This is effectively another instance of RustCrypto/traits#1275. // RUSTFLAGS=-Cdebug-assertions=off cargo +nightly miri run // block-buffer = "=0.10.3" use block_buffer::{generic_...

tender nimbus Mar 9, 2023, 5:48 AM

#

generic_array ferrisballSweat

haughty mica Mar 9, 2023, 2:54 PM

#

Hush, it's fine now

wide acorn Mar 9, 2023, 2:54 PM

#

It has arithmetic, that's more than you can say for the silly "const generics" ferrisOwO

#

"nooo, you'd need a full SAT solver, it would be inconsistent", so many excuses ferrisClueless

thorny venture Mar 11, 2023, 1:30 PM

#

Forked a repo and clippy straight away found UB ferrisBut

#

This project does some cool stuff but the code is ferrisballSweat

tender nimbus Mar 11, 2023, 1:32 PM

#

uhhhhhhhhhh

#

you know it's bad when clippy finds ub

#

show repo ferrisBorrowCheck

thorny venture Mar 11, 2023, 2:12 PM

#

tender nimbus show repo <:ferrisBorrowCheck:774062955468685352>

https://github.com/fschutt/azul

GitHub

GitHub - fschutt/azul: Desktop GUI Framework

Desktop GUI Framework. Contribute to fschutt/azul development by creating an account on GitHub.

tender nimbus Mar 11, 2023, 2:14 PM

#

and what the clippy output?

thorny venture Mar 11, 2023, 2:18 PM

#

a lot of stuff

#

The blatant UB it found was

let mut stack_mem = mem::MaybeUninit::<U>::uninit();
ptr::copy_nonoverlapping((ptr as *mut c_void) as *const U, stack_mem.as_mut_ptr(), size_of::<U>());
//                                                            copy is not in bytes ^^^^^^^^^^^^^^

#

I'm not trying to fix it all because it's too much for me to go through. I don't need all the functionality it offers and I certainly don't need c interop so I'm getting rid of all that shit

#

Meanwhile

warning: the following packages contain code that will be rejected by a future version of Rust: allsorts-rental v0.5.6, allsorts_no_std v0.5.2
note: to see what the problems were, use the option `--future-incompat-report`, or run `cargo report future-incompatibilities --id 1`

tender nimbus Mar 11, 2023, 2:31 PM

#

lol

#

fucking rental fcw

tender nimbus Mar 11, 2023, 2:32 PM

#

thorny venture The blatant UB it found was ```rust let mut stack_mem = mem::MaybeUninit::::u...

DUDE WHAT

#

5.4k starts

#

thankfully/sadly it doesn't appear to be maintained anymore

thorny venture Mar 11, 2023, 2:33 PM

#

tender nimbus thankfully/sadly it doesn't appear to be maintained anymore

yeah that's why I'm just stealing the parts I need

tough leaf Mar 11, 2023, 4:47 PM

#

thorny venture The blatant UB it found was ```rust let mut stack_mem = mem::MaybeUninit::::u...

hahaha
amazing
that's actually a good lint, i did that the other day

glass chasm Mar 14, 2023, 8:55 AM

#

thorny venture The blatant UB it found was ```rust let mut stack_mem = mem::MaybeUninit::::u...

What the fuck ferrisBut

#

Did he think ptr::copy_nonoverlapping was just a shim around memcpy?!

thorny venture Mar 14, 2023, 9:48 AM

#

glass chasm Did he think `ptr::copy_nonoverlapping` was just a shim around `memcpy`?!

Tbh I have made that mistake myself a few times

wheat lava Mar 14, 2023, 9:49 AM

#

thorny venture Tbh I have made that mistake myself a few times

can you explain it i dont understand

thorny venture Mar 14, 2023, 9:50 AM

#

wheat lava can you explain it i dont understand

copy_nonoverlapping has a length for how many values to copy. In libc, memcpy counts how many bytes to copy

wheat lava Mar 14, 2023, 9:50 AM

#

ok

thorny venture Mar 14, 2023, 9:51 AM

#

It's pretty easy to automatically fall back to size_of(T) * n

wheat lava Mar 14, 2023, 9:51 AM

#

yes

#

but what about nonoverlapping

tough leaf Mar 14, 2023, 11:24 AM

#

that just means the source and destination byte ranges can't overlap

haughty mica Mar 14, 2023, 2:40 PM

#

glass chasm Did he think `ptr::copy_nonoverlapping` was just a shim around `memcpy`?!

This is a very reasonable mistake. A lot of code is written based on vibes and "look the tests pass"

We have a clippy lint for this because it's such a common mistake.

proper belfry Mar 14, 2023, 4:53 PM

#

wheat lava but what about nonoverlapping

memcpy also assumes src and dst don’t overlap

#

rust just has a better name for it ferrisBut

#

although tbh ptr::copy should be called ptr::copy_overlapping

wheat lava Mar 14, 2023, 4:54 PM

#

yea

still hearth Mar 14, 2023, 4:58 PM

#

eh, seems similar to [T]::sort vs [T]::sort_unstable

#

in that the shorter name goes to the one with stronger guarantees (preserves order of equal items/works even if overlapping) but that's less optimized if those aren't needed

tough leaf Mar 14, 2023, 5:00 PM

#

yeah, using copy when you mean copy_nonoverlapping is always correct

#

same for sort instead of sort_unstable

proper belfry Mar 14, 2023, 5:14 PM

#

still hearth eh, seems similar to `[T]::sort` vs `[T]::sort_unstable`

I also think that sort should’ve been called sort_stable ferrisBut (and keep sort_unstable also)

#

Optimization missed opportunities should be made clear in code

#

Or if it requires it, the invariant should be made explicit

tough leaf Mar 14, 2023, 5:17 PM

#

we do have a lint for "you're using sort on something that doesn't need it"

#

but maybe it should also use specialisation

#

pub trait IndistinguishableEq

proper belfry Mar 14, 2023, 5:19 PM

#

I’d prefer ```rs
trait PartialEq {
fn is_indistinguishable(&self) -> bool { false }
}

#

or maybe we could have fn sort have a bound for IndistinguishableEq ferrisThonk

tough leaf Mar 14, 2023, 5:22 PM

#

no because you might not care

#

where things end up that compare equal

#

even if you can distinguish

grim copper Mar 15, 2023, 5:35 AM

#

glass chasm Did he think `ptr::copy_nonoverlapping` was just a shim around `memcpy`?!

wait isn't it?

#

in what case isn't it a memcpy / memcpy analog?

glass chasm Mar 15, 2023, 5:36 AM

#

It's semantically equivalent, but count is the amount of instances you want to copy, not the byte size of the type

#

This guy was passing size_of::()

grim copper Mar 15, 2023, 5:37 AM

#

oh right yeah

#

I missed that detail

#

not to mention the whole mem::MaybeUninit::::uninit() thing

glass chasm Mar 15, 2023, 5:39 AM

#

grim copper not to mention the whole `mem::MaybeUninit::::uninit()` thing

This is safe

#

He's using a ptr to the MaybeUninit as the write-out destination

grim copper Mar 15, 2023, 5:39 AM

#

wouldn't that be UB for the same reason that std::mem::uninit is UB?

glass chasm Mar 15, 2023, 5:39 AM

#

No

#

MaybeUninit is a union

grim copper Mar 15, 2023, 5:39 AM

#

oh rip, I thought it was assume_init not ::uninit

glass chasm Mar 15, 2023, 5:39 AM

#

mem::uninitialized returns an uninitialized T. MaybeUninit is a union between T and ()

grim copper Mar 15, 2023, 5:40 AM

#

I obviously haven't gotten enough sleep

proper belfry Mar 15, 2023, 8:44 AM

#

tough leaf no because you might not care

Then sort_unstable makes that explicit in code

#

The ideä is just that .sort() with no qualification should only occur when it literally cannot matter

wide acorn Mar 23, 2023, 12:26 AM

#

Yay, I love unsafe code that takes &mut self and panics partway through modifying it

#

https://gitlab.com/tspiteri/rug/-/issues/49

GitLab

Rational::mutate_numer_denom() can expose a denominator of zero (#4...

xmpq::canonicalize() panics when the denominator is zero, but catching this panic will leave the previous non-canonical value in place.

tender nimbus Mar 23, 2023, 5:23 AM

#

ferrisballSweat

wheat lava Mar 23, 2023, 6:21 AM

#

wide acorn Yay, I love unsafe code that takes `&mut self` and panics partway through modify...

me too

#

❤️

thorny venture Mar 23, 2023, 7:12 AM

#

Ok rug is cursed ferrisballSweat

#

Why bother with gmp ferrisballSweat

hexed basin Mar 23, 2023, 8:21 AM

#

malachite or num-bigint my beloved

wide acorn Mar 23, 2023, 1:51 PM

#

thorny venture Why bother with gmp <:ferrisballSweat:678714352450142239>

I think once I rewrote my application in rug and it was faster

proper belfry Mar 25, 2023, 8:53 PM

#

hexed basin malachite or num-bigint my beloved

I like how this is an or, so you actually only love one of them, but you’re not telling us which one

hexed basin Mar 25, 2023, 8:55 PM

#

proper belfry I like how this is an or, so you actually only love one of them, but you’re not ...

Well malachite is GPL, so that tends to stop most people 👀

proper belfry Mar 25, 2023, 8:56 PM

#

GPL :(

tender nimbus Mar 25, 2023, 8:56 PM

#

gnu poland

proper belfry Mar 25, 2023, 8:56 PM

#

I want a Rust-specific license like MPL but per-crate

tender nimbus Mar 25, 2023, 8:56 PM

#

for our polish gnu lovers

proper belfry Mar 25, 2023, 8:56 PM

#

so non-viral but still copyleft enough for libraries

still hearth Mar 25, 2023, 9:01 PM

#

hexed basin Well malachite is GPL, so that tends to stop most people 👀

LGPL, not GPL. still an obstacle for some people but less so than GPL

hexed basin Mar 25, 2023, 9:02 PM

#

ah right

still hearth Mar 25, 2023, 9:03 PM

#

iirc it's LGPL because many of the algorithms are sufficiently close translations of GMP algorithms that it could reasonably be considered at least partially a derivative work, and GMP is itself LGPL-licensed

hexed basin Mar 25, 2023, 9:03 PM

#

interesting

haughty mica Mar 25, 2023, 9:17 PM

#

I have one GPL project for that reason

#

🙃

#

Turns out I ported the GPL algorithm wrong though so I might just try to reinvent it from scratch and switch to MIT/Apache-2.0

hexed basin Mar 25, 2023, 9:18 PM

#

based

tender nimbus Mar 30, 2023, 7:43 PM

#

https://github.com/jonhoo/inferno/blob/bf0d00159be8ca33454c71009ca3d583197c211a/src/collapse/dtrace.rs#L458 lol

haughty mica Mar 30, 2023, 8:00 PM

#

Y tho

hexed basin Mar 30, 2023, 8:01 PM

#

lmao

dry frost Mar 30, 2023, 8:50 PM

#

I guess they did want to prove utf-8 resilience somewhere else in the code, but that format! is ferrisSob

#

https://github.com/jonhoo/inferno/pull/289/files

wide acorn Mar 30, 2023, 10:05 PM

#

This wouldn't be an issue if we had format!(b"...") ferrisBut

sudden hawk Apr 1, 2023, 8:06 AM

#

tough leaf have each message be prefixed with a plaintext URL on where to find the schema

bzzzt - this fails the "I can decode the file with no network access" test

tough leaf Apr 1, 2023, 10:17 AM

#

sudden hawk bzzzt - this fails the "I can decode the file with no network access" test

it would be informational only

#

you should know the schema of what you're decoding

#

somehow Google manages to use non-self describing formats in prod just fine

wide acorn Apr 1, 2023, 2:18 PM

#

So we're inventing XML schemas all over again?

#

Make it an URN instead of an URL while we're at it

sudden hawk Apr 1, 2023, 4:24 PM

#

oh yeah that's okay then

wide acorn Apr 29, 2023, 5:55 PM

#

https://github.com/nix-rust/nix/issues/2028

GitHub

`kernel_version` can cause a data race when called from multiple th...

In src/features.rs, kernel_version() is defined as: fn kernel_version() -> Result<usize> { static mut KERNEL_VERS: usize = 0; unsafe { if KERNEL_VERS == 0 { KERNEL_VERS = parse_kernel_vers...

#

naughty nix, using a static mut

haughty mica Apr 29, 2023, 5:59 PM

#

O u c h

#

Before I started using nextest I was using multiple test threads specifically to detect this, but the timeouts from nextest are just too good

tender nimbus Apr 29, 2023, 6:04 PM

#

the fuck

wide acorn Apr 29, 2023, 7:18 PM

#

annoyingly, I can't demonstrate it in Miri, since Miri doesn't support uname

haughty mica Apr 29, 2023, 7:26 PM

#

You can fix that

#

You have the power

#

And/or open an issue and one of the Miri helper people will swoop by

#

There are a few community members who will contribute easy shims if you hold their hand a bit. It's very cool.

tender nimbus Apr 29, 2023, 7:27 PM

#

I think this is one of the cases where you don't need any demonstration fepher

hexed basin Apr 29, 2023, 7:34 PM

#

static mut bad is an absolute zero take, after all

flat inlet Apr 29, 2023, 8:02 PM

#

wide acorn https://github.com/nix-rust/nix/issues/2028

Made a PR https://github.com/nix-rust/nix/pull/2029

GitHub

Remove 'static mut' usage in features::os::kernel_version. by zachs...

Resolves #2028
Note that this is (AFAICT) the first use of Atomic* types in nix (other than tests). However, this shouldn't be a portability issue, since nix is not #![no_std], and (IIUC) std r...

tender nimbus Apr 29, 2023, 8:17 PM

#

dont you love it when you make a pr and ci fails horribly for reasons you didnt cause

tough leaf Apr 29, 2023, 8:19 PM

#

OnceLock stable when :(

#

or rather

#

wait is that getting stablised

tender nimbus Apr 29, 2023, 8:19 PM

#

yes

tough leaf Apr 29, 2023, 8:19 PM

#

(i still don't like the name :( )

tender nimbus Apr 29, 2023, 8:20 PM

#

lol, this code was added in 2014

#

https://github.com/nix-rust/nix/commit/c976be575f4fefcf03a70c5b87f898388150c8aa ferrisClueless

tough leaf Apr 29, 2023, 8:21 PM

#

uint

#

jeeez

still hearth Apr 29, 2023, 8:30 PM

#

tough leaf OnceLock stable when :(

1.70

flat inlet Apr 29, 2023, 10:36 PM

#

tender nimbus dont you love it when you make a pr and ci fails horribly for reasons you didnt ...

I think I found the issue: rustix 0.37.16 started using linux_raw_sys::ioctl without enabling linux-raw-sys's ioctl feature on some targets.
https://github.com/bytecodealliance/rustix/pull/645
nix dev dep -> tempfile -> rustix so nix's tests don't compile on those targets.

GitHub

Add `ioctl` cargo feature for `linux-raw-sys` ... by zachs18 · Pull...

... when using libc backend.
May fix issue described in #637 (comment) and CI errors for nix under some Linux and Android targets, when using rustix >=0.37.16. (Tested locally)

flat inlet Apr 30, 2023, 12:37 AM

#

(Fixed in rustix 0.37.18)

tender nimbus Apr 30, 2023, 7:17 AM

#

cargo features ferrisClueless

haughty mica May 28, 2023, 11:13 PM

#

https://asan.saethlin.dev/ub updated btw. Might be more UB than before. Unclear.

grim copper May 29, 2023, 12:47 AM

#

libunwind-sys ferrisballSweat

#

hashbrown ferrisballSweat

haughty mica May 29, 2023, 12:48 AM

#

hashbrown is fine

grim copper May 29, 2023, 12:49 AM

#

oh yeah that's just allocation fail

haughty mica May 29, 2023, 12:49 AM

#

libunwind-sys could do with a look

#

I'm tweaking the settings to get allocation failures off the page

haughty mica May 29, 2023, 1:20 AM

#

Also there are some crates that are trying to mem::uninitialized types with a niche ferrisPlead

wide acorn May 29, 2023, 3:26 PM

#

Maybe they just really want 0x01-initialization and don't want to risk calling ptr::write_bytes() incorrectly ferrisClueless

hexed basin May 29, 2023, 3:30 PM

#

broke: using mem::uninitialized because you haven't heard of MaybeUninit
woke: relying on mem::uninitialized for simple RNG, which isn't so random anymore
bespoke: using mem::uninitialized to fill an array with 0x01

grim copper May 29, 2023, 3:31 PM

#

Wait for speed? Is there a speed difference?

hexed basin May 29, 2023, 3:31 PM

#

nah they just couldn't be arsed to switch to MaybeUninit

grim copper May 29, 2023, 3:31 PM

#

Oh gotcha, dev speed

#

Using uninitialized for an RNG

tough leaf May 29, 2023, 3:44 PM

#

grim copper Using uninitialized for an RNG

 fn random_seed(_: &Path, _: &str) -> [u64; 2] { 
     use std::mem::uninitialized as rand; 
     unsafe { [rand::<u64>() ^ 0x12345678, rand::<u64>() ^ 0x87654321] } 
 }

grim copper May 29, 2023, 3:45 PM

#

Ah yes, very sound and normal

thorny venture May 29, 2023, 3:45 PM

#

Lgtm

hexed basin May 29, 2023, 3:45 PM

#

god that alias is so evil

tough leaf May 29, 2023, 3:45 PM

#

use std::mem::uninitialized as rand;
deeply deeply cursed line

dry frost May 30, 2023, 10:10 AM

#

hexed basin broke: using `mem::uninitialized` because you haven't heard of `MaybeUninit` wok...

fn random() -> u8 {
    0x01 // determined by fair dice-roll
}

wheat lava May 30, 2023, 10:39 AM

#

dry frost ```rs fn random() -> u8 { 0x01 // determined by fair dice-roll } ```

const RAND_VALUE: u8 = 1; // determined by fair dice-roll

wraith wind May 30, 2023, 10:42 AM

#

dry frost ```rs fn random() -> u8 { 0x01 // determined by fair dice-roll } ```

Not making mem::uninit produce 0x04 was a missed opportunity ferrisClueless

tough leaf May 30, 2023, 11:17 AM

#

lmao
cons: breaks bool

pros: funny

dry frost May 31, 2023, 12:43 PM

#

tough leaf lmao cons: breaks bool pros: funny

https://tenor.com/view/money-dollars-cash-rich-shut-up-and-take-my-money-gif-3555042

Tenor

shut up!

▶ Play video

#

?miri

#![cfg_attr(all(), feature(generic_const_exprs), allow(incomplete_features))]

// Safety: bruh.
unsafe trait Randomizable {}

trait Get : Randomizable {
    const RANDOM: Self;
}

macro_rules! Sized {( $T:ty $(,)? ) => ( [(); ::core::mem::size_of::<T>()] )}

impl<T : Randomizable> Get for T
where
    Sized!(T) :,
{
    const RANDOM: Self = unsafe {
        #[repr(u8)]
        #[derive(Clone, Copy)]
        enum TheRandomValue {
            /// Determined by fair dice roll
            Is = 0x01,
        }

        #[repr(C)] // <- EDIT: whoops
        union Bruh<T>
        where
            Sized!(T) :,
        {
            src: [TheRandomValue; ::core::mem::size_of::<T>()],
            dst: ::core::mem::ManuallyDrop<T>,
        }
        ::core::mem::ManuallyDrop::into_inner(Bruh {
            src: [TheRandomValue::Is; ::core::mem::size_of::<T>()],
        }.dst)
    };
}

unsafe impl Randomizable for u16 {}

const RAND_VALUE: u16 = u16::RANDOM;

dbg!(RAND_VALUE);

small apexBOT May 31, 2023, 12:58 PM

#

[src/main.rs:42] RAND_VALUE = 257```

tough leaf May 31, 2023, 1:18 PM

#

is this.... const mem::uninit

#

what the fuck

hexed basin May 31, 2023, 2:13 PM

#

AHHHHHHH

#

I petition for forbidding Yandros to write code, for the sake of everyone's sanity

coral trout May 31, 2023, 2:19 PM

#

I petition for Yandros to write all the code for everyone

dry frost May 31, 2023, 2:42 PM

#

hexed basin I petition for forbidding Yandros to write code, for the sake of everyone's sani...

You have my vote

grim copper May 31, 2023, 2:52 PM

#

Reimplement coq out of rust macros

dry frost May 31, 2023, 5:45 PM

#

-days-since missing repr(C) on union (#981649653764333598 message)

versed wingBOT May 31, 2023, 5:45 PM

#

Days since last missing repr(C) on union in #981649653764333598: 0

tough leaf May 31, 2023, 6:19 PM

#

hahaha

tiny cedar Jun 7, 2023, 1:27 AM

#

dry frost -days-since missing repr(C) on union (https://discord.com/channels/2735342393104...

I really want to just require that all union fields are at the same offset in repr(Rust). When everything overlaps anyway, I don't see any reason that there'd be a need to ever do anything else, and everyone assumes it's true anyway.

dry frost Jun 7, 2023, 7:52 AM

#

tiny cedar I really want to just require that all union fields are at the same offset in `r...

If it overlaps yes, that's quite plausible (no reason to artificially make the whole union bigger), but I could imagine

#[repr(Rust)]
union U {
    a: InOrder<u8, bool>,
    b: bool,
}

being laid out with .b starting at offset 1

tough leaf Jun 7, 2023, 7:57 AM

#

assuming unions have niches

#

which isn't a given, and would be nontrivial to do

tiny cedar Jun 7, 2023, 8:20 AM

#

dry frost If it overlaps yes, that's quite plausible (no reason to artificially make the w...

It's just not at all obvious to me that that's better in any way than having all the fields at the same offset. Given that nobody could rely on it, what would it make better for anyone?

dry frost Jun 7, 2023, 8:21 AM

#

tiny cedar It's just not at all obvious to me that that's *better* in any way than having a...

To make Option smaller etc. I guess 🤷

#

FWIW, I'm not saying this is better, and imho I do think this shouldn't really be the default repr (#[repr(niched)] pls ferrisPlead ); but it's at least something plausible.

tiny cedar Jun 7, 2023, 8:24 AM

#

dry frost To make `Option` smaller _etc._ I guess 🤷

Right, but that requires that unions have niches, which today they don't, which Ralf at least seems pretty sure about.

dry frost Jun 7, 2023, 8:25 AM

#

tiny cedar Right, but that requires that unions have niches, which today they don't, which ...

Ah, I wasn't caught up with that aspect, my bad

#

Even more reason to at least not have it as default repr

tiny cedar Jun 7, 2023, 8:26 AM

#

I'm not personally convinced that not having niches is right -- it penalizes the "I just wanted to store two things" cases -- but it is what it is right now.

dry frost Jun 7, 2023, 8:26 AM

#

So yeah, in that case I fail to see any other plausible reason to use non-0 offsets for union variants under the default repr

tiny cedar Jun 7, 2023, 8:27 AM

#

And MaybeUninit::<MyUnion>::uninit()::assume_init() is thus not insta-UB for unions right now, and thus changing that might be difficult because it would make currently-sound code into insta-UB-but-still-compiles code, which is the worst kind of breaking change.

tiny cedar Jun 7, 2023, 8:28 AM

#

dry frost FWIW, I'm not saying this is better, and imho I do think this shouldn't really b...

I really wish that "all unions can have niches" was the default, though, since adding a unit field to a union is such an obvious way to opt-in to the "nope, fully uninit is legal too" behaviour if people want that.

tough leaf Jun 7, 2023, 8:29 AM

#

tiny cedar And `MaybeUninit::<MyUnion>::uninit()::assume_init()` is thus not insta-UB for u...

would be interesting to run Miri against the ecosystem with that change and see what things actually break with it

dry frost Jun 7, 2023, 8:29 AM

#

tiny cedar I really wish that "all unions can have niches" *was* the default, though, since...

That was my naïve intuition as well

tough leaf Jun 7, 2023, 8:29 AM

#

is it a few crates or is it a ton

#

though yes Miri can't test FFI

tiny cedar Jun 7, 2023, 8:30 AM

#

But who knows, maybe we'll get repr(strict) on unions one day, or something

tough leaf Jun 7, 2023, 8:31 AM

#

repr(active variant rule)

#

for when you're not going to be doing transmutations using the union

verbal laurel Jun 12, 2023, 5:31 PM

#

tough leaf ```rust fn random_seed(_: &Path, _: &str) -> [u64; 2] { use std::mem::uni...

tough leaf Jun 12, 2023, 5:49 PM

#

verbal laurel

?eval unsafe { std::mem:: uninitialized::<[u8; 128]>() }

small apexBOT Jun 12, 2023, 5:49 PM

#

[1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]```

tough leaf Jun 12, 2023, 5:50 PM

#

we don't return uninit data anymore :)

coral trout Jun 12, 2023, 6:16 PM

#

The problem with relying on UB is that it may not do what you expect :P

grim copper Jun 12, 2023, 10:55 PM

#

tough leaf ?eval `unsafe { std::mem:: uninitialized::<[u8; 128]>() }`

Decided by fair dice roll

west violet Jun 13, 2023, 2:55 AM

#

I knew Nils made it "all ones" but I thought that meant !0

grim copper Jun 13, 2023, 3:02 AM

#

time to depend on this behaviour

#

pub use std::mem::uninit as fill_ones```

haughty mica Jun 13, 2023, 3:02 AM

#

west violet I knew Nils made it "all ones" but I thought that meant `!0`

Wouldn't be a valid bool

west violet Jun 13, 2023, 3:03 AM

#

0b1111111 is definitely not a valid bool

#

1 is, however

grim copper Jun 13, 2023, 3:04 AM

#

is bool always 0 / 1 byte?

west violet Jun 13, 2023, 3:04 AM

#

Yes

west violet Jun 13, 2023, 3:06 AM

#

haughty mica Wouldn't be a valid bool

Wait you want it to be a valid bool?

flat inlet Jun 13, 2023, 3:08 AM

#

IIUC the whole point of 1-filling std::mem::uninitialized() was to make it stop being insta-UB to use for primitive types like ints; if the mitigation didn't make it not insta-UB for bool when it could have, that would be less than ideal.
(Obviously ideally no-one would have been using it in a UB manner to begin with ferrisClueless )

#

(and not 0-fill because then its insta-UB for NonNull things)

west violet Jun 13, 2023, 3:08 AM

#

Ideally using uninitialized would actively try to break your assumptions

#

(In debug mode)

haughty mica Jun 13, 2023, 3:19 AM

#

The point of 1-filling is that it doesn't turn into surprise compilation

west violet Jun 13, 2023, 3:23 AM

#

Fair ig

tender nimbus Jun 13, 2023, 4:21 AM

#

west violet I knew Nils made it "all ones" but I thought that meant `!0`

that was Ralf ferrisClueless

west violet Jun 13, 2023, 4:22 AM

#

My b I thought you (or someone else here) implemented that

tiny cedar Jun 13, 2023, 6:39 AM

#

It makes all the primitives, as well as &str and NonZeroUsize, not trivially instant-UB with mem::uninitialized, which is the point. It doesn't solve everything, but it was an easy way to deal with the known-bad cases.

bitter nymph Jun 22, 2023, 7:37 PM

#

I don't have test cases pushed (I can do that tonight). But it looks like xz2 has some unaligned memory access when run on non x86 systems.

This only happens when I have the arm filter enabled. Can't run miri and I'm not that experienced in fixing these. Any help?

https://github.com/alexcrichton/xz2-rs/blob/1a82c40d6d80171b7df328aea43b7054acd10c44/src/stream.rs#L773

GitHub

xz2-rs/src/stream.rs at 1a82c40d6d80171b7df328aea43b7054acd10c44 · ...

Bindings to liblzma in Rust (xz streams in Rust). Contribute to alexcrichton/xz2-rs development by creating an account on GitHub.

tender nimbus Jun 22, 2023, 7:40 PM

#

do you know where the unaligned access happens? through a debugger or something like that

bitter nymph Jun 22, 2023, 7:41 PM

#

The code I linked to caused one of them. I can fix that with some as * const u64 (I think)

The other one that I put a print before and after to find is this one

https://github.com/alexcrichton/xz2-rs/blob/1a82c40d6d80171b7df328aea43b7054acd10c44/src/stream.rs#L793

GitHub

xz2-rs/src/stream.rs at 1a82c40d6d80171b7df328aea43b7054acd10c44 · ...

Bindings to liblzma in Rust (xz streams in Rust). Contribute to alexcrichton/xz2-rs development by creating an account on GitHub.

tender nimbus Jun 22, 2023, 7:44 PM

#

do you have any way to reproduce it

bitter nymph Jun 22, 2023, 7:52 PM

#

I can tonight

tiny cedar Jun 22, 2023, 8:01 PM

#

Any chance the new debug-mode alignment assertions trigger on it? If not, that could be a good bug too.

haughty mica Jun 22, 2023, 9:11 PM

#

bitter nymph I don't have test cases pushed (I can do that tonight). But it looks like xz2 ha...

What do you mean by "unaligned when not run on x86 systems"?
Unaligned accesses are unaligned only depending on the value of the pointer and the alignment of the pointee, the arch doesn't matter

bitter nymph Jun 22, 2023, 11:01 PM

#

haughty mica What do you mean by "unaligned when not run on x86 systems"? Unaligned accesses ...

For sure, I misspoke on that

haughty mica Jun 22, 2023, 11:02 PM

#

Cool cool

#

Also I looked up the docs for this and the stream API is confusing

bitter nymph Jun 22, 2023, 11:03 PM

#

yah the whole read/write for everything is weird imo

#

If that's what you're referencing

#

I'm not at home rn, but something like this.
I'm not actually sure this is UB, looks like just the wrong bindings (which is UB?). I can provide a gdb(gef) output later. Notice running it on x86_64-unknown-linux-gnu works fine.

use xz2::stream::{Filters, LzmaOptions, MtStreamBuilder};

fn main() {
    let dict_size = 0x40000;

    let mut opts = LzmaOptions::new_preset(6).unwrap();
    opts.dict_size(dict_size);

    let mut filters = Filters::new();

    filters.ia64();
    filters.arm();
    filters.arm_thumb();
    filters.lzma2(&opts);


    let stream = MtStreamBuilder::new()
        //.block_size(0x1000)
        .filters(filters)
        .check(xz2::stream::Check::Crc32)
        .encoder()
        .unwrap();
}

cross r --release --target arm-unknown-linux-musleabi

haughty mica Jun 22, 2023, 11:29 PM

#

Yeah idk what's going on here

#

Any filters makes this crash

bitter nymph Jun 22, 2023, 11:32 PM

#

Ah, just using -musl makes this crash?

> gdb ./target/x86_64-unknown-linux-musl/debug/xz-issue
GNU gdb (GDB) 13.1
Copyright (C) 2023 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.
Type "show copying" and "show warranty" for details.
This GDB was configured as "x86_64-pc-linux-gnu".
Type "show configuration" for configuration details.
For bug reporting instructions, please see:
<https://www.gnu.org/software/gdb/bugs/>.
Find the GDB manual and other documentation resources online at:
    <http://www.gnu.org/software/gdb/documentation/>.

For help, type "help".
Type "apropos word" to search for commands related to "word"...
Reading symbols from ./target/x86_64-unknown-linux-musl/debug/xz-issue...
warning: Missing auto-load script at offset 0 in section .debug_gdb_scripts
of file /home/wcampbell/projects/wcampbell/xz-issue/target/x86_64-unknown-linux-musl/debug/xz-issue.
Use `info auto-load python-scripts [REGEXP]' to list them.
(gdb) r
Starting program: /home/wcampbell/projects/wcampbell/xz-issue/target/x86_64-unknown-linux-musl/debug/xz-issue

Program received signal SIGSEGV, Segmentation fault.
0x00007ffff7f8a2f3 in lzma_mt_block_size (filters=0x7ffff7f638e0) at xz-5.2/src/liblzma/common/filter_encoder.c:237
237            if (fe->block_size != NULL) {

#

-gnu is fine ferrisAngry

bitter nymph Jun 23, 2023, 2:37 AM

#

I went ahead and pushed an issue: https://github.com/alexcrichton/xz2-rs/issues/114

GitHub

Invalid memory reference with filter on some archs · Issue #114 · a...

The following code: use xz2::stream::{Filters, LzmaOptions, MtStreamBuilder}; fn main() { let dict_size = 0x40000; let mut opts = LzmaOptions::new_preset(6).unwrap(); opts.dict_size(dict_size); let...

past fiber Oct 15, 2023, 3:05 AM

#

how does the website work?

haughty mica Oct 15, 2023, 3:20 AM

#

https://github.com/saethlin/crater-at-home
cargo r -- run --tool=miri --bucket=your-bucket-here
Then I have an Cloudfront Distribution in front of the bucket so that you get caching, TLS, and a nice URL.

GitHub

GitHub - saethlin/crater-at-home: We have Crater At Home

We have Crater At Home. Contribute to saethlin/crater-at-home development by creating an account on GitHub.

past fiber Oct 15, 2023, 3:23 AM

#

i see

#

so it runs the test suites of crates?

haughty mica Oct 15, 2023, 3:23 AM

#

Oh

past fiber Oct 15, 2023, 3:23 AM

#

im surprised theres so much ub

haughty mica Oct 15, 2023, 3:24 AM

#

It's a bit abstracted, but yes it runs the test suites of crates: https://github.com/saethlin/crater-at-home/blob/985667303836eca2aecf5735f3f78cbc94b843bc/docker/run.sh#L56-L63

past fiber Oct 15, 2023, 3:24 AM

#

also, did you make your own ansi2html crate?

haughty mica Oct 15, 2023, 3:24 AM

#

I started with the published one but yes I eventually rewrote every line in it I think

past fiber Oct 15, 2023, 3:24 AM

#

thats cool

haughty mica Oct 15, 2023, 3:25 AM

#

The crater-at-home CI setup uses it

#

It's kind of based

haughty mica Oct 15, 2023, 3:03 PM

#

This looks like a useful Solana exploit if someone wants it 😂 https://asan.saethlin.dev/logs/spl-token-2022/0.9.0

thorny venture Oct 15, 2023, 3:05 PM

#

Lmao

hexed basin Oct 15, 2023, 3:06 PM

#

owo

tough leaf Oct 15, 2023, 3:53 PM

#

haughty mica This looks like a useful Solana exploit if someone wants it 😂 https://asan.saet...

cryptoshit aside, damn since when did asan have good diagnostics

#

that looks rly nice

#

mainly the showing locals

haughty mica Oct 15, 2023, 3:54 PM

#

It's been like this for a while

#

Though back when I did C++ I don't remember seeing them

wide acorn Oct 15, 2023, 7:40 PM

#

    /// # Safety
    ///
    /// This method makes assumptions about the layout and location of memory
    /// referenced by `AccountInfo` fields. It should only be called for
    /// instances of `AccountInfo` that were created by the runtime and received
    /// in the `process_instruction` entrypoint of a program.
    pub fn realloc(&self, new_len: usize, zero_init: bool) -> Result<(), ProgramError> {

#

This signature appears to be a keyword short, but I can't quite put my finger on it... ferrisClueless

#

Also,

pub struct Pubkey(pub(crate) [u8; 32]);

impl<'a> AccountInfo<'a> {
    #[rustversion::attr(since(1.72), allow(invalid_reference_casting))]
    pub fn assign(&self, new_owner: &Pubkey) {
        // Set the non-mut owner field
        unsafe {
            std::ptr::write_volatile(
                self.owner as *const Pubkey as *mut [u8; 32],
                new_owner.to_bytes(),
            );
        }
    }
}

hexed basin Oct 15, 2023, 7:51 PM

#

AHHHHH

haughty mica Oct 15, 2023, 8:02 PM

#

Lmao

hexed basin Oct 15, 2023, 8:03 PM

#

wonder how many safe functions have safety comments

grim copper Oct 15, 2023, 8:03 PM

#

Cursed

wide acorn Oct 15, 2023, 8:07 PM

#

Anyway, I just looked at all the callers of AccountInfo::realloc(), only those tests in spl-token-2022 actually violate the precondition by constructing an AccountInfo<'_> on the stack

haughty mica Oct 15, 2023, 8:09 PM

#

The rate at which people misuse their unsafe APIs in tests is extraordinary

#

At some point I'll submit patches to fix some of them

#

Most of the global buffer overflows I have looked at are people forgetting to null terminate a string they're passing to a C binding.

wide acorn Oct 15, 2023, 8:17 PM

#

hexed basin wonder how many safe functions have safety comments

solana_program::account_info::AccountInfo::realloc(); solana_program::program::invoke_unchecked() and invoke_signed_unchecked(); solana_program::program_memory::sol_memcmp(), sol_memcpy(), and sol_memset(); and solana_geyser_plugin_manager::geyser_plugin_manager::load_plugin_from_config() and GeyserPluginManager::load_plugin().

hexed basin Oct 15, 2023, 8:17 PM

#

Jesus

wide acorn Oct 15, 2023, 8:17 PM

#

Most of them say

This function is incorrectly missing an unsafe declaration.

hexed basin Oct 15, 2023, 8:18 PM

#

lmfao

#

wait holy shit this is from the actual Solana repo wtf

#

crypto is such a meme

haughty mica Oct 15, 2023, 8:20 PM

#

Yes

#

It shouldn't surprise anyone that people who just assume they know better about finance take the same attitude towards writing the code that runs their financial systems

#

And yet it still surprises me on occasion

wide acorn Oct 15, 2023, 8:23 PM

#

I suppose it's the particular combination of "move fast and break things" and "adopt an immutable ledger as a source of truth", both of which can be work on their own but cause problems when taken together

haughty mica Oct 15, 2023, 8:29 PM

#

I don't think that's it

thorny venture Oct 15, 2023, 9:08 PM

#

haughty mica This looks like a useful Solana exploit if someone wants it 😂 https://asan.saet...

Thanks for this. I had dinner tonight with a friend who was excited about Rust being used to build safe, secure systems and it gave me a really good laugh to mention financial infrastructure in rust having stack buffer exploits

tame jewel Oct 16, 2023, 12:00 AM

#

wide acorn Also, ```rust pub struct Pubkey(pub(crate) [u8; 32]); impl<'a> AccountInfo<'a> ...

Please let me unsee this. There should be snarky lints for trying to use &T as if it was merely const T*.

wide acorn Oct 16, 2023, 12:14 AM

#

tame jewel Please let me unsee this. There should be snarky lints for trying to use `&T` as...

#[rustversion::attr(since(1.72), allow(invalid_reference_casting))]

#

error: assigning to `&T` is undefined behavior, consider using an `UnsafeCell`
 --> src/main.rs:6:13
  |
6 | /             std::ptr::write_volatile(
7 | |                 self.owner as *const Pubkey as *mut [u8; 32],
8 | |                 new_owner.to_bytes(),
9 | |             );
  | |_____________^
  |
  = note: for more information, visit <https://doc.rust-lang.org/book/ch15-05-interior-mutability.html>
  = note: `#[deny(invalid_reference_casting)]` on by default

#

https://github.com/solana-labs/solana/pull/32961/commits/cc1bf78fa5581222b716f646e8ce52e684858070

haughty mica Oct 16, 2023, 12:39 AM

#

Yup, it is also very common in the "holy shit that's just UB" code that people have disabled lints put there to detect the problem

grim copper Oct 16, 2023, 12:58 AM

#

@haughty mica https://miri.saethlin.dev/no-sb/ub seems to be 403ed

haughty mica Oct 16, 2023, 12:59 AM

#

Correct, it is gonezo

grim copper Oct 16, 2023, 1:00 AM

#

ah ok, plz could you update the pinned message then

#

oh it's nils

#

@tender nimbus

haughty mica Oct 16, 2023, 1:00 AM

#

The pinned message should point to https://asan.saethlin.dev/ub that's where the really spicy UB is

#

smh my 403 page is even broken

past fiber Oct 16, 2023, 1:03 AM

#

haughty mica Yup, it is also very common in the "holy shit that's just UB" code that people h...

make them forbid() ferrisBanne

tough leaf Oct 16, 2023, 1:03 AM

#

at least... most of these are crates i've not heard of

tender nimbus Oct 16, 2023, 5:32 AM

#

wide acorn Also, ```rust pub struct Pubkey(pub(crate) [u8; 32]); impl<'a> AccountInfo<'a> ...

that's how you fix the issue, right?

#

I love how it's volatile, probably because the non volatile write wasn't getting compiled correctly

regal galleon Oct 16, 2023, 5:55 AM

#

need eyebleach

tender nimbus Oct 16, 2023, 6:04 AM

#

haughty mica The pinned message should point to https://asan.saethlin.dev/ub that's where the...

hint: pc points to the zero page.

neon tiger Oct 16, 2023, 11:15 AM

#

hexed basin wonder how many safe functions have safety comments

perhaps there should be a clippy lint for this

#

OTOH I think std also has safety comments on some safe APIs that are like "this is a safe function but take heed: it interacts in the following manner with these other unsafe APIs"

#

oh, https://rust-lang.github.io/rust-clippy/master/index.html#/unnecessary_safety_doc ferrisClueless

Clippy Lints

A collection of lints to catch common mistakes and improve your Rust code.

tough leaf Oct 16, 2023, 11:28 AM

#

neon tiger OTOH I think std also has safety comments on some safe APIs that are like "this ...

I think # Safety as a header is special and you probably shouldn't use # Safety to denote poor interactions with other functions

#

name it something else

#

hmm

neon tiger Oct 16, 2023, 11:28 AM

#

indeed checking std docs now I can't find any such place

#

I looked at a few "sussy but not itself unsafe" things in ManuallyDrop, AssertUnwindSafe and Pin

#

they go into why it might be scary but none of them use # Safety

tender nimbus Oct 16, 2023, 11:50 AM

#

try x clippy --deny thelint

#

or something like that, x clippy is cursed

neon tiger Oct 16, 2023, 1:41 PM

#

my only rust checkout is THOROUGHLY fucked atm sorry

#

no I do not have time to chat about our lord and savior git worktrees

haughty mica Oct 16, 2023, 1:53 PM

#

Same

tender nimbus Oct 16, 2023, 2:31 PM

#

I have two worktrees and keep getting annoyed that I can't check out master on both of them

neon tiger Oct 16, 2023, 10:43 PM

#

wait is that a thing? that sounds terrible

#

that's just cloning the repo twice but worse

#

if you want to save space you can use a reference repo I guess

tough leaf Oct 16, 2023, 10:56 PM

#

neon tiger that's just cloning the repo twice but worse

the point of worktrees is that it shares .git

neon tiger Oct 16, 2023, 10:56 PM

#

yes but isn't that also the point with reference repos?

tough leaf Oct 16, 2023, 10:56 PM

#

... what's that

#

yet another git feature?

neon tiger Oct 16, 2023, 10:56 PM

#

yeeees ferrisCluelesser

#

you can use a .git from another place on your filesystem

#

but I think it only shares one way

tiny cedar Oct 17, 2023, 1:21 AM

#

hexed basin wonder how many safe functions have safety comments

Might be an interesting rustc lint, actually -- if you have a # Safety section in the doc-comment, it needs to be unsafe fn

wide acorn Oct 17, 2023, 1:47 AM

#

It's already a Clippy lint

#

(and imho Clippy is where it belongs, doc comments being a social convention rather than anything lang-related; you could have your own private crate ecosystem where # Safety means whatever you want)

neon tiger Oct 17, 2023, 8:35 AM

#

there are a select few lints in rustc itself around social conventions, like identifier casing. but yes I agree this should stay in clippy

dry frost Oct 17, 2023, 9:34 AM

#

tender nimbus I have two worktrees and keep getting annoyed that I can't check out master on b...

git fetch && git checkout origin/master

#

I never check out master/main or w/e as its own branch, only detached heads of it

tough leaf Oct 17, 2023, 10:18 AM

#

wide acorn (and imho Clippy is where it belongs, doc comments being a social convention rat...

doesn't rustc have style lints

#

oh already mentioned

tiny cedar Oct 17, 2023, 6:22 PM

#

wide acorn (and imho Clippy is where it belongs, doc comments being a social convention rat...

To me this is the kind of convention that is fine for rustc, because it's so easily avoided if you didn't want it. You just have to call it # IO Safety or # Float Safety or ...

Having "# Safety" consistently mean "real unsafe-related safety" would be a good thing, IMHO.

#

It's the "this is unsafe; you should have a # Safety section" lint that's less applicable for rustc, to me, since it's a "you ought to do this" lint rather than a "don't do that" lint.

neon tiger Oct 18, 2023, 1:25 PM

#

yeah, that lint will be awkward for FFI bindings too

ruby jacinth Oct 21, 2023, 12:29 PM

#

wide acorn ```rust error: assigning to `&T` is undefined behavior, consider using an `Unsaf...

the compiler says its ub but it seems to work fine so it's fine ferrisClueless

#

(average C enjoyer)

tough leaf Oct 21, 2023, 12:38 PM

#

ruby jacinth the compiler says its ub but it seems to work fine so it's fine <:ferrisClueless...

5 months later: omg a rust upgrade broke our stuff, rust is a terrible language

thorny venture Oct 21, 2023, 12:44 PM

#

It's true, but not for that reason

ruby jacinth Oct 21, 2023, 2:05 PM

#

binary search moment

tough leaf Oct 21, 2023, 2:20 PM

#

ruby jacinth binary search moment

Wow, the hostility in this thread is stunning.

ruby jacinth Oct 21, 2023, 2:27 PM

#

unless there's deleted comments i can't see, i dont see how this is hostile

sudden hawk Oct 25, 2023, 8:58 PM

#

i'm guessing that message is a prediction of what the comments will look like

tender nimbus Jun 1, 2022, 8:05 PM

#

One great way to help the ecosystem is to check out https://miri.saethlin.dev/ub (it's sorted by recent downloads), find some interesting crates with UB, and open PRs fixing the UB.

For more spicy UB, checkout https://asan.saethlin.dev/ub.

It's often a pretty simple fix, and running miri with the env var MIRIFLAGS='-Zmiri-strict-provenance -Zmiri-check-number-validity' will help you a lot.

I would also recommend adding miri with the above mentioned miriflags to the CI of the crate in your PR.

For more infos, @haughty mica can probably tell some more things about this.

#

Example: https://github.com/maciejhirsz/beef/pull/47

GitHub

Fix `into_owned` `String` not having enough provenance by Nilstrieb...

Calling .as_mut_ptr on a String actually goes through &mut str, which shrinks the provenance of the pointer to only contain the initialized bytes. This caused issues when a reconstructed String...

haughty mica Jun 1, 2022, 8:07 PM

#

-Zmiri-check-number-validity is no longer required

tender nimbus Jun 1, 2022, 8:07 PM

#

ah, was it -Zmiri-symbolic-alignment-check that still is? or is this defaulted or implied in strict provenance now as well?

haughty mica Jun 1, 2022, 8:08 PM

#

Symbolic alignment check has false positives, so I don't recommend it

#

The number validity check isn't part of strict provenance, it's just the default behavior of Miri now

tender nimbus Jun 1, 2022, 8:10 PM

#

@pastel lily write this down for your #dark-arts edit ferrisForgor

tough leaf Jun 1, 2022, 8:11 PM

#

there's also RUSTFLAGS="-Zstrict-init-checks" which makes the mem::uninitialized/mem::zeroed checks stricter

otherwise known as the "panic on http" flag, because that's all you find

ruby jacinth Jun 1, 2022, 8:11 PM

#

can you also expand on how to approach a crater owner about this? e.g., how do you convince them there is a problem? some crate owners have been very defensive about this

haughty mica Jun 1, 2022, 8:11 PM

#

Don't bother them if they're defensive

pastel lily Jun 1, 2022, 8:16 PM

#

ferrisForgor

#

im sorry i have literally 5 seconds of memory

haughty mica Jun 1, 2022, 8:17 PM

#

Note that the site uses RUSTFLAGS=-Zrandomize-layout. This is only rarely important.

In general, if you are looking for something to do in this list I would prioritize fixes both according to position in the list and the kind of UB. In descending order of badness (most dangerous at the top)

Definitely UB with or without any aliasing model:
miasligned pointer
unaligned reference
invalid pointer offset

Probably UB without any aliasing model:
uninitialized memory

Probably UB in any model that supports normal provenance optimizations:
ptr-int transmute

Probably UB in any aliasing model that supports noalias which rustc currently emits:
&->&mut
write-via-&
SB-use-outside-provenance

Debatably UB, depending on the exact details of the aliasing model:
SB-invalidation

Technically not UB but it would be awesome if people did this less because diagnostics go to crap when you do them:
int-to-ptr cast

stable arrow Jun 1, 2022, 8:17 PM

#

ruby jacinth can you also expand on how to approach a crater owner about this? e.g., how do y...

if I were trying to optimize reception I'd make sure to phrase the PR in as neutral-to-positive language as I could (e.g. if you can eliminate the unsafe code entirely, that can be pitched as a benefit without needing to get into definition-of-UB at all) and maybe include benchmarks showing that the replacement is not slower (since unsafe is often motivated by performance)

haughty mica Jun 1, 2022, 8:18 PM

#

FWIW I don't think a single one of my patches has reduced the amount of unsafe code in a crate

#

Though it would be very cool if people took that approach

tender nimbus Jun 1, 2022, 8:19 PM

#

"a tiny diff that slightly changes code so that it's now better? interesting, weird, but lgtm"
is a pretty common reaction i guess

ruby jacinth Jun 1, 2022, 8:19 PM

#

i think many people would be surprised that if you write things in a safe way it can often be as fast or faster than an unsafe solution

#

i've had that thing where i wanted to speed something up and introduced unsafe, only to make it slower

tough leaf Jun 1, 2022, 8:23 PM

#

stable arrow if I were trying to optimize reception I'd make sure to phrase the PR in as neut...

yeah when i was removing unsafe code from httpdate i removed the unsafe code and then replaced it with more complex (but entirely safe) code, which i felt confident doing because it was entirely safe, and therefore could only panic

#

and it was faster overall

tender nimbus Jun 1, 2022, 8:30 PM

#

One great low hanging fruit is to search for &->&mut
and replace the x.as_ptr() as *mut T with x.as_mut_ptr()
melobonk

ornate agate Jun 1, 2022, 8:36 PM

#

melobooli

haughty mica Jun 1, 2022, 8:37 PM

#

Also contributing Miri in CI is cool, but without -Zmiri-tag-raw-pointers or -Zmiri-strict-aliasing you're opted into a very permissive model.

It would be very cool if someone has a pluggable way to drop Miri into CI

ruby jacinth Jun 1, 2022, 8:40 PM

#

maybe you could make one of those action things

tender nimbus Jun 1, 2022, 8:42 PM

#

miri-action

#

with strict defaults ferrisOwO

#

Would be nice I guess

ruby jacinth Jun 1, 2022, 8:42 PM

#

i will make an exception to my hate of rocket emojis and give you a rocket emoji: 🚀

tender nimbus Jun 1, 2022, 8:43 PM

#

🔥 🚀 blazingly ub 🚀

haughty mica Jun 1, 2022, 8:44 PM

#

Yeah if someone made one of those action things that might be nice

#

Maybe

#

The Miri readme has this: https://github.com/rust-lang/miri#running-miri-on-ci

#

Dunno if that's easy enough already

tender nimbus Jun 1, 2022, 8:46 PM

#

doesn't use miriflags ferrisForgor

tough leaf Jun 1, 2022, 8:47 PM

#

also.i'd like to see non-miri sanitizers be actually good

#

you don't need miri to notice a move of an invalid type

west violet Jun 1, 2022, 10:09 PM

#

haughty mica Note that the site uses `RUSTFLAGS=-Zrandomize-layout`. This is only rarely impo...

Does randomize work on box now?

haughty mica Jun 1, 2022, 10:09 PM

#

No?

west violet Jun 1, 2022, 10:09 PM

#

Oh nvm then

#

By "work" I meant "not explode" btw

haughty mica Jun 1, 2022, 10:13 PM

#

The standard library still doesn't build with the flag if that's what you mean?

west violet Jun 1, 2022, 10:15 PM

#

Ah yep, that’s what I meant

#

I need to work on that sometime

haughty mica Jun 1, 2022, 10:22 PM

#

Yeah the error seems impenetrable to me

tame jewel Jun 3, 2022, 4:35 PM

#

Hm, miri doesn't really pinpoint the location too well. For image-canvas:
Undefined Behavior: trying to reborrow <255625> for Unique permission at alloc103546[0x0], but that tag only grants SharedReadOnly permission for this location
|
= help: this indicates a potential bug in the program: it performed an invalid operation, but the rules it violated are still experimental
= help: see https://github.com/rust-lang/unsafe-code-guidelines/blob/master/wip/stacked-borrows.md for further information
help: <255625> was created by a retag at offsets [0x0..0x1]
--> src/shader.rs:1713:24
|
1713 | let ne_bytes = texel.to_mut_bytes(core::slice::from_mut(val));
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

The fault is actual inside image_texel::to_mut_bytes. But good to know, will be fixed asap.

GitHub

unsafe-code-guidelines/stacked-borrows.md at master · rust-lang/uns...

Home for the Unsafe Code Guidelines working group. - unsafe-code-guidelines/stacked-borrows.md at master · rust-lang/unsafe-code-guidelines

#

It's one of those as_ptr() -> &mut _ instances 😅

pastel lily Jun 3, 2022, 4:36 PM

#

@haughty mica looks like this error is missing a span

haughty mica Jun 3, 2022, 4:39 PM

#

Aaaaaa

#

Give me a bit

#

Thank you for summoning me

#

This was a Miri bug

tender nimbus Jun 3, 2022, 4:40 PM

#

tame jewel It's one of those `as_ptr() -> &mut _` instances 😅

I wonder if clippy could lint on that
Or does it already

haughty mica Jun 3, 2022, 4:41 PM

#

Definitely doesn't

pastel lily Jun 3, 2022, 4:42 PM

#

Yea was pretty sure it was a miri bug that this diagnostic didn’t have the code that went wrong

haughty mica Jun 3, 2022, 4:43 PM

#

I reran the top 1000 to update them then introduced this problem then never uploaded the re-rerun logs

haughty mica Jun 3, 2022, 4:58 PM

#

Now my internet is dying???

haughty mica Jun 3, 2022, 7:08 PM

#

I really need a good rerun mechanism. image-canvas is surprisingly far down the list (currently all I have is "run the top N")

#

So I deleted image-canvas's log and now I'm waiting for the other 200 crates ahead of it that have had new versions to run

tender nimbus Jun 3, 2022, 8:13 PM

#

@haughty mica it would be cool if there was a flag to show nice stack traces for the location where the tag was created/invalidated

#

help: <210780> was created by a retag at offsets [0x0..0xffc]
    --> /home/nilsh/.rustup/toolchains/nightly-x86_64-unknown-linux-gnu/lib/rustlib/src/rust/library/core/src/slice/mod.rs:507:9
     |
507  |         self as *mut [T] as *mut T
     |         ^^^^
help: <210780> was later invalidated at offsets [0x0..0xffc]
    --> /home/nilsh/.rustup/toolchains/nightly-x86_64-unknown-linux-gnu/lib/rustlib/src/rust/library/core/src/mem/maybe_uninit.rs:1019:5
     |
1019 | /     pub const fn slice_as_mut_ptr(this: &mut [MaybeUninit<T>]) -> *mut T {
1020 | |         this.as_mut_ptr() as *mut T
1021 | |     }
     | |_____^

that's not very helpful

haughty mica Jun 3, 2022, 8:23 PM

#

tender nimbus <@176135688666742784> it would be cool if there was a flag to show nice stack tr...

This is not really possible

#

You're asking for a stack trace to be collected on every invalidation which is an incredible amount of data

tender nimbus Jun 3, 2022, 8:23 PM

#

oh well

#

ferrisballSweat

tough leaf Jun 3, 2022, 8:24 PM

#

can miri have an option to run twice and then collect that way?

haughty mica Jun 3, 2022, 8:24 PM

#

The data you want can be collected with tag tracking, but you need to run a second time

tough leaf Jun 3, 2022, 8:24 PM

#

that does require determinism

haughty mica Jun 3, 2022, 8:24 PM

#

But pointer tags are not necessarily stable with isolation off

#

Yeah

#

This particular problem though is a FnEntry retag which I'm aware are problematic

#

You should report this tbh so that Ralf cares more about it

#

If we reported the span of that function call instead, I bet that would be helpful

#

I discussed this problem in the PR that adds these diagnostics

tender nimbus Jun 3, 2022, 8:27 PM

#

lmao, just ran into a case where #[may_dangle] made it compile, lmao

haughty mica Jun 3, 2022, 8:28 PM

#

Read from here on down https://github.com/rust-lang/miri/pull/2030#issuecomment-1072928828

tender nimbus Jun 3, 2022, 8:34 PM

#

https://github.com/rust-lang/miri/issues/2185

haughty mica Jun 3, 2022, 8:34 PM

#

Oof uh your issue is wrong

#

When a pointer tag is created/invalidated somewhere in std or a dependency, the diagnostics point at that code. That's not very helpful for finding the actual problem.

#

Look around at other diagnostics

#

The created/invalidated spans are always your code

#

You should post the exact situation you're looking at

tender nimbus Jun 3, 2022, 8:36 PM

#

are they?

haughty mica Jun 3, 2022, 8:36 PM

#

Aaaaaaa

#

That's the backtrace

#

This is a lot of me fighting with Ralf

#

Ralf wants to be technically exactly correct and I want to be helpful

tender nimbus Jun 3, 2022, 8:37 PM

#

ferrisBut

#

i will be able to post a rustc pr with the example to the miri issue

#

soon

haughty mica Jun 3, 2022, 8:37 PM

#

What's actually going on is you have a backtrace from the actual case where the program becomes UB. The top frame is formatted a bit differently, and between the topmost frame and the next one there are a bunch of helps inserted

#

The help spans are always in a local crate

#

When a tag is created or invalidated, we walk up the stack from the actual place the interpreter is at until we hit a frame which is part of a crate in the local workspace, according to Cargo

tender nimbus Jun 3, 2022, 8:39 PM

#

according to Cargo
aaaaaaaaa, that explains the issue

#

I'm in the same workspace as core

haughty mica Jun 3, 2022, 8:39 PM

#

Mmmmmm

tender nimbus Jun 3, 2022, 8:39 PM

#

i guess I'll close that issue then

haughty mica Jun 3, 2022, 8:39 PM

#

What would you prefer instead?

#

No no no

tender nimbus Jun 3, 2022, 8:39 PM

#

i think leaving it as is is ok tbh

haughty mica Jun 3, 2022, 8:39 PM

#

Keep the issue open just explain more about what your situation is

tender nimbus Jun 3, 2022, 8:40 PM

#

I'm in rustc_arena

haughty mica Jun 3, 2022, 8:40 PM

#

The diagnostics could be better for you, so explain why

tender nimbus Jun 3, 2022, 8:40 PM

#

and there was ub

haughty mica Jun 3, 2022, 8:40 PM

#

You should explain that in the issue instead of attempting to generalize it 🙂

tender nimbus Jun 3, 2022, 8:40 PM

#

(with storing box, as I easily found out because the function where the tag is invalidated is only used once)

#

i guess the issue is "rustc is in the same workspace as core so i get bad diagnostics"?

haughty mica Jun 3, 2022, 8:41 PM

#

No

tender nimbus Jun 3, 2022, 8:41 PM

#

but actually the backtrace point still stands

haughty mica Jun 3, 2022, 8:41 PM

#

The issue is that I made a bug over here but the diagnostics are pointing over there and I want them to point here instead

tender nimbus Jun 3, 2022, 8:41 PM

#

because this could happen in other workspaces as well

haughty mica Jun 3, 2022, 8:42 PM

#

I think the question is whether we want to expose any control over the crate selection

#

cargo-miri tells miri about what crates are local, and if you're running miri directly I think you can give it any list of crates you feel like

#

You can search for local_crates in the code to find the logic

haughty mica Jun 3, 2022, 8:54 PM

#

tame jewel Hm, miri doesn't really pinpoint the location too well. For `image-canvas`: Unde...

Finally fixed this: https://miri.saethlin.dev/ub?crate=image-canvas&version=0.3.1

tender nimbus Jun 3, 2022, 8:57 PM

#

ferrisOwO

pastel lily Jun 3, 2022, 11:22 PM

#

haughty mica I really need a good rerun mechanism. image-canvas is surprisingly far down the ...

Being able to point it at a file that has an array of crate names (and maybe versions?) would be useful maybe?

haughty mica Jun 3, 2022, 11:26 PM

#

Yes

#

Definitely

tender nimbus Jun 4, 2022, 6:50 AM

#

pastel lily Being able to point it at a file that has an array of crate names (and maybe ver...

Sounds useful, post it on the issue ferrisBorrowCheck

haughty mica Jun 4, 2022, 11:13 AM

#

I think Callie is thinking of a feature for my tool not Miri

#

Most of the whole thing is in this one file https://github.com/saethlin/miri-tools/blob/main/src/main.rs

tender nimbus Jun 4, 2022, 11:15 AM

#

oh

#

it was early in the morning ferrisBut

haughty mica Jun 5, 2022, 4:50 PM

#

The site now lists more than half of all published crates (crates, not versions)

tender nimbus Jun 5, 2022, 5:15 PM

#

amazing!

#

so much new ub ferrisOwO

pastel lily Jun 5, 2022, 5:34 PM

#

octowo

tender nimbus Jun 6, 2022, 10:01 AM

#

https://github.com/Nercury/android_logger-rs/commit/f1500f8c456fd97f82ee9b56b9cfae1a0d9e016d
why did they create an issue instead of just - you know - fixing it

I have investigated this, and it would be useful for maybe_uninit_uninit_array to be stabilized before trying to implement this.
maybe_uninit_uninit_array it would save a single unsafe block, wow, definitely worth it

#

also,

mem::uninitialized() was used to put outgoing messages on stack and avoid hitting allocator for each and every message.
they store this on the stack
zero-initing it would not hit the allocator

tough leaf Jun 6, 2022, 10:06 AM

#

lmao

tender nimbus Jun 6, 2022, 10:08 AM

#

out of all the clues that exist in the wide world, android_logger might not contain all of them

#

tough leaf Jun 6, 2022, 10:13 AM

#

tender nimbus out of all the clues that exist in the wide world, `android_logger` might not co...

lmaooooo

#

are you called blue because you're gonna give them a clue?

tender nimbus Jun 6, 2022, 10:13 AM

#

ferrisClueless

#

MaybeUninit::write my beloved

proper belfry Jun 6, 2022, 10:39 AM

#

tender nimbus <https://github.com/Nercury/android_logger-rs/commit/f1500f8c456fd97f82ee9b56b9c...

not even ferrisBut ```rs
struct MaybeUninitConsts<T>(T);
impl<T> MaybeUninitConsts<T> {
const UNINIT: MaybeUninit<T> = MaybeUninit::uninit();
}
let array = [<MaybeUninitConsts<T>>::UNINIT; N];

tender nimbus Jun 6, 2022, 10:46 AM

#

doesn't that have bad codegen though

#

or does it not

proper belfry Jun 6, 2022, 10:49 AM

#

tender nimbus doesn't that have bad codegen though

It works in a const context so I imagine it would be const-folded

#Fixing UB in random crates