Fixing UB in random crates | Rust Programming Language Community | Page 3

small apexBOT Jun 6, 2022, 10:49 AM

#

tender nimbus Jun 6, 2022, 10:50 AM

#

maybe don't make it generic

#

hey ferris can you godbolt
```rust
use std::mem::MaybeUninit;
struct MaybeUninitConsts<T>(T);
impl<T> MaybeUninitConsts<T> {
    const UNINIT: MaybeUninit<T> = MaybeUninit::uninit();
}
pub fn foo() -> [MaybeUninit<u8>; 128] {
    [<MaybeUninitConsts<u8>>::UNINIT; 128]
}
```

small apexBOT Jun 6, 2022, 10:51 AM

#

example::foo:
        mov     rax, rdi
        ret

tender nimbus Jun 6, 2022, 10:51 AM

#

hm, nice

#

hey ferris can you godbolt
```rust
use std::mem::MaybeUninit;
pub fn foo() -> [MaybeUninit<u8>; 128] {
    [MaybeUninit::uninit(); 128]
}
```

small apexBOT Jun 6, 2022, 10:52 AM

#

example::foo:
        mov     rax, rdi
        ret

tender nimbus Jun 6, 2022, 10:52 AM

#

hm, nice

#

not sure whether i trust it 100% though ferrisballSweat

#

#[unstable(feature = "maybe_uninit_uninit_array", issue = "none")]
the fuck core, ???

#

ah, a tracking issue was added recently

#

oh god i hate how basically every call to copy or copy_nonoverlapping to the same allocation has SB issues because of course it fucking does, people are reasonable to write code that has these issues
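A minimal sketch of the pattern being complained about, under the assumption that the typical offender calls `as_ptr()` for the source and then `as_mut_ptr()` for the destination on the same buffer: the second call reborrows the buffer mutably, which under Stacked Borrows invalidates the pointer obtained first. `shift_right` is a hypothetical helper, not code from any of the crates discussed; it derives both pointers from one `as_mut_ptr()` call.

```rust
use std::ptr;

/// Copies the first `len` bytes of `buf` to the range starting at `shift`,
/// in place. Hypothetical helper for illustration only.
pub fn shift_right(buf: &mut [u8], len: usize, shift: usize) {
    // Reject ranges that would run past the end of the slice.
    assert!(len.checked_add(shift).map_or(false, |end| end <= buf.len()));

    // Problematic under Stacked Borrows (left as a comment on purpose):
    //   ptr::copy(buf.as_ptr(), buf.as_mut_ptr().add(shift), len)
    // The `as_mut_ptr()` reborrow invalidates the pointer from `as_ptr()`.

    // Both source and destination derive from this single pointer.
    let base = buf.as_mut_ptr();
    // SAFETY: bounds were checked above; `ptr::copy` allows overlapping ranges.
    unsafe { ptr::copy(base, base.add(shift), len) };
}
```

Since Rust 1.37 the safe `copy_within` method on slices covers the same use case without any `unsafe`.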

#

self.buffer.get_unchecked_mut(len)
can we get a clippy lint please ferrisPlead

#

wait, what is it doing

#

wtf

#

this is straight up reading uninit memory

#

what

#

ah, it needs to read it as MU, then it's fine
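A small sketch of what "read it as MU" means, assuming the goal is simply to move possibly-uninitialized bytes around: as long as the bytes are typed as `MaybeUninit<u8>`, copying them never asserts that they are initialized. `copy_maybe_uninit` is a hypothetical helper, not the code under discussion.

```rust
use std::mem::MaybeUninit;

/// Copies `src` into `dst` without ever producing a plain `u8`
/// from memory that might be uninitialized.
pub fn copy_maybe_uninit(src: &[MaybeUninit<u8>], dst: &mut [MaybeUninit<u8>]) {
    assert_eq!(src.len(), dst.len());
    // `MaybeUninit<u8>` is `Copy`, so this is an ordinary safe slice copy.
    dst.copy_from_slice(src);
}
```

Only the final `assume_init` call is `unsafe`, and it belongs at the point where the caller can actually prove the byte was written.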

#

rerunning this beauty #932319394724479037 message

#

📎 causes.txt

tender nimbus Jun 7, 2022, 1:10 PM

#

writing extremely cursed (fish) shell scripts again to sort the ub crates by downloads

curl https://miri.saethlin.dev/ub > miri.html
curl -L https://static.crates.io/db-dump.tar.gz -o dump.tar.gz
tar -xzf dump.tar.gz
cat miri.html | sd '.*crate">(.*?) (.*?)</div><div class="status">UB: (.*?)</div.*' '"$1","$3"' > ub.csv

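for c in (cat ./ub.csv) ;
    set NAME (echo $c | xsv select 1)
    # second column of ub.csv is the UB description; $UB was never set before
    set UB (echo $c | xsv select 2)
    set COUNT (xsv search "^$NAME\$" s_crates.csv | xsv select -n 2 | sed -n '2p')
    echo "$COUNT $NAME $UB"
end > downloads.txt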

cat downloads.txt | sort -nr > result.txt

#

📎 result.txt

haughty mica Jun 7, 2022, 1:15 PM

#

They're already sorted by recent downloads

tender nimbus Jun 7, 2022, 1:16 PM

#

oh

#

ferrisballSweat

tough leaf Jun 7, 2022, 1:16 PM

#

lol

tender nimbus Jun 7, 2022, 1:16 PM

#

well, but it doesn't show the download count so my script is still not useless ferrisClueless

tough leaf Jun 7, 2022, 1:16 PM

#

would you like a clue

#

or two

tender nimbus Jun 7, 2022, 1:17 PM

#

yes

#

i'd take three

tough leaf Jun 7, 2022, 1:17 PM

#

🔵 💙 📘

tender nimbus Jun 7, 2022, 1:18 PM

#

ferrisOwO

west phoenix Jun 9, 2022, 11:03 AM

#

simdjson needs unsoundness fixed

tender nimbus Jun 9, 2022, 11:18 AM

#

simdjson is a bit annoying because it can't really be mirid

haughty mica Jun 9, 2022, 12:22 PM

#

How do you know it has a soundness issue @west phoenix ?

tough leaf Jun 9, 2022, 12:26 PM

#

haughty mica How do you know it has a soundness issue @west phoenix ?

i found it handing out non-utf8 Strings
which isn't that bad but it's still unsoundness

#

still gotta report that

haughty mica Jun 9, 2022, 12:27 PM

#

Yeah that's not great

west phoenix Jun 9, 2022, 2:08 PM

#

haughty mica How do you know it has a soundness issue @west phoenix ?

Well, first I saw it allowed clippy::uninit_vec

#

and that means there is definitely ub somewhere else

haughty mica Jun 9, 2022, 2:13 PM

#

Interesting

#

Oh fucking hell this is bad

west violet Jun 9, 2022, 2:25 PM

#

There’s a lotta sketchy stuff in that crate tbh

#

The nice thing is that it’s got extensive tests & fuzzing so changes are easy

haughty mica Jun 9, 2022, 3:04 PM

#

west phoenix and that means there is definitely ub somewhere else

There's UB right there, if rustc starts adding noundef

west phoenix Jun 9, 2022, 3:05 PM

#

the uninit vec stuff was fixed iirc

tender nimbus Jun 9, 2022, 3:06 PM

#

Yes, they fixed it shortly after I opened an issue, so they are definitely open for soundness fixes

tough leaf Jun 9, 2022, 3:10 PM

#

haughty mica There's UB right there, if rustc starts adding `noundef`

yeah, we currently do emit noundef for the same types that mem::uninitialized panics for
anything with an invalid value
i don't think noundef does much right now but it would be interesting to see if any benchmarks improve if we emit it everywhere we can

haughty mica Jun 9, 2022, 3:13 PM

#

But where can we

#

Oh fucking hell

#

    pub(crate) fn parse_str_<'invoke>(
        input: &'de [u8],
        data: &'invoke [u8],
        buffer: &'invoke mut [u8],
        mut idx: usize,
    ) -> Result<&'de str> {
        use ErrorType::{InvalidEscape, InvalidUnicodeCodepoint};
        let input: &mut [u8] = unsafe { std::mem::transmute(input) };

#

https://github.com/simd-lite/simd-json/blob/ae48dc4ee34c8cc698c73ac599bcfdd648a53cd4/src/avx2/deser.rs#L31-L38

tough leaf Jun 9, 2022, 3:15 PM

#

lmaooooo

#

    #[allow(
        clippy::if_not_else,
        mutable_transmutes,
        clippy::transmute_ptr_to_ptr,
        clippy::too_many_lines,
        clippy::cast_ptr_alignment,
        clippy::cast_possible_wrap,
        clippy::if_not_else,
        clippy::too_many_lines
    )]

#

mutable_transmutes is just another annoying lint just like the rest right

west violet Jun 9, 2022, 3:16 PM

#

Oh god

#

clippy::transmute_ptr_to_ptr literally just cast

haughty mica Jun 9, 2022, 3:17 PM

#

mutable_transmutes is literally just a UB detecting lint
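For context, `mutable_transmutes` is a deny-by-default rustc lint because turning a `&T` into a `&mut T` is undefined behavior regardless of how the result is used. A sketch of the sound alternative, assuming the parser really does need to rewrite its input: ask for `&mut [u8]` in the signature so the caller proves exclusive access. `unescape_in_place` is a hypothetical stand-in that only handles the `\n` escape, not simd-json's actual routine.

```rust
/// Rewrites `\n` escape sequences inside `input` and returns the
/// shortened, unescaped prefix. Hypothetical example function.
pub fn unescape_in_place(input: &mut [u8]) -> &[u8] {
    let mut read = 0;
    let mut write = 0;
    while read < input.len() {
        if input[read] == b'\\' && read + 1 < input.len() && input[read + 1] == b'n' {
            // Two input bytes collapse into one output byte.
            input[write] = b'\n';
            read += 2;
        } else {
            input[write] = input[read];
            read += 1;
        }
        write += 1;
    }
    &input[..write]
}
```

The write cursor never overtakes the read cursor, so the rewrite is safe to do in the same buffer, and no `unsafe` or lint suppression is needed.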

west phoenix Jun 9, 2022, 3:17 PM

#

btw this crate is used as an optional dep of serenity (the discord bot library), so like, is probably being used in production rn

haughty mica Jun 9, 2022, 3:17 PM

#

serenity != production

west phoenix Jun 9, 2022, 3:17 PM

#

probably
and by probably, the only reason I'm not running it on my 36k server bot is because it had some deserialization issues (before I found out about this)

haughty mica Jun 9, 2022, 3:18 PM

#

Also in general I'm not worried about these sorts of issues "in production", I'm worried about them going forward, or how they reflect on Rust overall

#

What do you mean by "issues"?

west phoenix Jun 9, 2022, 3:19 PM

#

https://github.com/serenity-rs/serenity/issues/1869

GitHub

Snowflakes and anything containing them don't appear to be deserial...

Serenity version: 0.11.1 Rust version: 1.60 When I enabled the simdjson feature, my bot wouldn't properly add guild data from the GuildCreate event into the cache and practically any event ...

haughty mica Jun 9, 2022, 3:20 PM

#

Oh, snowflakes

#

I don't remember exactly but those do tickle some uncommon scenario for deserializers

west violet Jun 9, 2022, 3:29 PM

#

haughty mica ```rust pub(crate) fn parse_str_<'invoke>( input: &'de [u8], ...

Oh god it actually copies into input https://github.com/simd-lite/simd-json/blob/ae48dc4ee34c8cc698c73ac599bcfdd648a53cd4/src/avx2/deser.rs#L144-L146

haughty mica Jun 9, 2022, 3:29 PM

#

And it uses clone_from_slice for &[u8] which is not awful but just why

tough leaf Jun 9, 2022, 3:30 PM

#

it makes invalid UTF-8 this way too but that's a known issue

#

they even mention it

west violet Jun 9, 2022, 3:34 PM

#

So they're aware but don't care?

tough leaf Jun 9, 2022, 3:37 PM

#

yes

#

well

#

they might not know it's UB

#

https://docs.rs/simd-json/latest/simd_json/serde/fn.from_str.html

parses a str using a serde deserializer. note that the slice will be rewritten in the process and might not remain a valid utf8 string in its entirety.


haughty mica Jun 9, 2022, 3:39 PM

#

It's not as serious as all the other issues

#

You can tiptoe around library UB

tough leaf Jun 9, 2022, 3:40 PM

#

agreed

ruby jacinth Jun 9, 2022, 4:10 PM

#

If input is a static string literal, won't writing into it crash instantly?

haughty mica Jun 9, 2022, 4:13 PM

#

I suspect that scenario is avoided somehow

tender nimbus Jun 9, 2022, 6:28 PM

#

haughty mica ```rust pub(crate) fn parse_str_<'invoke>( input: &'de [u8], ...

ferrisballSweat

#

corro

#

this is what happens when you port a c++ library ferrisWhen

grim copper Jun 10, 2022, 4:21 AM

#

It’s sad that it’s so cursed, lemire’s libraries deserve better than a crappy port

tender nimbus Jun 10, 2022, 4:59 AM

#

https://github.com/ParkMyCar/compact_str/pull/100#pullrequestreview-1002178051

ferrisballSweat I spent most of the time fixing the arc module

GitHub

Fix provenance issues by Nilstrieb · Pull Request #100 · ParkMyCar/...

When creating a BoxString using from_string or from_box_str, str::as_ptr was used to get the pointer, which only has read provenance for the initialized part of the string. Going through Vec in the...

knotty oar Jun 10, 2022, 6:56 AM

#

west violet So they're aware but don't care?

i know the author personally, and i wouldn't be surprised if this is true

haughty mica Jun 10, 2022, 4:14 PM

#

once_cell now does pointer stuffing with as-casts 😩

west violet Jun 10, 2022, 4:16 PM

#

Is that good or bad

haughty mica Jun 10, 2022, 4:24 PM

#

Bad, because it's such a core library that this update will cause a lot of things to die under -Zmiri-tag-raw-pointers

west violet Jun 10, 2022, 4:25 PM

#

Gotcha

#

(I didn't know if this was a "yay, it no longer uses transmute!" or something)

haughty mica Jun 10, 2022, 4:28 PM

#

Ah. I've yet to run into a crate that does pointer transmutes which I can't replace with wrapping operations. I'm sure they exist, but haven't hit one yet.
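A sketch of what replacing pointer stuffing with wrapping operations can look like, assuming a one-bit tag stored in the low bit of a pointer to a 2-byte-aligned value. `set_tag` and `split_tag` are hypothetical helpers, not once_cell's code. The tagged pointer is always derived from the original pointer, so it keeps its provenance instead of being rebuilt from an integer.

```rust
/// Stores `tag` in the low bit of a pointer to a 2-byte-aligned value.
pub fn set_tag(ptr: *mut u16, tag: bool) -> *mut u8 {
    // Byte-wise wrapping offset: no integer-to-pointer cast involved.
    (ptr as *mut u8).wrapping_add(tag as usize)
}

/// Recovers the original pointer and the tag bit.
pub fn split_tag(tagged: *mut u8) -> (*mut u16, bool) {
    // The integer is only used to read the low bit; the returned pointer
    // is computed from `tagged` itself, never from the integer.
    let tag = (tagged as usize) & 1;
    (tagged.wrapping_sub(tag) as *mut u16, tag == 1)
}
```

The design choice is that integers may be computed from pointers freely, but a pointer is never manufactured from an integer.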

#

Miri thinks one of the once_cell tests deadlocks

#

That is probably not great

haughty mica Jun 10, 2022, 5:02 PM

#

omg they use xtask to run their tests what is this

#

It's very cool but also not

ruby jacinth Jun 10, 2022, 5:12 PM

#

you should still be able to do cargo test

#

at least they use miri in it

haughty mica Jun 10, 2022, 5:33 PM

#

Oh Truuuuuu

ruby jacinth Jun 10, 2022, 5:56 PM

#

better than makefiles at least

haughty mica Jun 10, 2022, 5:57 PM

#

Possibly

#

I really dislike how much people hate on makefiles

ruby jacinth Jun 10, 2022, 5:57 PM

#

eh they're pretty bad

#

extremely messy when they get big, and it's really easy to write them in a platform dependent way

haughty mica Jun 10, 2022, 6:01 PM

#

Don't get me wrong, Cargo is so much better. I've just seen too many "Makefiles but with my personal nits fixed" things

tender nimbus Jun 10, 2022, 7:35 PM

#

these are the sort of -Zmiri-measureme that i love ferrisballSweat

#

that's a lot of time spent offsetting pointers ferrisballSweat

#

these zero cost abstractions become very much not zero cost under miri

haughty mica Jun 10, 2022, 7:50 PM

#

What crate is this

tender nimbus Jun 10, 2022, 7:55 PM

#

compact_str

#

there's probably a huge string in the test suite, I didn't take a closer look

#

that's wild corro

tawny coyote Jun 10, 2022, 8:05 PM

#

I dont know shit about provenance and stuff, so noob question here:

ptr-int transmute

int-to-ptr cast

Does that mean transmute::<*const T, usize> is UB, but usize as *const T is not?
What about the other other directions, transmute from int to ptr and cast from ptr to int?

proper belfry Jun 10, 2022, 8:07 PM

#

tawny coyote I dont know shit about provenance and stuff, so noob question here: > ptr-int t...

cast from ptr to int is definitely OK, it will probably be deprecated though

tender nimbus Jun 10, 2022, 8:07 PM

#

transmuting pointers to integers has been made ok recently

proper belfry Jun 10, 2022, 8:07 PM

#

int to ptr transmute I think is okay, it probably just creates a pointer with zero provenance

proper belfry Jun 10, 2022, 8:08 PM

#

tender nimbus transmuting pointers to integers has been made ok recently

oh that’s nice. so it has the semantics of .addr()?

tender nimbus Jun 10, 2022, 8:08 PM

#

casting integers to pointers is complicated

tender nimbus Jun 10, 2022, 8:08 PM

#

proper belfry oh that’s nice. so it has the semantics of `.addr()`?

yep, if you look at the implementation of .addr, it's now a transmute ferrisOwO

#

https://github.com/rust-lang/rust/pull/97710

proper belfry Jun 10, 2022, 8:09 PM

#

nice, this makes a lot of sense!

tawny coyote Jun 10, 2022, 8:10 PM

#

what makes transmute there better than `as usize`?

proper belfry Jun 10, 2022, 8:11 PM

#

tawny coyote what makes transmute there better than `as usize`?

It doesn’t expose the address of the pointer, theoretically allowing more aggressive compiler optimizations

tender nimbus Jun 10, 2022, 8:13 PM

#

oversimplified explanation
provenance is an extra made up part of pointers that controls what memory they are permitted to access
if you cast an integer to a pointer, what provenance does it get?
it looks at whether a provenance has been "exposed" for this address; if yes, it gets it, if no, the pointer will have no provenance and therefore cannot be dereferenced
to expose a provenance, cast a pointer to an integer using `as`
the transmute doesn't expose it

this is all used for compiler optimizations, so the transmute could (in the future) get slightly better compiler optimizations

pastel lily Jun 10, 2022, 8:16 PM

#

transmute is claiming “this pointer is just an integer”, whereas `as` (or other methods) tells the compiler you want to actually do things on pointers

tender nimbus Jun 10, 2022, 8:17 PM

#

transmute is "just give me the address btw"
`as` is "give me the address and make sure that I can cast this back to a pointer later"
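The "exposed" round trip described above can be sketched as follows, assuming the permissive model in which a pointer-to-integer `as` cast exposes the provenance and a later integer-to-pointer `as` cast may pick it up again. `round_trip` is a hypothetical function for illustration; as the thread itself notes, the exact rules were still being worked out at the time.

```rust
/// Reads a value through a pointer that was round-tripped via `usize`.
pub fn round_trip(value: &u32) -> u32 {
    let ptr: *const u32 = value;
    // Pointer-to-integer `as` cast: yields the address and, in the model
    // described above, marks the provenance as exposed.
    let addr = ptr as usize;
    // Integer-to-pointer `as` cast: the result can be dereferenced only
    // because a matching provenance was exposed earlier.
    let back = addr as *const u32;
    // SAFETY: `back` has the same address as `ptr`, which points to a live `u32`.
    unsafe { *back }
}
```

If the address had been obtained without exposing the provenance, the final read is exactly the case the explanation says may not be dereferenced.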

tawny coyote Jun 10, 2022, 8:17 PM

#

ah i think i get it, also just noticed there's an explanation of provenance in the docs now

haughty mica Jun 10, 2022, 8:18 PM

#

tender nimbus transmuting pointers to integers has been made ok recently

No I think it was made not okay at all

pastel lily Jun 10, 2022, 8:18 PM

#

A pointer is an address plus some abstract machine state that matters to the compiler and may or may not exist at runtime (on platforms like CHERI). So while the address part can always fit in a usize, you have to do Other Things to operate on that other state.

tender nimbus Jun 10, 2022, 8:19 PM

#

haughty mica No I think it was made not okay at all

https://github.com/rust-lang/unsafe-code-guidelines/issues/286#issuecomment-1140280025

haughty mica Jun 10, 2022, 8:20 PM

#

Oh that's interesting

tender nimbus Jun 10, 2022, 8:21 PM

#

as you can see @tawny coyote, these kinds of rules are still in progress ferrisBut

haughty mica Jun 10, 2022, 8:21 PM

#

Yeah 7 days old

#

This landed while I was on vacation 😩

tawny coyote Jun 10, 2022, 8:21 PM

#

tender nimbus as you can see @tawny coyote, these kinds of rules are still in progress...

yep but very interesting, I'll read through some of the stuff

tender nimbus Jun 10, 2022, 8:21 PM

#

before this comment, transmuting a pointer to an integer was considered ub ferrisClueless

tender nimbus Jun 10, 2022, 8:22 PM

#

tawny coyote yep but very interesting, I'll read through some of the stuff

if you want a better explanation of all this, read through ralfs blog posts

#

https://ralfj.de/blog

haughty mica Jun 10, 2022, 8:22 PM

#

The heart of the problem is that observing the address of a pointer has implications for what optimizations you're allowed to do

#

But to be honest all this stuff is in the weeds, because stacked borrows with untagged still doesn't support noalias, and I'm waiting for stacked borrows with wildcard to land before I ask Ralf if that supports noalias

pastel lily Jun 10, 2022, 8:23 PM

#

C lets you do whatever you want with pointers and that inhibits so many optimizations

tender nimbus Jun 10, 2022, 8:23 PM

#

pastel lily C lets you do whatever you want with pointers and that inhibits *so many optimiz...

well no, not really

tawny coyote Jun 10, 2022, 8:23 PM

#

ima just read https://plv.mpi-sws.org/rustbelt/stacked-borrows/paper.pdf for maximum coolness

haughty mica Jun 10, 2022, 8:24 PM

#

Well

tender nimbus Jun 10, 2022, 8:24 PM

#

good luck, though I doubt that you'll understand much (i don't either ferrisballSweat )

haughty mica Jun 10, 2022, 8:24 PM

#

The problem with the C rules is that it's not clear how restrict is valid

pastel lily Jun 10, 2022, 8:24 PM

#

Well every pointer is basically exposed and you can do things like serialize them, isn’t it

tender nimbus Jun 10, 2022, 8:24 PM

#

nah

pastel lily Jun 10, 2022, 8:24 PM

#

and nobody uses restrict even if that helps some

tender nimbus Jun 10, 2022, 8:24 PM

#

you have to expose them manually still

#

under PNVI-ae-udi
which is what c will probably get

pastel lily Jun 10, 2022, 8:24 PM

#

I’ve only ever seen restrict used in libc

tender nimbus Jun 10, 2022, 8:25 PM

#

funnily c provenance is still not really settled as well ferrisBut

haughty mica Jun 10, 2022, 8:25 PM

#

pastel lily I’ve only ever seen restrict used in libc

It's used here and there in libraries

pastel lily Jun 10, 2022, 8:25 PM

#

And I’m sure that nobody calls it right

tender nimbus Jun 10, 2022, 8:25 PM

#

and it's used in all good cuda code, so I've heard from our local cuda wizard

#

c still has provenance, you're not allowed to go out of bounds with pointers, still have to expose them for int2ptr casts, so c pointers are limited
but not as limited as rust pointers

haughty mica Jun 10, 2022, 8:27 PM

#

Probably

#

It's still possible for the compiler/lang teams to just say "oh dear we can't break anyone's code!" and simply remove all noalias from the compiler and let everyone live with the regression.

tender nimbus Jun 10, 2022, 8:27 PM

#

ferrisClueless

haughty mica Jun 10, 2022, 8:41 PM

#

😂

#

The answer is that once_cell contains data race(s)

tender nimbus Jun 10, 2022, 8:42 PM

#

ferrisballSweat

#

great

haughty mica Jun 10, 2022, 8:42 PM

#

Oh jfc

#

I'm bad

#

TSan misbehaves if you don't pass -Zbuild-std

tender nimbus Jun 10, 2022, 8:45 PM

#

good

#

well, not good
but good

haughty mica Jun 10, 2022, 8:46 PM

#

Wait a second, Mr. Weak Memory Effects pasted 2 backtraces into std::sync::mpsc

#

https://github.com/matklad/once_cell/pull/182#issuecomment-1152706383

GitHub

Miri detects a deadlock? by saethlin · Pull Request #182 · matklad/...

I'm trying to do some hacking locally, and Miri is detecting a deadlock. So I'm putting up this PR just to see if this reproduces in your CI, in which case this is probably related to new M...

#

This is bizarre

tender nimbus Jun 10, 2022, 8:48 PM

#

now, i wouldn't be surprised if std::sync::mpsc was deadlocking ferrisballSweat

#

tf, running just the deadlocking test does not make it deadlock

haughty mica Jun 10, 2022, 9:01 PM

#

Anyway

tender nimbus Jun 10, 2022, 9:06 PM

#

but it's definitely this test, ignoring it makes it not deadlock

west violet Jun 10, 2022, 9:09 PM

#

So if transmuting ints is chill can bytemuck add them back?

tender nimbus Jun 10, 2022, 9:09 PM

#

i don't think the transmute in the other direction is cool

west violet Jun 10, 2022, 9:10 PM

#

Ah

#

Funky

#

Oh yah since it strips provenance

tender nimbus Jun 10, 2022, 9:18 PM

#

i wonder why the stampede_once is disabled under miri

#

ah, i guess the old miri scheduler didn't support it

#

it works now

#

it doesn't deadlock with all miri seeds, but it happens to deadlock if you run the entire test suite with seed 0

#

the miri seed 🅱️ reproduces the issue with just the one test

#

the test is doing some funky things with channels, threads and oncecells

#

it does look good to my brain, though i don't think my brain is very good at this

#

hmmmmmmmmmm, this is interesting

#

if i add a SeqCst fence to the initializer it looks great ("looks great" meaning that it passes 0x11 seeds and then fails because, uhm, deallocating while item is protected: [SharedReadWrite for <222825> (call 65038)] (somewhere in mpsc corro )

#

the fun thing is that there's a nice comment on initialize

Safety: synchronizes with store to value via SeqCst read from state,
yet the inner method only uses Acquires
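As background for the ordering mismatch noted above, here is a generic sketch of how a Release store paired with an Acquire load publishes an earlier write to another thread. This is not once_cell's implementation; `publish_and_read` and its variable names are made up for illustration.

```rust
use std::sync::atomic::{AtomicBool, AtomicU32, Ordering};
use std::sync::Arc;
use std::thread;

/// Spawns a writer that publishes a value, then reads it on this thread.
pub fn publish_and_read() -> u32 {
    let value = Arc::new(AtomicU32::new(0));
    let ready = Arc::new(AtomicBool::new(false));
    let (v, r) = (Arc::clone(&value), Arc::clone(&ready));

    let writer = thread::spawn(move || {
        v.store(42, Ordering::Relaxed);
        // Release: everything written before this store becomes visible
        // to any thread that observes `true` with an Acquire load.
        r.store(true, Ordering::Release);
    });

    // Acquire: once this returns true, the store of 42 is visible here.
    while !ready.load(Ordering::Acquire) {
        std::hint::spin_loop();
    }
    let seen = value.load(Ordering::Relaxed);
    writer.join().unwrap();
    seen
}
```

A Release/Acquire pair on the same atomic is enough for this kind of one-way publication; SeqCst additionally imposes a single total order across all SeqCst operations, which is a stronger guarantee than the one shown here.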

haughty mica Jun 10, 2022, 9:35 PM

#

Ralf thinks this might also be a futex issue

#

Comment in the zulip

#

https://rust-lang.zulipchat.com/#narrow/stream/269128-miri/topic/Weak.20memory.20emulation.20causes.20once_cell.20tests.20to.20deadlock


#

I have not seen that protector error, but there is an issue with dangling Arc

tender nimbus Jun 10, 2022, 10:26 PM

#

https://github.com/rust-lang/miri/issues/2223

#

looks like a miri issue

#

ferrisRelieved

haughty mica Jun 11, 2022, 1:38 AM

#

I coaxed a SIGSEGV out of another Facebook codebase ayyyyyyy

haughty mica Jun 11, 2022, 2:00 AM

#

And a Solana crate lmaooo

west violet Jun 11, 2022, 3:41 AM

#

Time to crash the blockchain

haughty mica Jun 11, 2022, 4:29 AM

#

Oh boy oh boy -Zrandomize-layout + -Zbuild-std is starting to turn up crates that SIGILL

#

That's almost certainly misuse of an unsafe API in the standard library

#

I already see one crate we use at work which is just so exciting

#

In case anyone wants to take a crack at some of these in the meantime:

abomonation/0.7.3
plotters-bitmap/0.3.1
encode_unicode/0.3.6
safe-transmute/0.11.2
fallible_collections/0.4.4
plotters/0.3.1
heapless/0.7.13
wasmer/2.3.0
typed-index-collections/3.0.3
slice-deque/0.3.0
swc_ecma_transforms_compat/0.102.0
swc_ecma_transforms_optimization/0.128.0
parquet/15.0.0

I already know about the issues in abomonation, heapless, and safe-transmute. The others don't jump out at me as familiar

grim copper Jun 11, 2022, 4:56 AM

#

parquet is apache so hopefully they should be able to fix issues well enough

#

also it underpins arrow / datafusion / polars

tender nimbus Jun 11, 2022, 7:06 AM

#

speedy web segfault

ruby jacinth Jun 11, 2022, 7:09 AM

#

Web3isgoinggreat

tender nimbus Jun 11, 2022, 7:12 AM

#

swc isn't web3 crap, it's the web compiler/bundler for web2 things

knotty oar Jun 11, 2022, 7:15 AM

#

~~web(-3.0)~~

tender nimbus Jun 11, 2022, 7:16 AM

#

so, I'd actually care about swc ferrisBut

grim copper Jun 11, 2022, 7:20 AM

#

also it's a typescript compiler

tender nimbus Jun 11, 2022, 7:34 AM

#

with no typechecking ferrisPensive

grim copper Jun 11, 2022, 7:57 AM

#

tender nimbus with no typechecking ferrisPensive

hah didn't know that. I guess it kinda makes sense if compile speed is a priority though

#

I guess your language server can do the type checking instead..

haughty mica Jun 11, 2022, 8:04 AM

#

grim copper parquet is apache so hopefully they should be able to fix issues well enough

The parquet crate has a lot of soundness problems historically

#

I mean like users opening tickets about segfaults

grim copper Jun 11, 2022, 8:05 AM

#

😭 all the cool crates have soundness problems

#

UB is hard

haughty mica Jun 11, 2022, 8:05 AM

#

In the case of parquet it seems kind of like they didn't care at first

#

They do now and one developer is trying to reimplement the whole thing

grim copper Jun 11, 2022, 8:06 AM

#

well it's a start I suppose

#

oh no not rkyv too

#

what is SB-invalidation?

haughty mica Jun 11, 2022, 8:12 AM

#

Creating a mutable reference or doing a write through a raw pointer removes all tags for the memory in question that post-date the source of the reborrow or the pointer used for the access
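A sketch of that rule in code, under the usual Stacked Borrows reading: a fresh `&mut` reborrow of the original place would pop the tag of a raw pointer created earlier, so the raw pointer could no longer be used. Reborrowing through the raw pointer instead keeps it valid. `write_twice` is a hypothetical example function.

```rust
/// Writes through a reference and then through the raw pointer it came from.
pub fn write_twice(x: &mut i32) -> i32 {
    let raw: *mut i32 = x;

    // Invalidating order (left as a comment on purpose):
    //   let r = &mut *x;       // new reborrow of `x` pops `raw`'s tag
    //   *r = 1;
    //   unsafe { *raw = 2 };   // UB under Stacked Borrows: tag is gone

    // Reborrow through `raw`, so the new tag sits above `raw` on the stack.
    let r = unsafe { &mut *raw };
    *r = 1;
    // Writing through `raw` pops `r`'s tag; `r` is not used after this point.
    unsafe { *raw += 1 };
    unsafe { *raw }
}
```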

#

This is not a good behavior in SB

grim copper Jun 11, 2022, 8:12 AM

#

what does SB stand for?

#

oh stacked borrows

#

nvm

#

I wonder if we could get a mod to pin https://miri.saethlin.dev/ in this channel - it's a super cool tool!

haughty mica Jun 11, 2022, 8:13 AM

#

It's in the opening comment I think

tender nimbus Jun 11, 2022, 8:14 AM

#

yes

grim copper Jun 11, 2022, 8:14 AM

#

can you get to the opening comment without scrolling way up?

tender nimbus Jun 11, 2022, 8:14 AM

#

I don't think so

#

ferrisballSweat

ruby jacinth Jun 11, 2022, 8:14 AM

#

Just ask mod to pin it

grim copper Jun 11, 2022, 8:15 AM

#

@tender nimbus are you able to pin here?

tender nimbus Jun 11, 2022, 8:15 AM

#

<@&631915156854538260> ferrisPlead can you pin the original post

#

this is also something for forum feedback

wintry forge Jun 11, 2022, 8:16 AM

#

tender nimbus Jun 11, 2022, 8:16 AM

#

thanks ferrisOwO

grim copper Jun 11, 2022, 8:16 AM

#

cheers m8

wintry forge Jun 11, 2022, 8:17 AM

#

https://media.discordapp.net/attachments/633711059445874691/927388814660481085/bow.gif

grim copper Jun 11, 2022, 8:21 AM

#

I wonder how many of these errors still apply if you ignore provenance

#

looks like even stuff like once_cell is failing due to provenance checks

tender nimbus Jun 11, 2022, 8:23 AM

#

yeah, most of these are provenance related

#

There are also tons of nullptr derefs because of bindgen tests

haughty mica Jun 11, 2022, 8:25 AM

#

Not very many

tender nimbus Jun 11, 2022, 8:26 AM

#

738

haughty mica Jun 11, 2022, 8:26 AM

#

I should upload the version of this with SB disabled

tender nimbus Jun 11, 2022, 8:26 AM

#

yeah that would be very useful as well

#

not seeing so many uninit memory (http) is soo nice ferrisHeartEyes

grim copper Jun 11, 2022, 8:28 AM

#

is SB == provenance stuff? I always kinda thought that provenance is some subset of SB?

tender nimbus Jun 11, 2022, 8:28 AM

#

sb is a model for handling provenance basically

#

the concept of provenance itself is not rust specific (also happens in every other compiled language with optimizing compilers)

#

rerunning my script on the httpless data reveals the new top ub causer

📎 causes.txt

grim copper Jun 11, 2022, 8:30 AM

#

yeah, I understand that much, I just think I wasn't understanding the relationship, but that makes sense

tender nimbus Jun 11, 2022, 8:30 AM

#

bytes already has a fix, they just need a new version release ferrisSob

grim copper Jun 11, 2022, 8:31 AM

#

rust-crypto oh boy

#

that's not a good sign

tender nimbus Jun 11, 2022, 8:31 AM

#

let mut tmp: u32 = mem::uninitialized();

#

imagine zero init ferrisClueless

grim copper Jun 11, 2022, 8:32 AM

#

bro wat..

#

why are they...

tender nimbus Jun 11, 2022, 8:32 AM

#

🚀p🚀e🚀r🚀f🚀o🚀r🚀m🚀a🚀n🚀c🚀e🚀

pastel lily Jun 11, 2022, 8:32 AM

#

perf (but we didn't profile)

tender nimbus Jun 11, 2022, 8:33 AM

#

profiling is for nerds

#

we want blazingly fast programs

grim copper Jun 11, 2022, 8:33 AM

#

wait... is the rust-crypto crate different from the rust crypto org?

#

it is isn't it...

tender nimbus Jun 11, 2022, 8:34 AM

#

https://github.com/DaGenix/rust-crypto/ not in the org

grim copper Jun 11, 2022, 8:34 AM

#

ffs

tender nimbus Jun 11, 2022, 8:34 AM

#

but still highly downloaded

#

the uninit memory should be a trivial fix

grim copper Jun 11, 2022, 8:35 AM

#

yeah

#

MaybeUninit

tender nimbus Jun 11, 2022, 8:35 AM

#

no, zero init

grim copper Jun 11, 2022, 8:35 AM

#

oh right it's a u32
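The "no, zero init" fix can be sketched like this, assuming the temporary is an accumulator that gets overwritten anyway. `read_u32_be` is a hypothetical function, not rust-crypto's code. For a plain integer, starting from `0` costs essentially nothing and removes the uninitialized read entirely.

```rust
/// Assembles a big-endian `u32` from four bytes.
pub fn read_u32_be(bytes: &[u8; 4]) -> u32 {
    // Plain zero initialization instead of `mem::uninitialized()`.
    let mut tmp: u32 = 0;
    for &b in bytes {
        tmp = (tmp << 8) | b as u32;
    }
    tmp
}
```

`MaybeUninit` is the right tool only when initialization is genuinely deferred, for example for a large buffer filled by an out-parameter call.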

proper belfry Jun 11, 2022, 8:35 AM

#

last updated 2016
don’t think there’s merging that

tender nimbus Jun 11, 2022, 8:35 AM

#

oh

grim copper Jun 11, 2022, 8:35 AM

#

oh boy

#

rustsec time

proper belfry Jun 11, 2022, 8:36 AM

#

https://rustsec.org/advisories/RUSTSEC-2016-0005.html ¯_(ツ)_/¯

RUSTSEC-2016-0005: rust-crypto: rust-crypto is unmaintained; switch...

Security advisory database for Rust crates published through https://crates.io

grim copper Jun 11, 2022, 8:36 AM

#

oof

proper belfry Jun 11, 2022, 8:36 AM

#

and https://rustsec.org/advisories/RUSTSEC-2022-0011.html

tender nimbus Jun 11, 2022, 8:36 AM

#

ferrisBut

#

i think you're better off PRing these people https://crates.io/crates/rust-crypto/reverse_dependencies

grim copper Jun 11, 2022, 8:38 AM

#

most downloaded rev dependency was last updated 2 years ago

#

of course it's a merkle tree crate and is definitely in every rust blockchain project

#

💯

tender nimbus Jun 11, 2022, 8:38 AM

#

merkletree had its last update a month ago

grim copper Jun 11, 2022, 8:39 AM

#

oh I guess only last release 2 years ago

grim copper Jun 11, 2022, 8:42 AM

#

tender nimbus rerunning my script on the httpless data reveals the new top ub causer

what is the number on the left?

tender nimbus Jun 11, 2022, 8:44 AM

#

how many crates failed miri because they had the bad dep on the right

grim copper Jun 11, 2022, 8:44 AM

#

ah I see

#

IMO the most useful metric would be sorting unsound crates by recent downloads

tender nimbus Jun 11, 2022, 8:45 AM

#

they are already sorted by downloads on the website

#

idk whether it's total or recent though

grim copper Jun 11, 2022, 8:45 AM

#

oh, didn't know that

tender nimbus Jun 11, 2022, 8:46 AM

#

tender nimbus writing extremely cursed (fish) shell scripts again to sort the ub crates by dow...

you're not the only one ferrisBut

grim copper Jun 11, 2022, 8:48 AM

#

might be worth adding it to the original post

#

nice 👍

tender nimbus Jun 11, 2022, 8:49 AM

#

haughty mica In case anyone wants to take a crack at some of these in the meantime: ``` abomo...

checking out the swcs

knotty oar Jun 11, 2022, 8:49 AM

#

~~relax i'll handle the prs if you don't want to~~

tender nimbus Jun 11, 2022, 8:50 AM

#

knotty oar ~~relax i'll handle the prs if you don't want to~~

can you do rollups for edits of my forum post

#

nice, i got the sigill as well

knotty oar Jun 11, 2022, 8:51 AM

#

what forum post 😛

grim copper Jun 11, 2022, 8:52 AM

#

every time I see a rollup I think of these

tender nimbus Jun 11, 2022, 8:52 AM

#

knotty oar what forum post 😛

the op of this thread

knotty oar Jun 11, 2022, 8:53 AM

#

linkez-moi

grim copper Jun 11, 2022, 8:53 AM

#

tis pinned

tender nimbus Jun 11, 2022, 8:55 AM

#

fun, it sigills somewhere in slice indexing

#

uuuhm, why does it hit a ud2 instruction after a ret

#

ah because a branch nvm

knotty oar Jun 11, 2022, 8:59 AM

#

ah wait you mean saethlin's repo

tender nimbus Jun 11, 2022, 9:00 AM

#

i mean this ferrisballSweat

knotty oar Jun 11, 2022, 9:00 AM

#

ye that

tender nimbus Jun 11, 2022, 9:00 AM

#

ooooooooooo

#

swc is doing get_unchecked after having set_lened to 0
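A sketch of one sound way to reuse a `Vec`'s allocation, assuming the original pattern was roughly "`set_len(0)`, then write elements with `get_unchecked_mut(i)`". After the length is set to zero every index is out of bounds for the slice, even though the capacity is still there. `refill` is a hypothetical helper, not the actual swc fix.

```rust
/// Refills `v` with `n` copies of `byte`, reusing its allocation.
pub fn refill(v: &mut Vec<u8>, n: usize, byte: u8) {
    v.clear();
    v.reserve(n);
    // The uninitialized tail is exposed as `MaybeUninit` slots, so writing
    // here never goes through slice indexing past `len`.
    for slot in &mut v.spare_capacity_mut()[..n] {
        slot.write(byte);
    }
    // SAFETY: the first `n` elements were just initialized and
    // `n <= capacity` after the `reserve` call.
    unsafe { v.set_len(n) };
}
```

For this simple case `v.clear(); v.resize(n, byte);` does the same thing with no `unsafe` at all; the explicit version only pays off when elements are produced one at a time.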

pastel lily Jun 11, 2022, 9:01 AM

#

ferrisThonk

tender nimbus Jun 11, 2022, 9:02 AM

#

the good news
a bunch of tests pass now
the bad news
it still sigills at some point

#

it's really nice that cargo tells you the name of the test binary

#

ah, the same pattern 20 lines above

#

so lmao, it's not even the randomize-layout that's going on here but the debug asserts from build-std

#

and it passes!

tender nimbus Jun 11, 2022, 10:10 AM

#

ah yes, love it when ci does that

#

as you can see, im a very busy person on github with so many notifications

knotty oar Jun 11, 2022, 10:44 AM

#

ah github notifications the thing that nobody checks

tender nimbus Jun 11, 2022, 11:28 AM

#

And my swc fix is in the docs.rs build queue, nice

#

fewwis

haughty mica Jun 11, 2022, 12:30 PM

#

I obsessively check my notifications

#

It's the only way to stay on top of like 10 PRs

tender nimbus Jun 11, 2022, 12:34 PM

#

same

knotty oar Jun 11, 2022, 12:43 PM

#

i check it via my email

haughty mica Jun 11, 2022, 12:44 PM

#

To each their own. My email is far too messy for that

knotty oar Jun 11, 2022, 12:44 PM

#

ya fair

#

i wouldn't mind using github notifications, i thought it would solve my problems but ugh the UI just makes me not remember it exists

west violet Jun 11, 2022, 1:52 PM

#

haughty mica Oh boy oh boy `-Zrandomize-layout` + `-Zbuild-std` is starting to turn up crates...

good, good

#

Good that it faults abomonation, nice to have confirmation it’s working properly

tough leaf Jun 11, 2022, 2:00 PM

#

i forgot about that crate

#

and now i wish i didn't remember it

west violet Jun 11, 2022, 2:05 PM

#

I still have no idea how to fix the padding thing

#

No one’s given me any help I know how to act on

haughty mica Jun 11, 2022, 2:06 PM

#

I don't know what you mean

#

You can query the layout of a type

west violet Jun 11, 2022, 2:06 PM

#

My extension of randomize that adds random padding before fields

#

There’s a bug with option on guaranteed niche types rn

#

But I don’t know how to express "if is option with guaranteed niche T"

#

https://rust-lang.zulipchat.com/#narrow/stream/131828-t-compiler/topic/special.20casing.20option.20niche.20layouts/near/285557188

haughty mica Jun 11, 2022, 2:08 PM

#

Yeah I'm in there

#

Surely you can detect option-like enums right?

west violet Jun 11, 2022, 2:09 PM

#

Well that’s the thing, it’s just option specifically

haughty mica Jun 11, 2022, 2:09 PM

#

I think you're making this too hard for yourself

west violet Jun 11, 2022, 2:09 PM

#

And I’m not at all familiar with the compiler so I don’t know anything

#

The biggest issue is that I just don’t know what’s going on

haughty mica Jun 11, 2022, 2:10 PM

#

You could just skip everything that looks like Option.
I for one just stomp around the docs until I find something that looks useful

#

For example, if I search for layout there are a few helpful looking functions, depending on what you have https://doc.rust-lang.org/stable/nightly-rustc/rustc_middle/?search=layout

west violet Jun 11, 2022, 2:13 PM

#

I’m not sure how much of those I can use since I’m within layout compilation

#

Maybe I just need to check https://doc.rust-lang.org/stable/nightly-rustc/rustc_middle/ty/struct.ReprOptions.html#method.inhibit_enum_layout_opt?

haughty mica Jun 11, 2022, 2:14 PM

#

Perhaps

west violet Jun 11, 2022, 2:14 PM

#

Hum, I don’t think so

#

Option is just repr rust

#

I’m also not entirely sure what my function "knows" about

#

I don’t know if it knows it’s within an option or anything since it’s just calculating an aggregate type layout

#

Oh? https://doc.rust-lang.org/stable/nightly-rustc/rustc_hir/lang_items/struct.LanguageItems.html#method.option_some_variant

haughty mica Jun 11, 2022, 2:21 PM

#

You also have the layout of all the fields already

#

The SIGILL list is growing

tender nimbus Jun 11, 2022, 2:41 PM

#

some of those are actually just build-std debug assertions instead of randomize layout (swc was just a debug assertion)

tender nimbus Jun 11, 2022, 2:42 PM

#

west violet But I don’t know how to express "if is option with guaranteed niche T"

I would just express "if it's option, don't"
Even if there is no niche

west violet Jun 11, 2022, 2:42 PM

#

Eh, that's sub-optimal

haughty mica Jun 11, 2022, 3:00 PM

#

I suspect most of them are just from turning on the stdlib debug assertions

#

Though it's possible that adding randomize-layout in there produced some problems which are only detected by the debug assertions

west violet Jun 11, 2022, 3:01 PM

#

Also potentially true, you could check with build-std without randomize

haughty mica Jun 11, 2022, 3:01 PM

#

It's worth noting that the only other way I have to detect layout problems is looking for a SIGSEGV from a randomize-layout run, which basically only happens when you confuse pointers

#

Looking for test failures is too hard

#

I plan on diagnosing all of these crashes individually, so while it might be interesting to do build-std without randomize I think I'll power through

west violet Jun 11, 2022, 3:30 PM

#

This is the function I'm working in btw if you have any ideas https://doc.rust-lang.org/nightly/nightly-rustc/rustc_middle/ty/layout/struct.LayoutCx.html#method.univariant_uninterned

#

I'm just confused tbh

#

This is a cry for help

haughty mica Jun 11, 2022, 3:31 PM

#

Ask more detailed follow-up questions on the zulip

#

People there are trying to be helpful but they probably are assuming you know more than you do

#

You're also not in compiler/help

west violet Jun 11, 2022, 3:32 PM

#

Ah whoops

haughty mica Jun 11, 2022, 3:32 PM

#

They probably don't mind much

west violet Jun 11, 2022, 3:32 PM

#

Can you move threads?

#

Or it doesn't matter

haughty mica Jun 11, 2022, 3:33 PM

#

You could ask someone to move it for you

#

But also if someone hasn't suggested it already they probably don't care that much

#

It sure is cool that the authors of cryptography crates test their code with sanitizers and/or Miri

west violet Jun 11, 2022, 3:44 PM

#

~~Sarcasm, I assume?~~

haughty mica Jun 11, 2022, 3:45 PM

#

It's a brown M&M angle

west violet Jun 11, 2022, 3:45 PM

#

I'm not familiar with that reference

haughty mica Jun 11, 2022, 3:47 PM

#

https://www.insider.com/van-halen-brown-m-ms-contract-2016-9

Insider

There's a brilliant reason why Van Halen asked for a bowl of M&Ms w...

It wasn't because they were a bunch of a**holes.

#

"If I came backstage, having been one of the architects of this lighting and staging design, and I saw brown M&Ms on the catering table, then I guarantee the promoter had not read the contract rider, and we would have to do a serious line check" of the entire stage setup, Roth said.

west violet Jun 11, 2022, 3:48 PM

#

Ahhh, smart approach

#

Yah, I guess that's fairly indicative of them caring about safety and whatnot

haughty mica Jun 11, 2022, 3:49 PM

#

Yes this is a harmless little UB in your test, but the fact that it's in here tells me that nobody is using ASan on this library

tender nimbus Jun 11, 2022, 4:09 PM

#

why would one even use ASan or Miri
my code is perfect, i don't need such commonfolk tools

haughty mica Jun 11, 2022, 4:14 PM

#

How do I put up an advisory that just says "holy shit do not use this crate please"

west violet Jun 11, 2022, 4:15 PM

#

~~Abomonation be like~~

haughty mica Jun 11, 2022, 4:15 PM

#

Whiplash from

Use after free
Protector error due to inserting into a hashmap while holding a reference across the insert

west violet Jun 11, 2022, 4:16 PM

#

Do you have any sanitizer ci bases for me btw?

#

Github ci

haughty mica Jun 11, 2022, 4:16 PM

#

What's a base

#

I'm lost

west violet Jun 11, 2022, 4:16 PM

#

Starters I guess?

#

I donno the term

#

I want to run asan on github ci, do you have any examples of that

haughty mica Jun 11, 2022, 4:16 PM

#

Normal cargo test but with

env:
    RUSTFLAGS: -Zsanitizer=address

west violet Jun 11, 2022, 4:17 PM

#

Wait really?

haughty mica Jun 11, 2022, 4:17 PM

#

Yes that's all you have to do

west violet Jun 11, 2022, 4:17 PM

#

You don't need to run it under anything special?

haughty mica Jun 11, 2022, 4:17 PM

#

Nope, just a nightly toolchain and a flag

west violet Jun 11, 2022, 4:17 PM

#

Just on linux I assume?

#

Neat

#

And can you do multiple sanitizers at the same time?

haughty mica Jun 11, 2022, 4:17 PM

#

ASan supports MacOS and on Windows it's supported by clang

#

You can mix ASan and UBsan, which is not supported on Rust because there's no reason to

#

But other than that no you cannot mix them, you need to do them one at a time because their shadow memory runtimes collide

west violet Jun 11, 2022, 4:18 PM

#

Gotcha

tender nimbus Jun 11, 2022, 4:48 PM

#

west violet Do you have any sanitizer ci bases for me btw?

arc_swap has great CI ferrisOwO

tender nimbus Jun 11, 2022, 4:49 PM

#

haughty mica How do I put up an advisory that just says "holy shit do not use this crate plea...

make a 9.8 critical cve ferrisClueless , just like the one from failure

haughty mica Jun 11, 2022, 4:50 PM

#

Holy shit it got a 9.8 for being EoL

#

Oh it's type confusion. That's still probably not a 9.8 but it's not just for being unmaintained

tender nimbus Jun 11, 2022, 4:52 PM

#

it got a 9.8 for "if the programmer willingly exploits a weakness in a library they are using, they can cause ub in safe code!"

#

which is very yikes

haughty mica Jun 11, 2022, 4:55 PM

#

To be fair, rustsec is part of the problem here

proper belfry Jun 11, 2022, 4:55 PM

#

tender nimbus it got a 9.8 for "if the programmer willingly exploits a weakness in a library t...

can we give every C library ever a 9.8 CVE ferrisBut

haughty mica Jun 11, 2022, 4:56 PM

#

The fact that rustsec keeps saying "attack vector: network" is probably a big part of the problem here

tender nimbus Jun 11, 2022, 4:57 PM

#

attack vector: funny developer

#

i will suggest an improvement to the failure advisory

#

i mean

haughty mica Jun 11, 2022, 4:58 PM

#

I just don't know how you defend this

tender nimbus Jun 11, 2022, 4:58 PM

#

i have no idea what to even fill out here

knotty oar Jun 11, 2022, 4:58 PM

#

haughty mica How do I put up an advisory that just says "holy shit do not use this crate plea...

report it to wg-sec they should handle it 😛

tender nimbus Jun 11, 2022, 4:58 PM

#

like, there are no attack vectors

haughty mica Jun 11, 2022, 4:58 PM

#

Attack vector: Network + Confidentiality: High means you can use this vuln to dump the whole contents of a web server

knotty oar Jun 11, 2022, 4:58 PM

#

tender nimbus i have no idea what to even fill out here

~~eenie meenie minie moe~~

tender nimbus Jun 11, 2022, 4:58 PM

#

and if a developer willingly writes bad code
then everything would possibly be the vector

haughty mica Jun 11, 2022, 4:58 PM

#

knotty oar report it to wg-sec they should handle it 😛

every single CVSS score they put out is like this

tender nimbus Jun 11, 2022, 4:59 PM

#

filling them out realistically gave me this

#

ferrisBut

haughty mica Jun 11, 2022, 5:00 PM

#

Like, look at this: https://rustsec.org/advisories/RUSTSEC-2019-0035.html

RUSTSEC-2019-0035: rand_core: Unaligned memory access › RustSec Adv...

Security advisory database for Rust crates published through https://crates.io

knotty oar Jun 11, 2022, 5:00 PM

#

tender nimbus filling them out realistically gave me this

~~congrats nils, you are now a member of wg-sec~~

haughty mica Jun 11, 2022, 5:00 PM

#

Unaligned access means you can... totally own a server over the network apparently?

tender nimbus Jun 11, 2022, 5:00 PM

#

knotty oar ~~congrats nils, you are now a member of wg-sec~~

inb4 updating the log4shell advisory to do

tender nimbus Jun 11, 2022, 5:01 PM

#

haughty mica Unaligned access means you can... totally own a server over the network apparent...

yeah see, if the memory is not aligned as you want, it could rebel! and if memory rises up against you, everything is bad

haughty mica Jun 11, 2022, 5:01 PM

#

FYI there are 3 log4shell CVEs, people lost their minds over all 3, and 2 of them are rated lower than most of the RustSec CVSS scores

knotty oar Jun 11, 2022, 5:01 PM

#

aye

tender nimbus Jun 11, 2022, 5:02 PM

#

see, the log4shell attack are only hypothetical
who in their right mind would put up a java webserver????
yet when I grep through my code for __private_get_type_id__ , I get hundreds of results
this is critical to web integrity

#

i'm giving it low here because idk, when a developer fucks this up maybe they could get hacked

#

why is there no attack vector "access to source code"
Oh right, because then the CVE would make no sense!

haughty mica Jun 11, 2022, 5:13 PM

#

as_mut_ptr considered harmful: Now without Stacked Borrows

tender nimbus Jun 11, 2022, 5:13 PM

#

https://rust-lang.zulipchat.com/#narrow/stream/146229-wg-secure-code/topic/Too.20high.20CVSS.20scores.20for.20Rustsec/near/285788895

tender nimbus Jun 11, 2022, 5:13 PM

#

haughty mica `as_mut_ptr` considered harmful: Now without Stacked Borrows

oh no what happened

haughty mica Jun 11, 2022, 5:14 PM

#

Two crates doing a let buffer = Thing::new().as_mut_ptr();
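A minimal sketch of why that line dangles and how to avoid it. `Thing` here is a hypothetical stand-in for whatever type those crates use; the only assumption is that `as_mut_ptr` returns a pointer into memory owned by the value.

```rust
/// Hypothetical stand-in for the `Thing` in the message above.
struct Thing {
    bytes: [u8; 4],
}

impl Thing {
    fn new() -> Self {
        Thing { bytes: [0; 4] }
    }

    fn as_mut_ptr(&mut self) -> *mut u8 {
        self.bytes.as_mut_ptr()
    }
}

fn fill() -> [u8; 4] {
    // Dangling version (do not write this): the temporary `Thing` is dropped
    // at the end of the statement, so `buffer` would point to dead memory.
    // let buffer = Thing::new().as_mut_ptr();

    // A named binding keeps the owner alive for as long as the pointer is used.
    let mut thing = Thing::new();
    let buffer = thing.as_mut_ptr();
    for i in 0..4 {
        // In bounds: `bytes` has exactly four elements.
        unsafe { buffer.add(i).write(i as u8 + 1) };
    }
    thing.bytes
}
```

The fix is purely about lifetimes: the pointer arithmetic is unchanged, only the owner now outlives every use of `buffer`.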

tender nimbus Jun 11, 2022, 5:16 PM

#

oh no

#

corro

#

I wish crates stopped doing things

haughty mica Jun 11, 2022, 5:22 PM

#

Ah found another crate that does the same

tender nimbus Jun 11, 2022, 5:22 PM

#

is this a clippy lint

#

not like I expect people that don't run Miri to run clippy
But one can hope

haughty mica Jun 11, 2022, 5:27 PM

#

We should really have a lint against this

tame jewel Jun 11, 2022, 6:31 PM

#

haughty mica `Attack vector: Network + Confidentiality: High` means you can use this vuln to ...

No idea how 'tokio has a race condition' gets 8.1 while unaligned memory access gets >9.x
With no evidence that this even leads to practical miscompilation given anywhere. (there's double frees with less score)
Incorrect hash in sha2 gets 9.8 by means of 'Availability: High', 'Confidentiality: High'.
I have yet to discover how to reveal information or crash anything with this when it swapped two blocks of data during hashing.
CVE for libraries is entirely political, change my mind

haughty mica Jun 11, 2022, 6:32 PM

#

I agree, libraries should probably not get CVEs

#

I think I could make an argument that all my unsound advisories can be promoted to a 9.8 CVSS CVE by their logic but I don't have the patience for that

west violet Jun 11, 2022, 6:33 PM

#

Do you know why miri wouldn't recognize -Z miri-strict-provenance?


      - name: Run miri
        uses: actions-rs/cargo@v1
        env:
          OS: ${{ matrix.os }}
          PROPTEST_CASES: "10"
          MIRIFLAGS: "-Z miri-strict-provenance -Z miri-check-number-validity"
        with:
          command: miri
          args: test --all-features

error: unknown debugging option: `miri-strict-provenance`
Error: unknown debugging option: `miri-strict-provenance`
error: test failed, to rerun pass '--lib'
Error: The process '/home/runner/.cargo/bin/cargo' failed with exit code 1

tender nimbus Jun 11, 2022, 6:34 PM

#

i think you need to write it as -Zmiri-thing

#

miriflags parsing ferrisAware

#

i think it just does a split space
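A tiny illustration of that guess. It assumes, as suggested above and not verified against the actual implementation, that the `MIRIFLAGS` string is split on whitespace; `split_flags` is a hypothetical helper, not a Miri API.

```rust
/// Naive whitespace splitting of a flags string (an assumption about the
/// parser, used here only to show what each spelling turns into).
fn split_flags(flags: &str) -> Vec<&str> {
    flags.split_whitespace().collect()
}
```

Under that assumption, `-Z miri-strict-provenance` arrives as two separate arguments (`-Z` and `miri-strict-provenance`), which would be consistent with the `unknown debugging option` error above, while `-Zmiri-strict-provenance` stays a single token.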

west violet Jun 11, 2022, 6:34 PM

#

Lovely

haughty mica Jun 11, 2022, 6:34 PM

#

It's awful

west violet Jun 11, 2022, 6:35 PM

#

What's the isolation one, Zmiri-disable-isolation?

haughty mica Jun 11, 2022, 6:35 PM

#

Yes

tender nimbus Jun 11, 2022, 6:35 PM

#

yes

haughty mica Jun 11, 2022, 6:35 PM

#

They're all in the readme

tender nimbus Jun 11, 2022, 6:35 PM

#

with the - of course

#

since it's a flag!

tame jewel Jun 11, 2022, 6:37 PM

#

Lol, there's literal XSS vuln in the database with lower scores on confidentiality than the above race conditions. smh.

tender nimbus Jun 11, 2022, 6:38 PM

#

you're making fun of the race condition
but it gets even worse

#

a soundness issue (realistically non-issue) got a 9.8

#

https://github.com/github/advisory-database/pull/392

tame jewel Jun 11, 2022, 6:39 PM

#

So any as_ptr() as *mut T will get the same, then?

tender nimbus Jun 11, 2022, 6:39 PM

#

tame jewel So any `as_ptr() as *mut T` will get the same, then?

i mean, that's an actual stacked borrows issue (but doesn't deserve a cve of course)

#

what I'm talking about is
"a macro assumed that you wouldn't create a function with a very special name, and if you did create that function (you have to do this on purpose) then you can cause ub in safe rust"

tame jewel Jun 11, 2022, 6:42 PM

#

Nvm that it's an actual sb issue. We already found it can't escape the local analysis in llvm<14, so practically it can't cause miscompilation.

tender nimbus Jun 11, 2022, 6:42 PM

#

sure, it can't miscompile right now, but it's still not good

#

and should be fixed

haughty mica Jun 11, 2022, 6:43 PM

#

Casting pointers around is always valid. Are you referring to doing a write through that pointer?

tame jewel Jun 11, 2022, 6:51 PM

#

referring to returning this pointer via a library interface as &mut _

#

And in |x: &mut [u8]| &mut *(x[..4].as_ptr() as *mut [u8; 4])
which is UB according to SB but the llvm compilation will never know that the pointer can't be written through
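For comparison, a sketch of two ways to get the same `&mut [u8; 4]` without deriving a mutable reference from `as_ptr()`. The function names are made up for illustration; the closure in the message above is the pattern being replaced.

```rust
use std::convert::TryFrom;

/// Safe version: std provides `TryFrom<&mut [T]>` for `&mut [T; N]`.
fn first_four(x: &mut [u8]) -> &mut [u8; 4] {
    <&mut [u8; 4]>::try_from(&mut x[..4]).expect("a 4-element slice always converts")
}

/// Raw-pointer version: the pointer comes from `as_mut_ptr`, so it carries
/// write permission and the resulting `&mut` is allowed to write.
fn first_four_raw(x: &mut [u8]) -> &mut [u8; 4] {
    let head = &mut x[..4];
    unsafe { &mut *(head.as_mut_ptr() as *mut [u8; 4]) }
}
```

Both still panic if the slice is shorter than four bytes, exactly like the `x[..4]` in the original closure.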

haughty mica Jun 11, 2022, 6:56 PM

#

Currently, yes

#

It's a bug, but not a security issue, yet

tame jewel Jun 11, 2022, 6:58 PM

#

That's what I'm trying to say, I have no idea how to handle those advisories for that reason.

tender nimbus Jun 11, 2022, 6:58 PM

#

only things that can actually break should be an advisory

tame jewel Jun 11, 2022, 6:58 PM

#

complexity: Low, impact: None?

tender nimbus Jun 11, 2022, 6:59 PM

#

isn't complexity about attack complexity?

tame jewel Jun 11, 2022, 6:59 PM

#

the difficult thing is that future Rust compiler can make it break, and would then have to bump up all the issues

#

even compiling the same thing with gcc might do it, I don't know

#

There's no way to express that dependency in cve, because it's not built for libraries

haughty mica Jun 11, 2022, 7:01 PM

#

I'm not sure they even deserve complexity: low

#

It's more like complexity: unknown

#

The exact miscompilation that may arise isn't known, and even if one does occur I have no idea what the odds are that it would even get deployed

#

And what's the user interaction?

tender nimbus Jun 11, 2022, 7:03 PM

#

these attack vectors really really aren't made for these kinds of things

#

like, not at all

#

how the fuck should i know whether my library ub is exploitable over the network or not

haughty mica Jun 11, 2022, 7:03 PM

#

Correct, they're designed for applications not libraries

#

https://docs.rs/bittree/latest/src/bittree/lib.rs.html#68

#

chefskiss

#

Another one that I cannot patch

pastel lily Jun 11, 2022, 7:09 PM

#

Is this self referential too

haughty mica Jun 11, 2022, 7:09 PM

#

Yes, but the stack use after return is much worse

#

The self referential stuff could be patched over with some AliasiableBox or whatnot

#

Also the GitHub repo doesn't exist anymore so this one too will not be patched

tame jewel Jun 11, 2022, 7:11 PM

#

Yikes, is that meant to be some form of sentry?

haughty mica Jun 11, 2022, 7:11 PM

#

What do you mean by that?

tame jewel Jun 11, 2022, 7:12 PM

#

Is that a single linked list, and it tries to use out as a form of termination?

haughty mica Jun 11, 2022, 7:12 PM

#

Yes I think it's supposed to be a linked list... of some sort

#

Oh hell

#

https://docs.rs/serde_v2/latest/src/serde_v2/de.rs.html#69

west violet Jun 11, 2022, 7:57 PM

#

Apparently mpsc has a double free?

📎 message.txt

haughty mica Jun 11, 2022, 7:57 PM

#

That's not a double free

west violet Jun 11, 2022, 7:57 PM

#

Oh yah, a "protected deallocation"

haughty mica Jun 11, 2022, 7:58 PM

#

Protectors are a direct expression of the dereferenceable LLVM attribute that goes on references

west violet Jun 11, 2022, 7:58 PM

#

So why would this be happening?

haughty mica Jun 11, 2022, 7:58 PM

#

Does this go away if you pass -Zmiri-disable-weak-memory-emulation

west violet Jun 11, 2022, 8:00 PM

#

Related, I thought miri supported windows threads?

haughty mica Jun 11, 2022, 8:00 PM

#

It's possible that it doesn't. All I know is the Windows support is very flaky

west violet Jun 11, 2022, 8:00 PM

#

Darn

haughty mica Jun 11, 2022, 8:00 PM

#

The problem with Windows is that there are precious few contributors for it

tender nimbus Jun 11, 2022, 8:04 PM

#

not yet

#

but @golden summit is implementing them right now

golden summit Jun 11, 2022, 8:04 PM

#

Hi

#

I am

#

Almost done

west violet Jun 11, 2022, 8:06 PM

#

#WindowsRiseUp

#

Do you recommend -Zmiri-symbolic-alignment-check?

west violet Jun 11, 2022, 8:18 PM

#

haughty mica Does this go away if you pass `-Zmiri-disable-weak-memory-emulation`

No it does not

#

This is happening on mac btw

#

https://github.com/vmware/database-stream-processor/runs/6845417286?check_suite_focus=true

haughty mica Jun 11, 2022, 8:20 PM

#

That's very exciting

west violet Jun 11, 2022, 8:20 PM

#

We'll see if it happens on linux, I assume not

#

lmao

warning: associated function is never used: `name_cstr`
   --> /Users/runner/.rustup/toolchains/nightly-x86_64-apple-darwin/lib/rustlib/src/rust/library/std/src/sys/unix/fs.rs:763:8
    |
763 |     fn name_cstr(&self) -> &CStr {
    |        ^^^^^^^^^
    |
    = note: `#[warn(dead_code)]` on by default

warning: `std` (lib) generated 1 warning

haughty mica Jun 11, 2022, 8:22 PM

#

wat

west violet Jun 11, 2022, 8:23 PM

#

Oh yah, is -Zmiri-symbolic-alignment-check good though?

haughty mica Jun 11, 2022, 8:23 PM

#

Define good

west violet Jun 11, 2022, 8:23 PM

#

Should I use it

haughty mica Jun 11, 2022, 8:23 PM

#

It has false positives, so I wouldn't suggest it normally. You need to manually diagnose each thing it finds, or run Miri a bunch of times with different -Zmiri-seed values

west violet Jun 11, 2022, 8:24 PM

#

Alright cool

haughty mica Jun 11, 2022, 8:24 PM

#

It tends to not find false positives because most people do not manually align things, but when they do ferrisballSweat

west violet Jun 11, 2022, 8:24 PM

#

What flags would you recommend for CI then I guess is a better question

haughty mica Jun 11, 2022, 8:25 PM

#

-Zmiri-tag-raw-pointers -Zmiri-disable-isolation

tender nimbus Jun 11, 2022, 8:27 PM

#

how does symbolic-alignment-check work, what does it do?

west violet Jun 11, 2022, 8:27 PM

#

It checks alignment symbolically ferrisClueless

#

That is unironically how it works though

#

You can deduce a lot of things from symbolic execution

#

Basically, even though you don't know the exact pointer values, you know some attributes it has and the invariants of the operations performed on it

haughty mica Jun 11, 2022, 8:30 PM

#

No, actually

#

I wish it did symbolic execution

west violet Jun 11, 2022, 8:30 PM

#

Like, if I have <ptr align(16)> and I perform <ptr align(16)> + 16 I know that it's still aligned since forall { P: Ptr }, P % 16 == 0, (P + 16) % 16 == 0

tender nimbus Jun 11, 2022, 8:30 PM

#

oh god, mockall is super cursed

haughty mica Jun 11, 2022, 8:31 PM

#

What it says is "a pointer to a u8 can never be used for a read which required an alignment greater than 1"

west violet Jun 11, 2022, 8:31 PM

#

Oh

tender nimbus Jun 11, 2022, 8:31 PM

#

ferrisClueless

#

very clueless of miri

haughty mica Jun 11, 2022, 8:31 PM

#

Frankly it's amazing that this doesn't just hork on everything

#

If you inspect the address of a pointer to walk it up to a correct alignment, it has no idea and you get a false positive

west violet Jun 11, 2022, 8:32 PM

#

~~Although I guess technically it could actually use concolic execution here~~

haughty mica Jun 11, 2022, 8:32 PM

#

Yes it could and that would be awesome

#

Misaligned pointers are very common in the Rust ecosystem and they're a pain to debug because Miri only catches them sometimes, and Ralf has a theoretical concern about a better alignment strategy for what people usually do

#

The official solution is "run Miri a bunch with different seeds"
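Since the misaligned reads usually come from casting a `*const u8` into a wider pointer, here is a sketch of two ways to read a `u32` out of a byte buffer that never depend on the buffer's alignment. The function names are hypothetical.

```rust
use std::convert::TryInto;
use std::ptr;

/// Safe version: copy four bytes out and decode them.
fn read_u32_le(bytes: &[u8], offset: usize) -> u32 {
    let chunk: [u8; 4] = bytes[offset..offset + 4]
        .try_into()
        .expect("exactly four bytes");
    u32::from_le_bytes(chunk)
}

/// Raw-pointer version (native endianness).
fn read_u32_raw(bytes: &[u8], offset: usize) -> u32 {
    assert!(offset + 4 <= bytes.len());
    // Casting to `*const u32` and calling `.read()` would require 4-byte
    // alignment; `read_unaligned` has no such requirement.
    unsafe { ptr::read_unaligned(bytes.as_ptr().add(offset) as *const u32) }
}
```

Because neither version depends on where the allocation happens to land, there is nothing here for a particular Miri seed to hide.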

tender nimbus Jun 11, 2022, 8:33 PM

#

yeah alignment is hard because you can hardly detect randomly getting correct alignment

west violet Jun 11, 2022, 8:33 PM

#

@neon tiger implement concolic pointer alignment checks in miri

haughty mica Jun 11, 2022, 8:34 PM

#

Honestly I think just maximally misaligning allocations would be way better

west violet Jun 11, 2022, 8:34 PM

#

Sure, that could also work

#

But both is better

haughty mica Jun 11, 2022, 8:34 PM

#

It's a very easy implementation and anecdotally it doesn't suffer the concern Ralf has

tender nimbus Jun 11, 2022, 8:35 PM

#

haughty mica Honestly I think just maximally misaligning allocations would be way better

that does sound like the best solution

west violet Jun 11, 2022, 8:35 PM

#

Robust and accurate checks and perturbations give the best results

haughty mica Jun 11, 2022, 8:36 PM

#

I need to bug libs about doing that in the standard library

tender nimbus Jun 11, 2022, 8:36 PM

#

today i wrote another very cursed shell script ferrisClueless

📎 lmao.sh

#

whether a list of repos contains ub

west violet Jun 11, 2022, 8:36 PM

#

Oh no it do be broken on linux

📎 message.txt

#

That's with -Zmiri-tag-raw-pointers -Zmiri-disable-isolation

haughty mica Jun 11, 2022, 8:37 PM

#

That makes sense

#

You shouldn't even need raw pointer tagging for this situation

tender nimbus Jun 11, 2022, 8:38 PM

#


unsafe fn tm_array<T, U, const N: usize>(array: [T; N]) -> [U; N] {
    let array = ManuallyDrop::new(array);
    unsafe { array.as_ptr().cast::<[U; N]>().read() }
}


        let uninit = MaybeUninit::<[T; N]>::uninit();
        let uninit = unsafe { tm_array::<T, MaybeUninit<T>, N>(MaybeUninit::assume_init(uninit)) };

wtf is this doing

neon tiger Jun 11, 2022, 8:38 PM

#

west violet @neon tiger implement concolic pointer alignment checks in miri

oof

#

I was looking at the weak memory stuff and thinking "wow it's so cool that we can do this now but kinda sad that it's random"

haughty mica Jun 11, 2022, 8:43 PM

#

tender nimbus ```rust unsafe fn tm_array<T, U, const N: usize>(array: [T; N]) -> [U; N] { ...

That's a transmute

#

Without the size check

pastel lily Jun 11, 2022, 8:44 PM

#

does transmute not like const generics?

haughty mica Jun 11, 2022, 8:44 PM

#

Transmute does not like generics

west violet Jun 11, 2022, 8:44 PM

#

haughty mica Without the size check

It's faster 😎

pastel lily Jun 11, 2022, 8:44 PM

#

yea

haughty mica Jun 11, 2022, 8:44 PM

#

I don't grasp the code at the bottom

tender nimbus Jun 11, 2022, 8:44 PM

#

it's trying really hard to do

let uninit: [MU<T>; N] = unsafe { MaybeUninit::uninit().assume_init() };

pastel lily Jun 11, 2022, 8:45 PM

#

OH

tender nimbus Jun 11, 2022, 8:45 PM

#

(but accidentally makes an uninit t)
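A sketch of the sound form of that idiom on stable Rust: the `assume_init` is applied to a `MaybeUninit<[MaybeUninit<T>; N]>`, so no uninitialized `T` is ever produced. `uninit_array` and `squares` are illustrative names, not the crate's API.

```rust
use std::mem::MaybeUninit;

/// An array of `MaybeUninit<T>` needs no initialization, so calling
/// `assume_init` on the outer `MaybeUninit` is sound.
fn uninit_array<T, const N: usize>() -> [MaybeUninit<T>; N] {
    unsafe { MaybeUninit::<[MaybeUninit<T>; N]>::uninit().assume_init() }
}

/// Example use: fill every slot, then convert to a plain array.
fn squares<const N: usize>() -> [u32; N] {
    let mut buf = uninit_array::<u32, N>();
    for (i, slot) in buf.iter_mut().enumerate() {
        slot.write((i * i) as u32);
    }
    // Sound because the loop above initialized every slot.
    buf.map(|slot| unsafe { slot.assume_init() })
}
```

The difference from the quoted code is only where the type annotation sits: `MaybeUninit::<[T; N]>::uninit().assume_init()` claims that an uninitialized `[T; N]` is valid, which is the accidental uninit `T`.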

haughty mica Jun 11, 2022, 8:45 PM

#

Nice

pastel lily Jun 11, 2022, 8:45 PM

#

fun

tender nimbus Jun 11, 2022, 8:46 PM

#

this is @thorny venture's code btw

#

i hope you're proud

haughty mica Jun 11, 2022, 8:48 PM

#

I think the top cause of use after free is people trying to test their "clears the memory on drop" things

pastel lily Jun 11, 2022, 8:48 PM

#

corro

#

i wonder can you like

#

drop_in_place

#

so it's not actually freed

haughty mica Jun 11, 2022, 8:49 PM

#

Not if it's a type that owns the data

#

I fixed the one in ed25519-dalek because it's non-owning
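A sketch of the `drop_in_place` idea for the non-owning case, assuming the secret is stored inline in the type. `Secret` is a hypothetical type, not the one from ed25519-dalek. The storage belongs to a `MaybeUninit`, so it is still live after the destructor runs and the bytes can be inspected without a use after free.

```rust
use std::mem::MaybeUninit;
use std::ptr;

/// Hypothetical inline secret; `repr(transparent)` makes the cast below valid.
#[repr(transparent)]
struct Secret([u8; 4]);

impl Drop for Secret {
    fn drop(&mut self) {
        for byte in self.0.iter_mut() {
            // Volatile so the zeroing is not optimized away.
            unsafe { ptr::write_volatile(byte, 0) };
        }
    }
}

/// Run the destructor in place, then read the bytes from storage that is
/// still owned by `slot` (nothing has been deallocated).
fn bytes_after_drop() -> [u8; 4] {
    let mut slot = MaybeUninit::new(Secret([0xAA; 4]));
    unsafe {
        ptr::drop_in_place(slot.as_mut_ptr());
        ptr::read(slot.as_ptr() as *const [u8; 4])
    }
}
```

This does not help for a type whose secret lives behind a heap allocation it frees in `drop`, which is the owning case mentioned above.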

pastel lily Jun 11, 2022, 8:50 PM

#

ah

haughty mica Jun 11, 2022, 8:52 PM

#

The right way to test them is probably with a custom allocator actually

pastel lily Jun 11, 2022, 8:52 PM

#

ah yea

haughty mica Jun 11, 2022, 8:53 PM

#

let ptr: *const u8 = mem::transmute(&self.to_be());

You do not love to see it

pastel lily Jun 11, 2022, 8:53 PM

#

what the

thorny venture Jun 11, 2022, 8:53 PM

#

tender nimbus i hope you're proud

:)

pastel lily Jun 11, 2022, 8:53 PM

#

but why

tender nimbus Jun 11, 2022, 8:53 PM

#

haughty mica ```rust let ptr: *const u8 = mem::transmute(&self.to_be()); ``` You do not love ...

ferrisSob

haughty mica Jun 11, 2022, 8:53 PM

#

They wanted a pointer to the big-endian form of a number

#

And this uh type checks
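What they presumably wanted can be written without `transmute` and without pointing at a temporary. A sketch, with `append_be` as a made-up helper: bind the big-endian bytes to a local first, then take the pointer from that local.

```rust
/// Append the big-endian encoding of `value` to `out`.
fn append_be(value: u32, out: &mut Vec<u8>) {
    // `to_be_bytes` returns an array by value; binding it keeps it alive.
    let bytes = value.to_be_bytes();
    // The pointer is valid for as long as `bytes` is in scope.
    let ptr: *const u8 = bytes.as_ptr();
    let view = unsafe { std::slice::from_raw_parts(ptr, bytes.len()) };
    out.extend_from_slice(view);
}
```

If no raw pointer is actually needed, `out.extend_from_slice(&value.to_be_bytes())` does the same thing with no `unsafe` at all.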

pastel lily Jun 11, 2022, 8:54 PM

#

my unsafe code typechecks so it must be right

tender nimbus Jun 11, 2022, 8:54 PM

#

pro tip: if code doesn't type check, insert transmute or transmute_copy to fix it

thorny venture Jun 11, 2022, 8:56 PM

#

tender nimbus it's trying really hard to do ```rust let uninit: [MU<T>; N] = unsafe { MaybeUni...

Tell std to stabilise the MU APIs already for arrays

#

I'm tired of always rewriting them

pastel lily Jun 11, 2022, 9:08 PM

#

same

neon tiger Jun 11, 2022, 9:15 PM

#

same

haughty mica Jun 11, 2022, 10:05 PM

#

https://gitlab.com/Apanatshka/aterm/-/blob/master/src/rc/mod.rs#L221

GitLab

src/rc/mod.rs · master · Jeff Smits / aterm · GitLab

The Annotated Term (ATerm) format implemented in Rust

tender nimbus Jun 11, 2022, 10:05 PM

#

uhm

haughty mica Jun 11, 2022, 10:05 PM

#

This person tried to hack around a lifetime error and wrote a use after free instead facepalm

west violet Jun 11, 2022, 10:06 PM

#

I'm having a miri moment

tender nimbus Jun 11, 2022, 10:06 PM

#

ferrisClueless

west violet Jun 11, 2022, 10:06 PM

#

It's been running for 40 minutes

haughty mica Jun 11, 2022, 10:06 PM

#

This is the way

#

Do some cfg(miri)

west violet Jun 11, 2022, 10:06 PM

#

Doesn't that kinda undermine the checking

tender nimbus Jun 11, 2022, 10:07 PM

#

do you do loops or have large inputs

#

in the test

#

if yes, cfg(miri) them down

haughty mica Jun 11, 2022, 10:07 PM

#

Yes but also if this never finishes or OOMs the machine it's on that also undermines the checking

west violet Jun 11, 2022, 10:07 PM

#

Fair

tender nimbus Jun 11, 2022, 10:07 PM

#

otherwise, if you have a few tests that take very long but most tests don't, cfg(miri)ing them out is also fine

haughty mica Jun 11, 2022, 10:07 PM

#

Do what you can on this test case and head on to other code is how I think of it

west violet Jun 11, 2022, 10:11 PM

#

Ah yep, 2048 iterations with 32 threads

tender nimbus Jun 11, 2022, 10:12 PM

#

i understand why miri doesn't like it

#

get that down to a lot less iterations and threads ferrisballSweat

haughty mica Jun 11, 2022, 10:13 PM

#

Don't shrink the threads

#

Also what code are you running I want to profile it

#

If you use less threads it's harder to hit some race conditions
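A sketch of what "cfg(miri) them down" can look like while keeping the thread count, as suggested above. The workload is a made-up counter, not the exchange test from the linked repo; only the 2048 and 32 figures come from the chat, and the reduced iteration count of 16 is an arbitrary choice.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::Arc;
use std::thread;

// Keep all the threads so interleavings are still exercised...
const THREADS: usize = 32;
// ...but do far fewer iterations when running under Miri.
const ITERATIONS: usize = if cfg!(miri) { 16 } else { 2048 };

fn run_workload() -> usize {
    let counter = Arc::new(AtomicUsize::new(0));
    let handles: Vec<_> = (0..THREADS)
        .map(|_| {
            let counter = Arc::clone(&counter);
            thread::spawn(move || {
                for _ in 0..ITERATIONS {
                    counter.fetch_add(1, Ordering::Relaxed);
                }
            })
        })
        .collect();
    for handle in handles {
        handle.join().expect("worker panicked");
    }
    // `join` synchronizes with each thread, so this sees every increment.
    counter.load(Ordering::Relaxed)
}
```

The same `cfg!(miri)` switch works for input sizes, and `#[cfg_attr(miri, ignore)]` on a test skips it entirely under Miri.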

golden summit Jun 11, 2022, 10:13 PM

#

hmm I wonder if MIRI would get noticeably faster if you threw it into souper

haughty mica Jun 11, 2022, 10:13 PM

#

No

#

I mean yes, but not a lot

#

I have a PR that takes the edge off of SB but it's not a fix

golden summit Jun 11, 2022, 10:14 PM

#

I guess it's mainly from the algorithm yeah

west violet Jun 11, 2022, 10:15 PM

#

This https://github.com/vmware/database-stream-processor/blob/0a58584ad58d1de8b7a67b03ac0fbd4d38a91dca/src/operator/communication/exchange.rs#L653-L718

haughty mica Jun 11, 2022, 10:15 PM

#

No it's mainly because the implementation leaks tags

west violet Jun 11, 2022, 10:15 PM

#

It seems to be hit-or-miss though, probably seed based

haughty mica Jun 11, 2022, 10:15 PM

#

I just want to watch it run

#

The problem with SB is that its design mandates a linear search, and because the runtime doesn't know when a pointer goes away, those linear searches grow in size over the course of program execution

#

For small programs you don't notice SB overhead hardly at all, then for large programs it eventually eats all your memory

west violet Jun 11, 2022, 10:16 PM

#

Can't you at some points stop searching?

#

Like, once you hit your required access you can stop for something like a read, right?

haughty mica Jun 11, 2022, 10:17 PM

#

Yes, that's what it does

#

You search top down and you add to the top, so if you're constantly searching for the same pointer but also creating new pointers, because for example you're reborrowing... yeah.

#

To be clear, you don't search for access in SB you search for tags
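A toy model of the cost being described. This is not Miri's implementation: `ToyBorrowStack` is hypothetical, it stores bare tags with no permissions, and it assumes nothing is ever popped, which is the leak scenario. It only shows why a top-down search for an old tag gets slower as reborrows pile up on top of it.

```rust
/// Toy borrow stack: new tags are pushed on top, lookups scan from the top.
struct ToyBorrowStack {
    tags: Vec<u64>,
}

impl ToyBorrowStack {
    fn new(base: u64) -> Self {
        ToyBorrowStack { tags: vec![base] }
    }

    /// A reborrow adds a fresh tag on top of the stack.
    fn reborrow(&mut self, tag: u64) {
        self.tags.push(tag);
    }

    /// Number of entries inspected before `tag` is found, searching top down.
    fn search_cost(&self, tag: u64) -> Option<usize> {
        self.tags
            .iter()
            .rev()
            .position(|&t| t == tag)
            .map(|index| index + 1)
    }
}
```

In this model, looking up the base tag after 100 reborrows inspects 101 entries, while the newest tag is found immediately, which is the "constantly searching for the same pointer but also creating new pointers" case.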

west violet Jun 11, 2022, 10:18 PM

#

I was meaning stuff like SharedRead or whatever

haughty mica Jun 11, 2022, 10:18 PM

#

Those are permissions but yes

#

Same deal

west violet Jun 11, 2022, 10:18 PM

#

Store the tags in a vec and simd search them

haughty mica Jun 11, 2022, 10:19 PM

#

That does not fix the algorithmic problem

west violet Jun 11, 2022, 10:19 PM

#

This is true, but isn't the linear nature fundamental?

haughty mica Jun 11, 2022, 10:19 PM

#

I have a PR that does some clever caching and it neutralizes the SB overhead for some programs

#

The linear nature is fundamental, but the leak isn't. Oli and I think there is some way to implement a garbage collector for tags

west violet Jun 11, 2022, 10:20 PM

#

Ahhh, gotcha

tender nimbus Jun 11, 2022, 10:24 PM

#

@haughty mica will you publish the disable-stacked-borrows run on your website?

haughty mica Jun 11, 2022, 10:25 PM

#

Yes

#

I keep needing my CPU for other things

#

I'll post here when it's up

tender nimbus Jun 11, 2022, 10:25 PM

#

ferristhumbsup

west violet Jun 11, 2022, 10:28 PM

#

I still vote for a volunteer-based thingy

#

I'd totally run a container for you, I'm sure others would as well

haughty mica Jun 11, 2022, 10:31 PM

#

Yeah on one core though?

#

Really what I should do is physically move my computer farther from my bed so I can run this while I'm asleep

#

You know if people want to help, figuring out some kind of caching scheme for build artifacts that doesn't also let crates corrupt it when I run their tests would be a big help to the runtime speed

west violet Jun 11, 2022, 10:32 PM

#

Containers can have more than one core

west violet Jun 11, 2022, 10:33 PM

#

haughty mica You know if people want to help, figuring out some kind of caching scheme for bu...

sccache?

haughty mica Jun 11, 2022, 10:34 PM

#

I tried it and I couldn't observe a significant improvement. Probably ~64 processes banging away on one directory that's mounted into 64 different Docker containers isn't fast

west violet Jun 11, 2022, 10:34 PM

#

Gotcha

#

iirc it has s3 support, you could hook it up to minio (fake s3, we use it in docs.rs for local tests)

haughty mica Jun 11, 2022, 10:35 PM

#

You know what, the other thing I should do is actually profile this

west violet Jun 11, 2022, 10:35 PM

#

Or maybe do cross-crate scheduling by yourself

#

Gather all (or some) of the crates that you need tested and build each of their dependency graphs

#

Merge all the graphs together and then you have a build plan
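
A rough sketch of the "merge the graphs, get a build plan" idea, under loud assumptions: crates are identified by name only (no versions or features, which real cargo resolution needs), and `DepGraph`, `merge`, and `build_plan` are hypothetical helpers, not part of cargo or any existing tool. Each "wave" contains crates whose dependencies were all built in earlier waves, so everything inside one wave could be compiled concurrently.

```rust
use std::collections::{BTreeMap, BTreeSet};

/// Crate name -> names of its direct dependencies (hypothetical, versions ignored).
pub type DepGraph = BTreeMap<String, BTreeSet<String>>;

/// Union several per-crate dependency graphs into one.
pub fn merge(graphs: &[DepGraph]) -> DepGraph {
    let mut merged = DepGraph::new();
    for graph in graphs {
        for (krate, deps) in graph {
            merged
                .entry(krate.clone())
                .or_default()
                .extend(deps.iter().cloned());
            // Make sure leaf dependencies show up as nodes too.
            for dep in deps {
                merged.entry(dep.clone()).or_default();
            }
        }
    }
    merged
}

/// Group crates into waves that can each be built in parallel.
pub fn build_plan(graph: &DepGraph) -> Vec<Vec<String>> {
    let mut done: BTreeSet<String> = BTreeSet::new();
    let mut plan = Vec::new();
    while done.len() < graph.len() {
        let wave: Vec<String> = graph
            .iter()
            .filter(|(krate, deps)| {
                !done.contains(*krate) && deps.iter().all(|d| done.contains(d))
            })
            .map(|(krate, _)| krate.clone())
            .collect();
        if wave.is_empty() {
            break; // dependency cycle: give up instead of looping forever
        }
        done.extend(wave.iter().cloned());
        plan.push(wave);
    }
    plan
}
```

A shared dependency such as `serde` appears once in the merged graph, so it is scheduled once no matter how many crates pull it in.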

haughty mica Jun 11, 2022, 10:36 PM

#

Yeah see that seems complicated

west violet Jun 11, 2022, 10:36 PM

#

This is true

haughty mica Jun 11, 2022, 10:37 PM

#

I would rather have something that's slow and not buggy than something that's really complicated but slightly faster

west violet Jun 11, 2022, 10:37 PM

#

Fair

#

I mean it's theoretically the best you can do

#

Minimal work done, maximal allowed concurrency

tender nimbus Jun 11, 2022, 10:37 PM

#

Would be nice if there was a way to have the most common dependencies precompiled

west violet Jun 11, 2022, 10:37 PM

#

Wait isn't that called a workspace

haughty mica Jun 11, 2022, 10:38 PM

#

Lol like what the playground does

west violet Jun 11, 2022, 10:38 PM

#

west violet Wait isn't that called a workspace

Cargo abomination incoming ferrisVibe

#

Clone them all into one repo and make them one big 'ole workspace

#

cargo miri test --all

tender nimbus Jun 11, 2022, 10:38 PM

#

gohno

haughty mica Jun 11, 2022, 10:38 PM

#

Timeouts are hard

#

You need to timeout things

tough leaf Jun 11, 2022, 10:38 PM

#

isn't that basically a shared cargo target dir

#

which i do

west violet Jun 11, 2022, 10:38 PM

#

Ah that's true

tender nimbus Jun 11, 2022, 10:39 PM

#

tough leaf isn't that basically a shared cargo target dir

true, this might be a good idea

west violet Jun 11, 2022, 10:39 PM

#

tough leaf isn't that basically a shared cargo target dir

I think it's better, cargo may combine their build plans into one big boi

haughty mica Jun 11, 2022, 10:39 PM

#

Also some crates stomp around on the filesystem so I'm cautious of just merging them

tender nimbus Jun 11, 2022, 10:39 PM

#

On the other hand, are shared target dirs safe for multiple compilations in parallel

west violet Jun 11, 2022, 10:39 PM

#

No

haughty mica Jun 11, 2022, 10:40 PM

#

Yes

west violet Jun 11, 2022, 10:40 PM

#

Really?

haughty mica Jun 11, 2022, 10:40 PM

#

You will simply not get any parallelism

west violet Jun 11, 2022, 10:40 PM

#

Doesn't cargo take out a lockfile

tough leaf Jun 11, 2022, 10:40 PM

#

there's a lock

west violet Jun 11, 2022, 10:40 PM

#

Touché

tender nimbus Jun 11, 2022, 10:40 PM

#

that's not great

tough leaf Jun 11, 2022, 10:40 PM

#

how often are you really compiling 2 different projects at once

tender nimbus Jun 11, 2022, 10:40 PM

#

tough leaf how often are you really compiling 2 different projects at once

I'm not, but miri-tools is

haughty mica Jun 11, 2022, 10:41 PM

#

I really need to fix the name

#

It's going to ossify

#

Thank goodness it sucks

tender nimbus Jun 11, 2022, 10:41 PM

#

ferrisBut

haughty mica Jun 11, 2022, 10:53 PM

#

west violet This <https://github.com/vmware/database-stream-processor/blob/0a58584ad58d1de8b...

I can't see SB craziness in my profile of this, and it eventually dies on some classic as_mut_ptr invalidation

#

Might just be general interpreter slow

west violet Jun 11, 2022, 11:12 PM

#

Miri is still dying with the changes if you want to look at it again ig https://github.com/vmware/database-stream-processor/actions/runs/2481434847

haughty mica Jun 11, 2022, 11:16 PM

#

Yeah uh all I see is a gray screen waiting for output

#

I'm curious to know what it's running right now but I cannot tell

haughty mica Jun 11, 2022, 11:55 PM

#

tender nimbus @haughty mica will you publish the disable-stacked-borrows run on your w...

Only has the top 10,000 but go nuts https://miri.saethlin.dev/no-sb/ub

haughty mica Jun 12, 2022, 1:24 AM

#

#3  0x00005567c2e03e92 in core::slice::raw::from_raw_parts<u8> (data=0x0, len=0) at src/slice/raw.rs:93
#4  0x00005567c2c1cb4d in font_kit::loaders::freetype::Font::rasterize_glyph (self=0x7ffd363d3c48, canvas=0x7ffd363cfcf8, 
    glyph_id=3, point_size=9.67741966, transform=..., hinting_options=...,

#

            // Safety:
            // we just allocated enough capacity and data_len is correct.
            unsafe { escape_field(bytes, self.quote_char, &mut self.data[data_len..]) }

I love when the safety comments are simply wrong

#

Really makes me wonder about the merits of requiring them on every unsafe if people just write wrong safety comments

#

@tender nimbus if you or anyone else fixes a SIGILL please shoot me a message or link me the PR or something, I want to track what bugs these are finding

ruby jacinth Jun 12, 2022, 1:48 AM

#

haughty mica Really makes me wonder about the merits of requiring them on every `unsafe` if p...

At least if the unsafe code itself looks OK at a glance but the reasoning in the safety comment is totally off, that'll make me look extra hard

haughty mica Jun 12, 2022, 1:49 AM

#

🤷 Check out the above code from polars_io

#

It's a very common mistake but it's just a wrong safety comment
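
A small sketch of one plausible reading of that mistake, assuming `self.data` is a `Vec<u8>` and `data_len` is its current length: reserving capacity does not make the slice any longer, so `&mut data[len..]` is empty even though the allocation is big enough. `write_bytes`, `append`, and `indexed_tail_len` are hypothetical helpers for illustration, not the polars_io code.

```rust
use std::mem::MaybeUninit;

/// Hypothetical writer: copies `src` into `dst`, returns the bytes written.
fn write_bytes(src: &[u8], dst: &mut [MaybeUninit<u8>]) -> usize {
    let n = src.len().min(dst.len());
    for (d, s) in dst[..n].iter_mut().zip(src) {
        d.write(*s);
    }
    n
}

/// Length of `&mut data[len..]` right after reserving `extra` bytes.
pub fn indexed_tail_len(data: &mut Vec<u8>, extra: usize) -> usize {
    data.reserve(extra);
    let len = data.len();
    // Indexing only sees initialized elements, never spare capacity.
    data[len..].len()
}

/// Append `src` by writing into the spare capacity explicitly.
pub fn append(data: &mut Vec<u8>, src: &[u8]) {
    data.reserve(src.len());
    let spare = data.spare_capacity_mut();
    let written = write_bytes(src, spare);
    // SAFETY: the first `written` bytes of the spare capacity were just
    // initialized by `write_bytes`.
    unsafe { data.set_len(data.len() + written) };
}
```

Here the safety comment talks about the exact bytes that were initialized, which is the part the quoted comment glosses over.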

west violet Jun 12, 2022, 3:58 AM

#

@haughty mica Uh https://github.com/vmware/database-stream-processor/runs/6846036453?check_suite_focus=true

#

Over 5 hours

#

And still not done, I killed it

haughty mica Jun 12, 2022, 4:00 AM

#

Yeah cool but if I view that web page I need you to know that I cannot figure out what is being run

#

If you could name the test that was still running maybe I could help, but this level of slowdown is typical

#

Const eval is ~1000x slowdown, and SB is ~infinite in the general case

tender nimbus Jun 12, 2022, 6:39 AM

#

haughty mica Only has the top 10,000 but go nuts https://miri.saethlin.dev/no-sb/ub

ferrisHeartEyes

#

arc_swap corro

tender nimbus Jun 12, 2022, 6:40 AM

#

haughty mica @tender nimbus if you or anyone else fixes a SIGILL please shoot me a mes...

You have found the SIGILL in the two swc crates yourself but here's the fix: https://github.com/swc-project/swc/pull/4943

#

wtf, invalid pointer derefs inside vec in alacritty gohno

tender nimbus Jun 12, 2022, 9:44 AM

#

running just the data race test of arc-swap in miri makes it uaf instead

#

fun

#

TSan finds the data race as well

#

yikes

#

they do run tsan in ci though ferrisWhat

#

ah there is an issue

#

https://github.com/vorner/arc-swap/issues/71

#

so they say that this is a false positive

tender nimbus Jun 12, 2022, 10:17 AM

#

this goes way above my head ferrisballSweat

#

https://github.com/vorner/arc-swap/issues/76 i opened an issue i guess

haughty mica Jun 12, 2022, 1:36 PM

#

I thought arc-swap fixed the SB issue but just didn't release it

tender nimbus Jun 12, 2022, 2:37 PM

#

ferrisClueless no

haughty mica Jun 12, 2022, 2:37 PM

#

Oh I swear I did a PR, darn

knotty oar Jun 12, 2022, 2:38 PM

#

~~i'm sure saethwin dreams about SB issues~~

haughty mica Jun 12, 2022, 2:38 PM

#

I wish my dreams were that calm

tender nimbus Jun 12, 2022, 2:38 PM

#

haughty mica I wish my dreams were that calm

who doesn't

#

thank you discord

#

please stop using relative message counts for the replies

west violet Jun 12, 2022, 2:58 PM

#

haughty mica Yeah cool but if I view that web page I need you to know that I cannot figure ou...

Yah it’s kinda annoying that gh doesn’t show in-progress logs

#

But it’s the exchange test that it’s stalled on

haughty mica Jun 12, 2022, 3:00 PM

#

Maybe this is another thing that we could tune up in libtest

#

Or cargo-miri

west violet Jun 12, 2022, 3:01 PM

#

Oh my bad I totally didn’t realize you had to log in to view logs

haughty mica Jun 12, 2022, 3:01 PM

#

Because normally right these tests are pretty quick so it doesn't matter but doctests have that nice warning when they run for a while

west violet Jun 12, 2022, 3:01 PM

#

I’ll send the log archive, gh doesn’t leak secrets in logs right?

#

Yah, nothing else has any trouble

haughty mica Jun 12, 2022, 3:01 PM

#

Just paste me the name of the test that was stuck

west violet Jun 12, 2022, 3:02 PM

#

Even all four of the sanitizers are super quick

haughty mica Jun 12, 2022, 3:02 PM

#

Yeah they're only like 2x slowdown

tender nimbus Jun 12, 2022, 3:02 PM

#

sanitizers slow a program down
unlike miri, which basically brings it to a halt

west violet Jun 12, 2022, 3:02 PM

#

operator::communication::exchange::tests::test_exchange

#

Yah I was just confirming that this is very much a miri issue, nothing else cares

haughty mica Jun 12, 2022, 3:05 PM

#

Yeah totally, that's why I think it would be best to do these sorts of hacks in cargo-miri

#

Oh this is SB thrashing

#

Just a perf top --pid $(pgrep miri | tail -n1)

tender nimbus Jun 12, 2022, 3:07 PM

#

ferrisballSweat

haughty mica Jun 12, 2022, 3:07 PM

#

Those top 3 functions are linear in the size of the borrow stack

#

Total memory usage holding steady though that's nice

#

perf says 97% of runtime in SB, but because SB also trashes your cache it's closer to 100%

tender nimbus Jun 12, 2022, 3:14 PM

#

is it better on your branch?

haughty mica Jun 12, 2022, 3:19 PM

#

It's better

#

I think I never implemented a fix for the linear behavior of find_first_write_incompatible

#

This still destroys the cache, though I could fix that, though I think Ralf would be unhappy about it

#

Curious that this doesn't stress iter_mut

tender nimbus Jun 12, 2022, 3:23 PM

#

haughty mica This still destroys the cache, though I could fix that, though I think Ralf woul...

why, what would you do?

haughty mica Jun 12, 2022, 3:23 PM

#

Well first of all this needs SoA

west violet Jun 12, 2022, 3:24 PM

#

lmao

haughty mica Jun 12, 2022, 3:24 PM

#

This loop is just a scan of Permission which is a 4-variant enum, but it's actually scanning a Vec<Item> where Item is 24 bytes

tender nimbus Jun 12, 2022, 3:24 PM

#

ferrisballSweat

haughty mica Jun 12, 2022, 3:24 PM

#

You could bit-pack away most of the cache usage there
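
A hedged sketch of the SoA idea, assuming a made-up `Item` layout (tag, permission, optional protector). The field set and the names `StackSoA`, `first_with_perm`, and `bytes_scanned_per_item` are assumptions for illustration and do not mirror Miri's real definitions. Splitting the fields into parallel vectors means a scan over permissions touches one byte per item instead of a whole `Item`.

```rust
use std::mem::size_of;
use std::num::NonZeroU64;

#[derive(Clone, Copy, Debug, PartialEq, Eq)]
#[repr(u8)]
pub enum Permission {
    Unique,
    SharedReadWrite,
    SharedReadOnly,
    Disabled,
}

/// Array-of-structs element (hypothetical layout).
pub struct Item {
    pub perm: Permission,
    pub tag: u64,
    pub protector: Option<NonZeroU64>,
}

/// Struct-of-arrays version: index `i` in each vector describes item `i`.
#[derive(Default)]
pub struct StackSoA {
    pub perms: Vec<Permission>,
    pub tags: Vec<u64>,
    pub protectors: Vec<Option<NonZeroU64>>,
}

impl StackSoA {
    pub fn push(&mut self, item: Item) {
        self.perms.push(item.perm);
        self.tags.push(item.tag);
        self.protectors.push(item.protector);
    }

    /// First index at or after `start` with permission `perm`.
    /// Only the dense `perms` vector is read.
    pub fn first_with_perm(&self, start: usize, perm: Permission) -> Option<usize> {
        self.perms
            .get(start..)?
            .iter()
            .position(|&p| p == perm)
            .map(|i| start + i)
    }
}

/// (bytes per item in the AoS scan, bytes per item in the SoA scan)
pub fn bytes_scanned_per_item() -> (usize, usize) {
    (size_of::<Item>(), size_of::<Permission>())
}
```

Since a 4-variant enum needs only 2 bits, the `perms` vector could be packed four to a byte on top of this; that step is left out to keep the sketch short.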

#

So yeah

tender nimbus Jun 12, 2022, 3:25 PM

#

why would this make ralf unhappy? because it would make the code harder to read?

haughty mica Jun 12, 2022, 3:25 PM

#

It makes the code harder to hack on

west violet Jun 12, 2022, 3:25 PM

#

Dammit, why can't I remember the word?

#

It's for when there's a bad thing and you've found a situation that exacerbates it to horrible levels

haughty mica Jun 12, 2022, 3:25 PM

#

The changes Ralf wants to make to SB are very much in the guts of how all these lookups into the borrow stacks work

west violet Jun 12, 2022, 3:25 PM

#

haughty mica It makes the code harder to hack on

If you make the api nice enough it could be a moot point?

west violet Jun 12, 2022, 3:26 PM

#

west violet It's for when there's a bad thing and you've found a situation that exacerbates ...

Specifically in software, it's not an edge case but close?

#

Dammit

tender nimbus Jun 12, 2022, 3:26 PM

#

there was a soa derive crate that made this kind of not completely horrible i think?

west violet Jun 12, 2022, 3:26 PM

#

Whatever, of course we made another piece of code that thrashes rust stuff lmao

haughty mica Jun 12, 2022, 3:26 PM

#

SB is a big tangled mess of state so yes you could do that but it's not easy and I'm trying to do it

west violet Jun 12, 2022, 3:27 PM

#

The last one we made was a program that took an hour to compile

#

In debug mode

tender nimbus Jun 12, 2022, 3:27 PM

#

why

haughty mica Jun 12, 2022, 3:27 PM

#

Sounds like a good benchmark

west violet Jun 12, 2022, 3:27 PM

#

I thought so too but no one took me up on it

tender nimbus Jun 12, 2022, 3:27 PM

#

was the bug fixed

haughty mica Jun 12, 2022, 3:27 PM

#

wg-compiler-perf didn't want it?

west violet Jun 12, 2022, 3:28 PM

#

tender nimbus was the bug fixed

Donno

tender nimbus Jun 12, 2022, 3:28 PM

#

or what caused this abomination

west violet Jun 12, 2022, 3:28 PM

#

Lemme find the issue

#

Maybe it was this? https://github.com/rust-lang/rust/issues/78925

GitHub

Heavy usage of traits & generics causes incredibly slow compile tim...

differential-datalog makes heavy use of timely-dataflow and differential-dataflow. When using this library, compile times are excessively slow (even when no changes are made to the library itself),...

tender nimbus Jun 12, 2022, 3:30 PM

#

lmao, llvm

#

llvm melobonk

#

well, also rustc melobonk

haughty mica Jun 12, 2022, 3:32 PM

#

Oh this is just big mono energy

#

So yes, but also boring

west violet Jun 12, 2022, 3:32 PM

#

Totally, it's still a really big issue though

#

Other teams were disliking us because of our compile times

haughty mica Jun 12, 2022, 3:32 PM

#

It's an issue with the architecture of your code

#

There's really not much the compiler could do to help you here

west violet Jun 12, 2022, 3:33 PM

#

Yah, probably fair

#

It's basically all because of timely

haughty mica Jun 12, 2022, 3:33 PM

#

momo is supposed to maybe help with some code like this

#

Anyway this isn't about UB melobonk

tender nimbus Jun 12, 2022, 3:35 PM

#

the first timely test already times out in miri ferrisForgor

west violet Jun 12, 2022, 3:35 PM

#

lmao

#

You'll find plenty of ub there, have fun

haughty mica Jun 12, 2022, 3:35 PM

#

timely uses abomonation

west violet Jun 12, 2022, 3:35 PM

#

Timely also has its own ub apart from abomonation

#

The consolidation code is ub off the top of my head

tender nimbus Jun 12, 2022, 3:36 PM

#

https://github.com/TimelyDataflow/timely-dataflow/issues/433 ferrisClueless

haughty mica Jun 12, 2022, 3:36 PM

#

"potential" unaligned memory access

tender nimbus Jun 12, 2022, 3:36 PM

#

no response for over 6 months

haughty mica Jun 12, 2022, 3:36 PM

#

That's typical

west violet Jun 12, 2022, 3:37 PM

#

Frank doesn't care

haughty mica Jun 12, 2022, 3:37 PM

#

materialize isn't getting owned so...

west violet Jun 12, 2022, 3:37 PM

#

Hum?

haughty mica Jun 12, 2022, 3:37 PM

#

It's not a security issue

west violet Jun 12, 2022, 3:38 PM

#

Ahh gotcha

#

I mean, abomonation is a massive issue if he ever wants to actually utilize timely's distribution mechanics

#

It will shit the bed, I've tried before

haughty mica Jun 12, 2022, 3:39 PM

#

I honestly do not understand the use case for all this

west violet Jun 12, 2022, 3:39 PM

#

For timely?

haughty mica Jun 12, 2022, 3:39 PM

#

I've worked in code that needs to pump data to a serialization format and looking at the system holistically I would never use abomonation because that doesn't help with my hot path

west violet Jun 12, 2022, 3:39 PM

#

Oh yah, same

haughty mica Jun 12, 2022, 3:40 PM

#

tender nimbus the first `timely` test already times out in miri ferrisForgor

times out?
btw I'm running it and this is in the interpreter not SB lol

west violet Jun 12, 2022, 3:40 PM

#

It's vaguely hot since it's used within exchange operators, but they're minuscule in comparison to building/maintaining indexes and anything using indexes like joins or aggregation

haughty mica Jun 12, 2022, 3:41 PM

#

For me (and maybe this is just because of our cost structure) arranging the data to be compressible and compressing it always dominates feeding it to a serializer

west violet Jun 12, 2022, 3:41 PM

#

Also shilling dbsp, we're faster and use less memory than timely, and we actually have theory & math behind our stuff that's understandable

west violet Jun 12, 2022, 3:42 PM

#

haughty mica For me (and maybe this is just because of our cost structure) arranging the data...

Yep, most definitely

west violet Jun 12, 2022, 3:42 PM

#

west violet Also shilling dbsp, we're faster and use less memory than timely, and we actuall...

We're also working on disk-backed persistence and out-of-core indexes, meaning we can work over more data than main memory can hold which timely can't do since it's 100% in-memory

tender nimbus Jun 12, 2022, 3:48 PM

#

west violet Also shilling dbsp, we're faster and use less memory than timely, and we actuall...

but how can you be faster than timely, you don't even do horrible ub??? 🚀

haughty mica Jun 12, 2022, 3:49 PM

#

Still interning a lot of types

#

Doing a bit of SB now though

west violet Jun 12, 2022, 3:49 PM

#

tender nimbus but how can you be faster than timely, you don't even do horrible ub??? 🚀

bors

#

Could my miri thing be because of thread::yield_now()?

#

Maybe that's screwing with miri

haughty mica Jun 12, 2022, 3:56 PM

#

What is your miri thing

west violet Jun 12, 2022, 3:56 PM

#

It taking multiple hours?

haughty mica Jun 12, 2022, 3:56 PM

#

No, that's just Miri being slow and SB being ~infinitely slow

west violet Jun 12, 2022, 3:56 PM

#

Ah gotcha

haughty mica Jun 12, 2022, 3:56 PM

#

You need to shrink the working set of the test or cfg it out

tender nimbus Jun 12, 2022, 3:57 PM

#

usually, there is literally nothing behind miri being slow
just.. miri being slow

west violet Jun 12, 2022, 3:57 PM

#

I'm decreasing the number of rounds it does

tender nimbus Jun 12, 2022, 3:57 PM

#

reducing the work miri will have to do will make the test faster
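
A minimal sketch of both options, shrinking the working set or skipping the test under Miri. `rounds` and `workload` are hypothetical placeholders, and the numbers are arbitrary. `cfg!(miri)` and `#[cfg_attr(miri, ignore)]` rely on the `miri` cfg that cargo-miri sets.

```rust
/// Hypothetical helper: far fewer rounds when running under Miri.
pub fn rounds() -> usize {
    if cfg!(miri) {
        4
    } else {
        2048
    }
}

/// Placeholder for the real test body: does `rounds` units of work.
pub fn workload(rounds: usize) -> usize {
    (0..rounds).sum()
}

#[cfg(test)]
mod tests {
    use super::*;

    // Option 1: same test everywhere, smaller working set under Miri.
    #[test]
    fn exchange_scaled() {
        let n = rounds();
        assert_eq!(workload(n), n * (n - 1) / 2);
    }

    // Option 2: skip the test entirely under Miri.
    #[test]
    #[cfg_attr(miri, ignore)]
    fn exchange_full() {
        assert_eq!(workload(2048), 2048 * 2047 / 2);
    }
}
```

Scaling is usually the nicer of the two, since an ignored test can no longer report UB at all.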

haughty mica Jun 12, 2022, 5:11 PM

#

The dbsp SRW blocks are thousands of tags long notlikethis

west violet Jun 12, 2022, 5:12 PM

#

You're welcome for your bench case ferrisClueless

golden summit Jun 12, 2022, 8:47 PM

#

golden summit Almost done

I'm gonna open prs now

#

to std and miri

haughty mica Jun 13, 2022, 12:20 AM

#

Pog

golden summit Jun 13, 2022, 5:26 AM

#

https://github.com/rust-lang/miri/pull/2231 and https://github.com/rust-lang/rust/pull/98042

GitHub

Windows thread support by DrMeepster · Pull Request #2231 · rust-la...

This PR adds support for threads on Windows. Thread parking requires a change to std: rust-lang/rust#98042.

GitHub

Fix compat_fn option method on miri by DrMeepster · Pull Request #9...

This change is required to make WaitOnAddress work with rust-lang/miri#2231

tender nimbus Jun 13, 2022, 5:29 AM

#

🎉

knotty oar Jun 13, 2022, 5:38 AM

#

golden summit to std and miri

ye on it 😛

#

assigned it to miri chief

neon tiger Jun 13, 2022, 7:01 PM

#

west violet Minimal work done, maximal allowed concurrency

https://github.com/kolloch/crate2nix may get you this

GitHub

GitHub - kolloch/crate2nix: nix build file generator for rust crates

nix build file generator for rust crates. Contribute to kolloch/crate2nix development by creating an account on GitHub.

#

caveat: it's Nix, so you'll have to dick around with it forever to get it to work

west violet Jun 13, 2022, 7:01 PM

#

lol

neon tiger Jun 13, 2022, 7:01 PM

#

but it will get you obviously-sound per-crate caching

#

obviously because the cache is keyed by all inputs, by construction

west violet Jun 13, 2022, 7:02 PM

#

That could be good, yah

#

And I guess nix would help with the volunteer problem, people could just spin it up

haughty mica Jun 13, 2022, 7:07 PM

#

I'm now obsessing about a good way to share build artifacts, thanks yall

haughty mica Jun 13, 2022, 11:43 PM

#

My shitty miri tools repo is getting stars

#

Why is the world like this

tender nimbus Jun 14, 2022, 4:20 AM

#

because it's cool

tough leaf Jun 14, 2022, 12:00 PM

#

flurry has UB and the only reason it wasn't detected is because they cfg_attr(miri, ignore) some tests

#

:)

#

i'll look into this

#

it's at least a fresh form of UB (deallocation with wrong layout)

#

wait

#

it's a seize issue?

#

hmmm

#

yeah, seize's tests fail miri

#

which is weird since they have miri in CI

tender nimbus Jun 14, 2022, 12:05 PM

#

@manic tangle ferrisBorrowCheck

#

someone seized your soundness

tough leaf Jun 14, 2022, 12:08 PM

#

well
maybe
i'm still looking into this
but yeah weird

#

oh
wtf
this is allocating a vec and then deallocating it with Box::from_raw ??

#

no

tender nimbus Jun 14, 2022, 12:13 PM

#

hmm

#

this is obviously extremely cursed

#

but is it actually not allowed?

tough leaf Jun 14, 2022, 12:13 PM

#

is what not allowed

#

wait

#

why is this making a vec

#

fn allocate_bucket<T>(size: usize) -> *mut Entry<T> {
    Box::into_raw(
        (0..size)
            .map(|_| Entry::<T> {
                present: AtomicBool::new(false),
                value: UnsafeCell::new(MaybeUninit::uninit()),
            })
            .collect(),
    ) as *mut _
}

#

can you collect into a box?

tender nimbus Jun 14, 2022, 12:14 PM

#

wtf is this

tough leaf Jun 14, 2022, 12:14 PM

#

yes you can

tender nimbus Jun 14, 2022, 12:15 PM

#

you can collect into str and [T]

#

lmao

tough leaf Jun 14, 2022, 12:15 PM

#

impl<I> FromIterator<I> for Box<[I]> {
    fn from_iter<T: IntoIterator<Item = I>>(iter: T) -> Self {
        iter.into_iter().collect::<Vec<_>>().into_boxed_slice()
    }
}

tender nimbus Jun 14, 2022, 12:15 PM

#

lmao

tough leaf Jun 14, 2022, 12:15 PM

#

okay so

#

is Entry dynamically sized
how does it know the length

#

it is not

#

oh

#

is this

#

okay

#

it's allocating a *mut [Entry<T>] (or similar)

#

where the size is not 1

#

and then it's deallocating it as a *mut Entry<T>

pastel lily Jun 14, 2022, 12:17 PM

#

ah

tough leaf Jun 14, 2022, 12:17 PM

#

which only deallocates one item

#

maybe?

tender nimbus Jun 14, 2022, 12:17 PM

#

tf

tough leaf Jun 14, 2022, 12:17 PM

#

and that doesn't blow up in practice

#

because malloc/free does not care

#

about layout

tender nimbus Jun 14, 2022, 12:18 PM

#

that's pretty cursed

tough leaf Jun 14, 2022, 12:18 PM

#

but miri does

#

okay so

#

we do know how big the bucket we just made is

#

where is allocate_bucket used
i might just change that to return *mut [Entry<T>]

#

and then cast that to a *mut Entry<T> as needed

#

hmmm

#

nah i'll just deallocate using thread.bucket_size

#

how do you deallocate a boxed slice from a pointer

tender nimbus Jun 14, 2022, 12:20 PM

#

raw dealloc ferrisBanne

#

or make a boxed slice

#

and let box drop it

tough leaf Jun 14, 2022, 12:21 PM

#

don't i need ptr metadata for that

tender nimbus Jun 14, 2022, 12:21 PM

#

ptr::slice_from_raw_parts

tough leaf Jun 14, 2022, 12:21 PM

#

ah

#

okay that makes that test pass

#

good
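
A sketch of the fix being discussed, assuming a simplified non-generic `Entry` and a caller that still knows the bucket size. `deallocate_bucket` is a hypothetical name, not the crate's actual function. The bucket is allocated as a boxed slice, so it has to be rebuilt as a `Box<[Entry]>` with the same length before dropping; `ptr::slice_from_raw_parts_mut` supplies the missing length metadata.

```rust
use std::ptr;

/// Simplified stand-in for the real entry type.
pub struct Entry {
    pub value: u64,
}

/// Allocate `size` entries as a boxed slice and hand out a thin pointer.
pub fn allocate_bucket(size: usize) -> *mut Entry {
    let boxed: Box<[Entry]> = (0..size).map(|i| Entry { value: i as u64 }).collect();
    Box::into_raw(boxed) as *mut Entry
}

/// # Safety
/// `bucket` must come from `allocate_bucket(size)` with the same `size`
/// and must not have been freed already.
pub unsafe fn deallocate_bucket(bucket: *mut Entry, size: usize) {
    // Wrong: `drop(Box::from_raw(bucket))` would free with the layout of a
    // single `Entry`, not the layout the allocation was made with.
    let slice: *mut [Entry] = ptr::slice_from_raw_parts_mut(bucket, size);
    // SAFETY: per the contract above, this is exactly the boxed slice that
    // `allocate_bucket` leaked.
    drop(unsafe { Box::from_raw(slice) });
}
```

Usage is `let b = allocate_bucket(n);` followed later by `unsafe { deallocate_bucket(b, n) }` with the same `n`.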

tender nimbus Jun 14, 2022, 12:23 PM

#

ferrisRelieved

#

currently looking at some cursed alacritty shit

#

they read pointer bytes as normal bytes

#

not good

tough leaf Jun 14, 2022, 12:23 PM

#

i might need to pass -Zmiri-allow-ptr-int-transmute for this to pass

#

since we might be putting pointer bytes in atomics

#

(integer atomics)

#

yep

tender nimbus Jun 14, 2022, 12:24 PM

#

a lot better than the issue before

tough leaf Jun 14, 2022, 12:24 PM

#

or not, actually?

tender nimbus Jun 14, 2022, 12:24 PM

#

oh nice

tough leaf Jun 14, 2022, 12:24 PM

#

AtomicPtr is used

#

but i'm getting an "invalid pointer" error

tender nimbus Jun 14, 2022, 12:24 PM

#

😵‍💫

tough leaf Jun 14, 2022, 12:25 PM

#

strange

#

well
do the rest of the tests pass allowing ptr-int transmute

#

they do not

#

data race

#

deallocate / read

#

race

tender nimbus Jun 14, 2022, 12:28 PM

#

ah, this uses a custom qword memcpy
but using usize instead of MU<usize>
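
A minimal sketch of the `MU<usize>` variant, assuming both buffers are already word-aligned and equally long. `copy_words` is a hypothetical helper, not alacritty's code. Copying at type `MaybeUninit<usize>` never asserts that a word is an initialized plain integer, so padding, uninit bytes, or pointer bytes can pass through.

```rust
use std::mem::MaybeUninit;

/// Word-wise copy that makes no claim about what the words contain.
pub fn copy_words(dst: &mut [MaybeUninit<usize>], src: &[MaybeUninit<usize>]) {
    assert_eq!(dst.len(), src.len());
    for (d, s) in dst.iter_mut().zip(src) {
        // `MaybeUninit<usize>` is `Copy`; this moves the bytes as-is.
        *d = *s;
    }
}
```

The same loop written over `usize` would be a typed read of every word as an integer, which is what Miri objects to when the bytes are uninit or came from a pointer.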

tough leaf Jun 14, 2022, 12:47 PM

#

MIRIFLAGS="-Zmiri-disable-isolation -Zmiri-allow-ptr-int-transmute -Zmiri-disable-weak-memory-emulation -Zmiri-ignore-leaks"

#

tests pass with this

haughty mica Jun 14, 2022, 12:51 PM

#

Could be the weak memory emulation bug again

tough leaf Jun 14, 2022, 12:51 PM

#

is that a stdlib bug or a miri bug

haughty mica Jun 14, 2022, 12:52 PM

#

Miri

#

https://github.com/rust-lang/miri/issues/2223

GitHub

Review Linux futex implementation for weak memory · Issue #2223 · r...

This appears to be the smoking gun of std::sync::mpsc matklad/once_cell#182 and crossbeam channel rust-lang/rust#55005 (comment) deadlocks miri/src/shims/unix/linux/sync.rs Lines 129 to 148 in 32a7...

tender nimbus Jun 14, 2022, 1:09 PM

#

if we implement memcpy using MaybeUninit, is it mumcpy?

knotty oar Jun 14, 2022, 1:18 PM

#

~~it's MaybeCopy~~

tender nimbus Jun 14, 2022, 1:28 PM

#

ferrisClueless

manic tangle Jun 14, 2022, 1:29 PM

#

tender nimbus @manic tangle ferrisBorrowCheck

ferrisThonk

tough leaf Jun 14, 2022, 1:30 PM

#

hi
miri's complaining about the Box::from_raw

#

i can get (current) miri to pass with no errors but only if i enable like 3 ignores

manic tangle Jun 14, 2022, 1:31 PM

#

that part of the code is pretty much just vendoring the thread-local crate

tough leaf Jun 14, 2022, 1:31 PM

#

ah so thread-local is broken too?
