Iris - A Journey through OpenGL and beyond to learn Graphics | Graphics Programming | Page 22

fiery bolt Aug 20, 2024, 2:59 AM

#

1 ms for culling

#

and about 9 ms for raster

#

i need software raster now

fiery bolt Aug 20, 2024, 6:57 AM

#

ok holy fuck it's rendering 1.2 million meshlets that doesn't sound good bleaker_kekw

#

same scene in unreal takes 1.92 ms

wispy spear Aug 20, 2024, 10:26 AM

#

https://tenor.com/view/true-incredible-water-gif-23861053


glass sphinx Aug 20, 2024, 11:15 AM

#

nice culling hole 😈

#

very impressive tho

#

is this lodding also?

fiery bolt Aug 20, 2024, 4:07 PM

#

glass sphinx is this lodding also?

yeah

primal shadow Aug 23, 2024, 4:58 AM

#

@wicked notch what heuristic did you use for software raster vs hardware raster again? I'm having a hard time figuring out a good metric. Nanite briefly mentions that they compute some kind of longest triangle edge data per cluster and use that iirc.

fiery bolt Aug 23, 2024, 6:08 AM

#

project that to screenspace and software raster if it's less than 16 pixels or something iirc?
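A minimal sketch of the heuristic described in this message, assuming each cluster stores its longest triangle edge in world units, the camera uses a symmetric perspective projection, and the cutoff is the "16 pixels or something" figure quoted here. The function names and the constant are hypothetical and are not taken from Nanite.

```rust
/// Approximate on-screen length, in pixels, of an edge of world-space length
/// `edge_len` seen at `view_depth` units in front of the camera.
/// `fov_y` is the vertical field of view in radians.
fn projected_edge_pixels(edge_len: f32, view_depth: f32, fov_y: f32, screen_height: f32) -> f32 {
    // cot(fov/2) maps view-space size at unit depth to NDC units;
    // NDC spans 2 units across the screen height, hence the half height.
    let cot_half_fov = 1.0 / (fov_y * 0.5).tan();
    let depth = view_depth.max(f32::EPSILON); // avoid dividing by zero
    edge_len * cot_half_fov / depth * (screen_height * 0.5)
}

/// Assumed cutoff: clusters whose longest edge projects below this go to
/// the software rasterizer, everything else to the hardware path.
const SW_RASTER_MAX_EDGE_PIXELS: f32 = 16.0;

fn use_software_raster(longest_edge: f32, view_depth: f32, fov_y: f32, screen_height: f32) -> bool {
    projected_edge_pixels(longest_edge, view_depth, fov_y, screen_height)
        < SW_RASTER_MAX_EDGE_PIXELS
}
```

The cutoff is the part worth tuning per GPU; the projection itself is only an estimate because it ignores edge orientation.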

primal shadow Aug 29, 2024, 6:18 AM

#

For now I've just given up on tweaking it, left it as something silly, and have moved on to other bits

#

Apparently Brian Karis read my blog post though, awesome!

wicked notch Aug 29, 2024, 10:27 AM

#

if we never hear from you again, UE has won

primal shadow Aug 29, 2024, 5:20 PM

#

He said he enjoyed it 😅

fiery bolt Aug 31, 2024, 5:18 AM

#

~~unlike that tencent talk~~

runic surge Aug 31, 2024, 5:29 AM

#

primal shadow He said he enjoyed it 😅

Jasmine let us know if Epic Games sends their hit agents against you

faint crane Aug 31, 2024, 6:20 AM

#

I saw both of Tencent's talks between Advances and Moving Mobile Graphics but couldn't understand most of it.

#

Wish I got a picture, but Jasmine's article was linked in Advances. Slides are up, let me link it.

#

https://advances.realtimerendering.com/s2024/index.html#hable

#

fiery bolt Aug 31, 2024, 6:48 AM

#

ok so I just read the tencent slides and... it's just nanite but worse?

#

it's minimum effort nanite reimpl

#

instance cull, parallelize all meshlets, no software raster, and some wacky lod curve

faint crane Aug 31, 2024, 7:34 AM

#

Could the lod curve somehow compensate for lack of software raster? They were targeting mid-high end mobile.

wispy spear Aug 31, 2024, 10:09 AM

#

@solar sentinel's too : )

#

our frogs invading the world

wicked notch Aug 31, 2024, 11:10 AM

#

faint crane Could the lod curve somehow compensate for lack of software raster? They were ta...

if you don't target pixel sized triangles then yeah, sw raster becomes unnecessary

#

buuut the benefits of sw raster are elsewhere too

#

for example with VSM, sw raster is pretty much the greatest source of speedup

delicate rain Aug 31, 2024, 11:24 AM

#

lies, culling is

wicked notch Aug 31, 2024, 11:25 AM

#

culling is the second greatest source of speedup KEKW

#

nah saky's right it's culling first and then sw rast

#

but the advantage of not going through the regular pipeline for triangles is massive

delicate rain Aug 31, 2024, 11:26 AM

#

I was this close to getting into another insane argument

wicked notch Aug 31, 2024, 11:26 AM

#

especially for small ones

delicate rain Aug 31, 2024, 11:26 AM

#

no more

wicked notch Aug 31, 2024, 11:26 AM

#

wink wink bistro trees

delicate rain Aug 31, 2024, 11:26 AM

#

bistro trees bleakekw

wicked notch Aug 31, 2024, 11:26 AM

#

I genuinely believe that sw rast for bistro trees would fix everything

#

but alas

#

I still don't have nanite shippable bleakekw

delicate rain Aug 31, 2024, 11:27 AM

#

my bsm is also still borked

#

I've been distracted by shiny rays

wicked notch Aug 31, 2024, 11:27 AM

#

same

solar sentinel Aug 31, 2024, 5:15 PM

#

wispy spear @solar sentinel's too : )

Hey deccer, just jumping in... what's going on? 🙂

#

I'm doing hardware raster as well. Software raster doesn't pay off for anything larger than a pixel sized triangle. In the UE source code, it's literally called MicroPoly raster (...for a reason). There's overshading to compute hardware gradients. Even Nanite bins and rasterizes wider triangles via HW.

fiery bolt Aug 31, 2024, 5:23 PM

#

nanite does triangles smaller than 16 pixels or so with SW

#

not just pixel-sized

wispy spear Aug 31, 2024, 5:27 PM

#

solar sentinel Hey deccer, just jumping in... what's going on? 🙂

your name was part of the references on some siggraph presentation, two or three messages up from when i booped you

solar sentinel Aug 31, 2024, 5:28 PM

#

Holy shit

solar sentinel Aug 31, 2024, 5:28 PM

#

wispy spear your name was part of references on some siggraph presentation, two or three mes...

I didn't even realize that. People are referencing my reddit posts. Wow.

solar sentinel Aug 31, 2024, 5:30 PM

#

fiery bolt nanite does triangles smaller than 16 pixels or so with SW

Granted, their limits are a bit fuzzy... but beyond a certain size the hardware is faster.

primal shadow Aug 31, 2024, 6:15 PM

#

solar sentinel I'm doing hardware raster as well. Software raster doesn't pay off for anything ...

What kind of HW raster? Mesh shaders or?

#

I still haven't figured out a good heuristic for SW vs HW. NSight shows like +/- 0.5ms all the time for the raster pass, making measuring perf impossible. Idk how to improve when the results are so variable.

frank sail Aug 31, 2024, 6:17 PM

#

Make the test scene bigger

solar sentinel Aug 31, 2024, 6:23 PM

#

primal shadow What kind of HW raster? Mesh shaders or?

Nah my target minspec is 1050Ti. Just regular ole'. Though I do frustum/HiZ culling compute and write out MultiDrawIndirectCount params into a buf.

solar sentinel Aug 31, 2024, 6:24 PM

#

primal shadow I still haven't figured out a good heuristic for SW vs HW. NSight shows like +/...

There are some great high poly megascans of giant buildings on Turbosquid... probably have to dish out a few bucks, but worth it. Take those and just replicate over and over side by side.

primal shadow Aug 31, 2024, 6:27 PM

#

frank sail Make the test scene bigger

I tried with like 10 copies of the quixel megascan icelandic cliffs. Maybe that's not enough??? idk

wicked notch Aug 31, 2024, 6:27 PM

#

more.meme.gif

primal shadow Aug 31, 2024, 6:28 PM

#

solar sentinel Nah my target minspec is 1050Ti. Just regular ole'. Though I do frustum/HiZ cull...

1 draw indirect args per cluster? How do you not get super bottlenecked on the command processor? I had tried this before, it was fairly slow

primal shadow Aug 31, 2024, 6:29 PM

#

solar sentinel There are some great high poly megascans of giant buildings on Turbosquid... probabl...

Oh nice, thanks for the tip

solar sentinel Aug 31, 2024, 6:29 PM

#

primal shadow 1 draw indirect args per cluster? How do you not get super bottlenecked on the c...

Not clustered, complete instances.

#

But total number of issued calls is once per material.

primal shadow Aug 31, 2024, 6:31 PM

#

solar sentinel Not clustered, complete instances.

But you have clusters(?)

solar sentinel Aug 31, 2024, 6:31 PM

#

primal shadow But you have clusters(?)

whole-ass instances of geometry. Not limited to ~128 triangles.

#

I'm not doing Nanite.

#

I'm doing visibility buffer rendering only 🙂

primal shadow Aug 31, 2024, 6:35 PM

#

Ohhh ok

fiery bolt Aug 31, 2024, 7:53 PM

#

primal shadow I tried with like 10 copies of the quixel megascan icelandic cliffs. Maybe that'...

how many tris in one of them

#

my stress test scene has like 100 billion tris total bleakekw

#

spends about 10 ms in raster (hw, no sw)

#

well, i impld sw inside my mesh shader and that bumped it up to 20 ms lol

primal shadow Aug 31, 2024, 8:18 PM

#

Not home rn, will check later

fiery bolt Sep 1, 2024, 4:39 PM

#

@wicked notch I sped up mesh generation significantly by generating meshlet AABBs correctly instead of going through every single vertex in the mesh bleaker_kekw

wicked notch Sep 1, 2024, 4:39 PM

#

crazy how that works KEKW

fiery bolt Sep 1, 2024, 4:40 PM

#

it even speeds up rendering because now meshlets are being culled!
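A minimal sketch of the fix described above, assuming a meshlet is represented by a list of indices into the mesh's position array. The bounds are built from the meshlet's own vertices only, so the box is both cheaper to compute and tight enough for culling to reject it. `Aabb` and `meshlet_aabb` are hypothetical names, not taken from the linked code.

```rust
/// Axis-aligned bounding box as (min corner, max corner).
type Aabb = ([f32; 3], [f32; 3]);

/// Bounds of one meshlet, visiting only the vertices that meshlet references
/// rather than every vertex in the mesh. Returns `None` for an empty meshlet.
fn meshlet_aabb(positions: &[[f32; 3]], meshlet_vertices: &[u32]) -> Option<Aabb> {
    let mut points = meshlet_vertices.iter().map(|&i| positions[i as usize]);
    let first = points.next()?;
    Some(points.fold((first, first), |(mut min, mut max), p| {
        for axis in 0..3 {
            min[axis] = min[axis].min(p[axis]);
            max[axis] = max[axis].max(p[axis]);
        }
        (min, max)
    }))
}
```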

velvet marsh Sep 1, 2024, 7:20 PM

#

is your AABB structure something you wrote yourself?

fiery bolt Sep 1, 2024, 9:47 PM

#

yeah

#

i wrote some goofy bvh too

#

it... works?

#

the nanite presentation glossed over how it works so idk how correct i am

primal shadow Sep 1, 2024, 10:22 PM

#

@fiery bolt I forget who is who, did you do the WebGPU Nanite impl?

#

(Scthe on github)

fiery bolt Sep 1, 2024, 10:23 PM

#

nope not me

#

i'm doing it in rust and vulkan

primal shadow Sep 1, 2024, 11:34 PM

#

@fiery bolt is your code open source anywhere? I'd like to check out your meshlet DAG building/simplification code

fiery bolt Sep 1, 2024, 11:35 PM

#

primal shadow @fiery bolt is your code open source anywhere? I'd like to check out y...

https://github.com/SparkyPotato/radiance/blob/5d6f6b51a6698b239c1f094205a20a2e9b7cf417/crates/lib/asset/src/import/mesh/mod.rs


#

i think i'm using meshopt for simplification currently, not my own thing

#

changed to meshopt to debug something then forgor to switch back KEKW

primal shadow Sep 1, 2024, 11:37 PM

#

Looks... very similar to my own code 😅

#

Did you base it off of mine? (which is totally fine, just curious what your process was)

fiery bolt Sep 1, 2024, 11:38 PM

#

primal shadow Did you base it off of mine? (which is totally fine, just curious what your proc...

i did yeah

primal shadow Sep 1, 2024, 11:38 PM

#

Nice, happy it helped

fiery bolt Sep 1, 2024, 11:38 PM

#

ferrisOwO

primal shadow Sep 1, 2024, 11:39 PM

#

If you end up learning anything, let me know please!

#

So far today's experiments have revealed:

Setting target error = 1.0 helps a lot. No reason to limit the target error.
255 v / 128 t is better than 64/64 (meshopt won't let me do 256 vertices :P)

fiery bolt Sep 1, 2024, 11:40 PM

#

yeah i did the first one

#

and nanite does the latter

#

i didn't do the latter exactly because meshopt doesn't like 256 lol

#

i do 128/128

#

well, 128/124 because mesh shaders

primal shadow Sep 1, 2024, 11:41 PM

#

I got better triangle fill rate with 255/128 vs 128/128

fiery bolt Sep 1, 2024, 11:42 PM

#

what's your test model?

primal shadow Sep 1, 2024, 11:42 PM

#

Stanford dragon, currently

fiery bolt Sep 1, 2024, 11:42 PM

#

that is shade flat by default so you have zero vertex reuse

#

i had that issue with all the three stanford models

primal shadow Sep 1, 2024, 11:42 PM

#

I have:

Stanford dragon
Stanford bunny
Jinx from Arcane from Sketchfab merged down to 1 mesh (I don't really support multiple materials per meshlet mesh)
Icelandic cliffs from Quixel Megascans

primal shadow Sep 1, 2024, 11:43 PM

#

fiery bolt that is shade flat by default so you have zero vertex reuse

wdym?

fiery bolt Sep 1, 2024, 11:43 PM

#

every tri has unique vertices

#

the bunny too

#

oh yeah i also use the max of error instead of adding the max child

fiery bolt Sep 1, 2024, 11:44 PM

#

fiery bolt the bunny too

(and lucy)

primal shadow Sep 1, 2024, 11:46 PM

#

fiery bolt oh yeah i also use the max of error instead of adding the max child

I was doing that originally, but then your error is not cumulative between LODs

#

So if LOD 0 has 0 error, LOD 1 has 10 error, and LOD 2 has 20 error

#

LOD 2's total error should be 30 relative to the base mesh (LOD 0)

fiery bolt Sep 1, 2024, 11:47 PM

#

not really

#

because it's a world space displacement

#

higher LODs will naturally have a higher error

primal shadow Sep 1, 2024, 11:48 PM

#

Why? And that's tangential, no?

fiery bolt Sep 1, 2024, 11:48 PM

#

lemme try to ms paint something

primal shadow Sep 1, 2024, 11:48 PM

#

You don't want to know error relative to the previous LOD, you want to know error relative to the original mesh right?

fiery bolt Sep 1, 2024, 11:49 PM

#

actually no idk how to draw it lmao

#

basically, when you simplify, you're collapsing edges into vertices

#

and the simplification error is the maximum displacement a vertex moved (sort of)

#

as you simplify more, edges get longer

#

so when you collapse, you get a higher error naturally

primal shadow Sep 1, 2024, 11:51 PM

#

Sure, and that's a consistent metric for DAG cut purposes, yes

fiery bolt Sep 1, 2024, 11:51 PM

#

i think this would lead to double-counting if you add

#

making your error higher than it should be

primal shadow Sep 1, 2024, 11:51 PM

#

But that means your error projection is no longer saying "is this LOD imperceptible from the base mesh at this distance"

fiery bolt Sep 1, 2024, 11:51 PM

#

the nanite presentation says they max, not add iirc

#

so in this case, if you're collapsing the red edge

#

you get this as the output

#

then you collapse this

#

leading to this

primal shadow Sep 1, 2024, 11:55 PM

#

Comparison of 255/128 vs 128/128 on icelandic cliffs btw: https://paste.rs/AFqkf.txt. Meshlet occupancy is a map of triangles_per_meshlet:count_of_meshlets key:value

fiery bolt Sep 1, 2024, 11:55 PM

#

the total displacement in the two simplification steps is equal to half the length

#

which is equivalent to the second collapse error

primal shadow Sep 1, 2024, 11:56 PM

#

can you rephrase that last bit?

#

I understand the images, but trying to understand the implications still

fiery bolt Sep 1, 2024, 11:57 PM

#

so if you compare the first and last image, the vertex in the center has moved half the total length, right?

#

that's your final error of the simplification

primal shadow Sep 1, 2024, 11:58 PM

#

That's true

#

Ok makes sense to me then

fiery bolt Sep 1, 2024, 11:58 PM

#

yeah, that's equivalent to the displacement done by the second collapse

primal shadow Sep 1, 2024, 11:59 PM

#

Also I reread the nanite slides rq

#

They say they do the max of child's parent error for the BVH

#

They don't mention how they handle choosing the parent error in the first place though

fiery bolt Sep 1, 2024, 11:59 PM

#

the parent error bit is higher

#

slide 66

primal shadow Sep 2, 2024, 12:01 AM

#

?

fiery bolt Sep 2, 2024, 12:02 AM

#

yeah at least as large implies max imo

#

@wicked notch what does nanite do

#

KEKW

wicked notch Sep 2, 2024, 12:03 AM

#

it's a leq comparison

#

so it doesn't matter

#

if parent >= threshold && current <= threshold
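A minimal sketch of the cut test in this message, assuming both errors have already been projected to the same screen-space units as the threshold. The struct and function names are hypothetical. The parent comparison is written as a strict `>` here, which is a choice made for this sketch: it means that when a cluster's error equals its parent's error, only one of the two passes.

```rust
#[derive(Clone, Copy)]
struct ClusterErrors {
    /// Projected error of this cluster.
    error: f32,
    /// Projected error of the group this cluster was simplified into.
    parent_error: f32,
}

/// A cluster is drawn when it is accurate enough but its parent is not.
fn should_draw(c: ClusterErrors, threshold: f32) -> bool {
    c.error <= threshold && c.parent_error > threshold
}
```

Because errors only grow toward the root, exactly one cluster along any leaf-to-root chain passes this test for a given threshold, and each cluster can be tested independently.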

fiery bolt Sep 2, 2024, 12:03 AM

#

no for building error

#

is it max or add

wicked notch Sep 2, 2024, 12:04 AM

#

add I think

#

dont member, let me access the secret sauce (UE's source)

primal shadow Sep 2, 2024, 12:04 AM

#

fiery bolt yeah at least as large implies max imo

So if that's true, then it's this?:

LOD 0 clusters have error = 0.0, parent_error = INFINITY
Group and simplify LOD 0 to form a group
Group error = max(group_error_from_simplify, all_child_errors)
LOD 0 parent errors = group_error
LOD 1 error (not parent) = group_error
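The steps above can be sketched as follows, assuming the simplifier reports one error value per group. `Cluster`, `finish_group`, and `new_parent_cluster` are hypothetical names for illustration, not the layout used in any of the linked code.

```rust
struct Cluster {
    /// Error of this cluster relative to the original mesh.
    error: f32,
    /// Error of the group this cluster was simplified into.
    parent_error: f32,
}

/// Finishes one group: takes the max of the simplifier's error and every
/// child's error, then records that value as each child's parent error.
fn finish_group(children: &mut [Cluster], simplify_error: f32) -> f32 {
    let max_child = children.iter().map(|c| c.error).fold(0.0f32, f32::max);
    let group_error = simplify_error.max(max_child);
    for child in children.iter_mut() {
        child.parent_error = group_error;
    }
    group_error
}

/// Clusters split out of the simplified group inherit the group error and
/// have no parent yet.
fn new_parent_cluster(group_error: f32) -> Cluster {
    Cluster { error: group_error, parent_error: f32::INFINITY }
}
```

With the numbers from earlier in the thread (LOD 1 at 10, a second simplification reporting 20), this yields 20 for LOD 2 rather than 30, and the max keeps the error from ever decreasing up the hierarchy.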

fiery bolt Sep 2, 2024, 12:05 AM

#

yeah

wicked notch Sep 2, 2024, 12:06 AM

#

UE is dead

#

😔

fiery bolt Sep 2, 2024, 12:06 AM

#

froge_sad

fiery bolt Sep 2, 2024, 12:07 AM

#

fiery bolt yeah

i do that yep

#

why does megascans not have a way to sort by tri count smh

wicked notch Sep 2, 2024, 12:09 AM

#

it is indeed add

fiery bolt Sep 2, 2024, 12:09 AM

#

they add child errors to the parent error?

wicked notch Sep 2, 2024, 12:09 AM

#

also, 600 LOC function

#

in typical UE style

#

https://github.com/EpicGames/UnrealEngine/blob/release/Engine/Source/Developer/MeshBuilder/Private/StaticMeshBuilder.cpp#L330-L758

wicked notch Sep 2, 2024, 12:09 AM

#

fiery bolt they add child errors to the parent error?

they add the bounds but yes same thing in UE's case

#

the bounds of the group directly affect the error calculation

#

which is shrimply copied

fiery bolt Sep 2, 2024, 12:10 AM

#

me no understand

wicked notch Sep 2, 2024, 12:10 AM

#

let me show you da wae

#

actually I am retarded

#

yes it's max

#

https://github.com/EpicGames/UnrealEngine/blob/release/Engine/Source/Developer/NaniteBuilder/Private/Cluster.cpp#L229

fiery bolt Sep 2, 2024, 12:11 AM

#

lol

#

wtf is that formatting

wicked notch Sep 2, 2024, 12:12 AM

#

UE formatting

fiery bolt Sep 2, 2024, 12:12 AM

#

agonyfrog

wicked notch Sep 2, 2024, 12:12 AM

#

this carries over from 1996 code probably

#

still as beautiful as ever 30 years ago froge_love

primal shadow Sep 2, 2024, 12:12 AM

#

Where the heck does LODError come from?

fiery bolt Sep 2, 2024, 12:13 AM

#

i assume that's set to 0

primal shadow Sep 2, 2024, 12:13 AM

#

Seems like a field in Cluster

#

C++ has implicit this-> I guess

fiery bolt Sep 2, 2024, 12:14 AM

#

it does

#

does megascans not have a high poly source asset with more than 2 mil tris

wicked notch Sep 2, 2024, 12:15 AM

#

blender -> subdivision modifier -> simple

#

be careful because you can turn those 2 million triangles into 2 quintillion

#

speaking from experience bleakforg

fiery bolt Sep 2, 2024, 12:16 AM

#

also for runtime error calculation, you should store the group bounding sphere and place your error test sphere on that, instead of at the center of the group

#

@primal shadow ^

fiery bolt Sep 2, 2024, 12:16 AM

#

wicked notch blender -> subdivision modifier -> simple

but that's booooring

#

i'll just use lucy then

primal shadow Sep 2, 2024, 12:16 AM

#

fiery bolt also for runtime error calculation, you should store the group bounding sphere a...

Wdym?

fiery bolt Sep 2, 2024, 12:17 AM

#

currently you project a sphere with center = group center and radius = error, right?

primal shadow Sep 2, 2024, 12:17 AM

#

Yeah

fiery bolt Sep 2, 2024, 12:17 AM

#

that undercounts error for tris closer than the center

primal shadow Sep 2, 2024, 12:18 AM

#

?

fiery bolt Sep 2, 2024, 12:18 AM

#

you have triangles closer to the camera than the group center right

#

so if you test the error at the group center, those triangles might have an error higher than what you calculate

#

https://virtualglobebook.com/3DEngineDesignForVirtualGlobesSection121.pdf

#

page 11

primal shadow Sep 2, 2024, 12:21 AM

#

Wait so what's the solution?

#

Use the meshlet's center, and don't have any center for the group?

fiery bolt Sep 2, 2024, 12:22 AM

#

store the entire bounding sphere of the group

#

and place the error test sphere at the closest point to the camera

primal shadow Sep 2, 2024, 12:23 AM

#

Oh I see

#

Store the actual bounding sphere for the group, including radius

fiery bolt Sep 2, 2024, 12:23 AM

#

yeah

primal shadow Sep 2, 2024, 12:23 AM

#

Calculate closest point on the sphere to the camera

#

Then place the sphere at center = closest point, radius = error
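A minimal sketch of the placement just described, assuming positions are plain `[f32; 3]` arrays in one shared space. `error_sphere_center` is a hypothetical helper. It returns `None` when the camera is inside the bounding sphere, since the thread leaves that case unresolved.

```rust
type Vec3 = [f32; 3];

/// Center for the error test sphere: the point on the group's bounding
/// sphere that is nearest to the camera.
fn error_sphere_center(camera: Vec3, center: Vec3, radius: f32) -> Option<Vec3> {
    let to_camera = [
        camera[0] - center[0],
        camera[1] - center[1],
        camera[2] - center[2],
    ];
    let distance = (to_camera[0] * to_camera[0]
        + to_camera[1] * to_camera[1]
        + to_camera[2] * to_camera[2])
        .sqrt();
    if distance <= radius {
        return None; // camera inside or on the bounding sphere
    }
    // Walk from the center toward the camera by exactly one radius.
    let scale = radius / distance;
    Some([
        center[0] + to_camera[0] * scale,
        center[1] + to_camera[1] * scale,
        center[2] + to_camera[2] * scale,
    ])
}
```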

fiery bolt Sep 2, 2024, 12:23 AM

#

yep!

#

the only issue is that idk what to do when the camera is inside the sphere

#

bounding or error sphere

#

because then your sqrt(d^2 - r^2) gets a negative value in it

primal shadow Sep 2, 2024, 12:24 AM

#

Bleh that's going to be a much more involved change, I'll add this to the TODO list

fiery bolt Sep 2, 2024, 12:25 AM

#

fiery bolt because then your `sqrt(d^2 - r^2)` gets a negative value in it

if you clamp the value inside sqrt to 0, you get infinite projected error

#

which is sus
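A minimal sketch of where the square-root term appears, assuming one common form of the perspective projection of a sphere: projected radius = cot(fov/2) * r / sqrt(d^2 - r^2), with `d` the distance from the camera to the error sphere's center and `r` the error radius. Whether either project uses exactly this form is an assumption. `projected_error` is a hypothetical name.

```rust
/// Projected radius of the error sphere in NDC units.
/// `distance` is camera-to-sphere-center, `error_radius` is the sphere
/// radius, and `cot_half_fov` is 1 / tan(fov_y / 2).
fn projected_error(distance: f32, error_radius: f32, cot_half_fov: f32) -> f32 {
    let d2_minus_r2 = distance * distance - error_radius * error_radius;
    if d2_minus_r2 <= 0.0 {
        // Camera is inside or on the sphere. Clamping the term to zero and
        // dividing gives the same result; here it is made explicit.
        return f32::INFINITY;
    }
    cot_half_fov * error_radius / d2_minus_r2.sqrt()
}
```

An infinite projected error fails any finite threshold, so a cluster in that state always refines to its children. That is conservative, not a crash, but it does force maximum detail whenever the camera sits inside the sphere.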

primal shadow Sep 2, 2024, 12:26 AM

#

Agreed, hmm

#

Does this really matter though?

fiery bolt Sep 2, 2024, 12:28 AM

#

i'm not sure

primal shadow Sep 2, 2024, 12:28 AM

#

I mean yeah we want the LOD calculation to be as accurate as possible, but I feel like it's such a handwave to begin with...

fiery bolt Sep 2, 2024, 12:28 AM

#

idk what the source of my overculling is KEKW

#

it's probably occlusion culling

#

but it might be this

primal shadow Sep 2, 2024, 12:29 AM

#

My occlusion culling is just broken

#

Using SPD to build the depth pyramid is not correct unfortunately

fiery bolt Sep 2, 2024, 12:29 AM

#

i ported themaister's code and the hzb seems to be correct

#

but it's still broken

primal shadow Sep 2, 2024, 12:30 AM

#

Is that the modified SPD one? I wanted to use that

#

I have a question though

fiery bolt Sep 2, 2024, 12:30 AM

#

yeah

primal shadow Sep 2, 2024, 12:30 AM

#

Say your depth texture is a non-power-of-2

#

What size do you make mip 0 of the depth pyramid?

fiery bolt Sep 2, 2024, 12:31 AM

#

half of that

#

rounded down

primal shadow Sep 2, 2024, 12:33 AM

#

So 1800: 1800/2 = 900, rounded down to 512?

fiery bolt Sep 2, 2024, 12:33 AM

#

nono

#

just 900

#

by round down i meant if it's odd

primal shadow Sep 2, 2024, 12:33 AM

#

So you don't enforce that the new depth pyramid is a power of 2? Hmm

fiery bolt Sep 2, 2024, 12:33 AM

#

nope

primal shadow Sep 2, 2024, 12:33 AM

#

I think that may be wrong, 1s

#

https://github.com/Themaister/Granite/blob/93fa4b6c288be1ecf09646f58a16b4d531daa46f/tests/meshlet_viewer.cpp#L802

#

            (depth_view.get_view_width() + 63u) & ~63u,
            (depth_view.get_view_height() + 63u) & ~63u,

#

I think that rounds up to the nearest multiple of 64?
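A minimal sketch comparing the two sizing rules discussed here: halving the depth texture and rounding down, versus the bit trick from the quoted Granite lines, which rounds up to the next multiple of 64. Both helper names are hypothetical.

```rust
/// Rounds `x` up to the next multiple of 64 (unchanged if already aligned).
/// Adding 63 then clearing the low six bits is the `(x + 63u) & ~63u` idiom.
/// Note: overflows for values within 63 of `u32::MAX`.
fn align_up_64(x: u32) -> u32 {
    (x + 63) & !63
}

/// Half of `x`, rounded down, but never smaller than one texel.
fn half_rounded_down(x: u32) -> u32 {
    (x / 2).max(1)
}
```

For a 1800-wide depth texture, the first rule gives 1856 and the second gives 900, so the two approaches produce quite different pyramid sizes.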

fiery bolt Sep 2, 2024, 12:36 AM

#

yeah

primal shadow Sep 2, 2024, 12:36 AM

#

Are you doing that? That may be your issue if not

fiery bolt Sep 2, 2024, 12:37 AM

#

yeah i'm not

#

but why is he doing that

#

is it because each workgroup does a 64x64 block

#

and that he doesn't bound check in the shader thonk

primal shadow Sep 2, 2024, 12:38 AM

#

Probably? Idk

#

Not sure if it's bounds checks though

#

Could be correctness

#

Like, extra bounds checks probably matter a lot less than extra vram usage from this

fiery bolt Sep 2, 2024, 12:39 AM

#

i'll add that and check

primal shadow Sep 2, 2024, 12:39 AM

#

Hopefully you can figure it out, I don't really want to go stare at hiz code 😭

fiery bolt Sep 2, 2024, 12:39 AM

#

i'll let you know

primal shadow Sep 2, 2024, 12:41 AM

#

Wait before you start I have another question

#

When simplifying meshlet groups, which edges do you need to lock?

fiery bolt Sep 2, 2024, 12:42 AM

#

ones that are only a part of one tri

primal shadow Sep 2, 2024, 12:43 AM

#

elaborate?

fiery bolt Sep 2, 2024, 12:44 AM

#

uhhhh

#

you need to lock the border

primal shadow Sep 2, 2024, 12:44 AM

#

Which border though?

fiery bolt Sep 2, 2024, 12:44 AM

#

of the group

primal shadow Sep 2, 2024, 12:44 AM

#

The border of the group, or border between meshlets?

fiery bolt Sep 2, 2024, 12:44 AM

#

you don't want to lock between meshlets no

#

only the group

#

that's the secret sauce to nanite
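A minimal sketch of the border rule from this exchange, assuming the group's meshlets have been merged into one index buffer that shares vertex indices across meshlets. An edge used by only one triangle in that buffer lies on the group border, and its two vertices are the ones to lock during simplification. `group_border_vertices` is a hypothetical name.

```rust
use std::collections::{BTreeSet, HashMap};

/// Returns the vertices on the border of a group, given the group's merged
/// triangle list (three indices per triangle).
fn group_border_vertices(indices: &[u32]) -> BTreeSet<u32> {
    // Count how many triangles use each undirected edge.
    let mut edge_uses: HashMap<(u32, u32), u32> = HashMap::new();
    for tri in indices.chunks_exact(3) {
        for k in 0..3 {
            let (a, b) = (tri[k], tri[(k + 1) % 3]);
            *edge_uses.entry((a.min(b), a.max(b))).or_insert(0) += 1;
        }
    }
    // Edges seen once are on the border; edges between meshlets inside the
    // group are seen twice and stay free to collapse.
    let mut locked = BTreeSet::new();
    for ((a, b), uses) in edge_uses {
        if uses == 1 {
            locked.insert(a);
            locked.insert(b);
        }
    }
    locked
}
```

This sketch also locks edges on the border of the mesh itself, since those are used by a single triangle as well. Telling the two apart would need edge counts from the whole mesh, which is left out here.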

#

oh yeah i also use larger groups than 4

#

i think it helps simplification perf

primal shadow Sep 2, 2024, 12:46 AM

#

What do you use?

fiery bolt Sep 2, 2024, 12:46 AM

#

i do 8 meshlets per group

#

you might wanna test it out though

primal shadow Sep 2, 2024, 12:53 AM

#

fiery bolt you might wanna test it out though

Assertion failed: index_count / 3 <= kMeshletMaxTriangles, file vendor/src/clusterizer.cpp, line 717

#

The groups are too big for meshopt to split 😅

primal shadow Sep 2, 2024, 1:15 AM

#

Oh btw, are you using METIS or meshopt for clusterizing?

#

Interested to know if you compared the two at all

fiery bolt Sep 2, 2024, 2:10 AM

#

primal shadow Oh btw, are you using METIS or meshopt for clusterizing?

meshopt

#

might try metis

fiery bolt Sep 2, 2024, 2:10 AM

#

primal shadow The groups are too big for meshopt to split 😅

how are you splitting?

primal shadow Sep 2, 2024, 2:11 AM

#

fiery bolt how are you splitting?

build_meshlets(indices, vertices, 255, 128, 0.0)

fiery bolt Sep 2, 2024, 2:11 AM

#

in each group?

#

i do the same

#

it shouldn't be causing issues

#

meshopt clusterizes the source mesh after all

#

and it does lucy (28 mil tris) in 2-3 min

fiery bolt Sep 2, 2024, 2:49 AM

#

oh i also made this shitty python script that automatically tiles any gltf

📎 gen.py

primal shadow Sep 2, 2024, 3:17 AM

#

fiery bolt oh i also made this shitty python script that automatically tiles any gltf

Hah, you think I can load gltfs 😅

fiery bolt Sep 2, 2024, 3:17 AM

#

primal shadow Hah, you think I can load gltfs 😅

shouldn't bevy be doing that for you thonk

primal shadow Sep 2, 2024, 3:17 AM

#

Yes but not for meshlet meshes

#

Our asset processing APIs are sadly fairly poor atm, so I don't have a good way to convert GLTF -> bunch of meshlet meshes + scene file

#

(yet)

fiery bolt Sep 2, 2024, 3:22 AM

#

ah

#

froge_sad

#

my issue is that my engine is not an engine

#

the only thing it can do is load gltfs KEKW

primal shadow Sep 2, 2024, 3:24 AM

#

Join Bevy 🦀

fiery bolt Sep 2, 2024, 3:32 AM

#

does bevy do vulkan cutecatNE

primal shadow Sep 2, 2024, 3:40 AM

#

No, wgpu.

#

There is an alternative vulkan backend in a non-official crate, but it doesn't work with any existing stuff ofc.

faint crane Sep 2, 2024, 3:59 AM

#

fiery bolt it's minimum effort nanite reimpl

Thinking this is the best description of my own effort TBH.

fiery bolt Sep 2, 2024, 4:02 AM

#

lol

#

you aren't a billion dollar company doing a siggraph presentation though

faint crane Sep 2, 2024, 4:03 AM

#

Now I just need an excuse to skip on my eternally broken occlusion culling.

#

I swear it was broken when I found it. No way to put it back together.

fiery bolt Sep 2, 2024, 4:11 AM

#

no no

#

you have a week

#

it better be working by then

primal shadow Sep 3, 2024, 1:10 AM

#

Based on recent discussion, bumped to 255v/128t, and some other changes https://github.com/bevyengine/bevy/pull/15023

GitHub

More triangles/vertices per meshlet by JMS55 · Pull Request #15023 ...

Builder changes

Increased meshlet max vertices/triangles from 64v/64t to 255v/256t (meshoptimizer won't allow 256v sadly). This gives us a much greater percentage of meshlets with max tria...

loud crag Sep 3, 2024, 1:53 AM

#

primal shadow Based on recent discussion, bumped to 255v/128t, and some other changes https://...

that code looks so fragile because you aren’t using any constants for these values

primal shadow Sep 3, 2024, 1:54 AM

#

loud crag that code looks so fragile because you aren’t using any constants for these valu...

Code quality comes later :p

loud crag Sep 3, 2024, 1:55 AM

#

it’s something i’d put under “code correctness” which would be the first thing to work on

primal shadow Sep 3, 2024, 1:56 AM

#

There's too much else to work on in the mean time, small stuff like this is not a priority.

fiery bolt Sep 3, 2024, 2:19 AM

#

did some optimization and now it culls 1 million stanford dragons in 2 ms froge_love

#

raster takes 23 ms though (only hw, no sw) KEKW

#

800 billion tris at 30 fps

fiery bolt Sep 3, 2024, 2:37 AM

#

https://cdn.discordapp.com/attachments/335502453371961344/1280354138449903751/2024-09-02_21-29-13.mp4?ex=66d7c636&is=66d674b6&hm=b8c9ae1029f97d19d33242af466caec008a0c7663920290a035c4c142d0a3091&


#

https://cdn.discordapp.com/attachments/335502453371961344/1280354963800719493/2024-09-02_21-32-41.mp4?ex=66d7c6fb&is=66d6757b&hm=fc9829bafd1536f4de28b0a46eb59fe1581989c3991a798300b7f1ccf81cedd8&
hehe overdraw go brrrrrrr


ebon ruin Sep 3, 2024, 2:52 PM

#

But can it support foliage

wicked notch Sep 3, 2024, 2:53 PM

#

overdraw is not real it can't hurt you

#

overdraw:

wispy spear Sep 3, 2024, 2:56 PM

#

https://tenor.com/view/emotional-damage-gif-hurt-feelings-gif-24558392


fiery bolt Sep 6, 2024, 4:24 AM

#

i have achieved scene independence

#

7.2 trillion triangles at 60 fps

#

@primal shadow the edge detection really does help simplification massively

#

thanks for the insight

primal shadow Sep 6, 2024, 4:29 AM

#

edge detection?

#

but uhh glad to help

fiery bolt Sep 6, 2024, 4:30 AM

#

like the edge classification

#

internal/external

primal shadow Sep 6, 2024, 4:30 AM

#

oh, np

#

is it not super slow for you?

#

at build time

fiery bolt Sep 6, 2024, 4:30 AM

#

not really no

#

the other stanford dragon with 7.2 million tris imported in about 1.5 min

primal shadow Sep 6, 2024, 4:31 AM

#

¯_(ツ)_/¯

#

Well, I'll take a closer look at your code soon

#

I need a break from DAG building 😅

fiery bolt Sep 6, 2024, 4:31 AM

#

it's probably the multithreading tbf

#

i'm running on a 12900k

primal shadow Sep 6, 2024, 4:32 AM

#

what does your highest LOD look like?

#

Does it collapse to a sphere?

fiery bolt Sep 6, 2024, 4:32 AM

#

good question lemme check

primal shadow Sep 6, 2024, 4:33 AM

#

Also do you have a link to your github? I lost it

fiery bolt Sep 6, 2024, 4:34 AM

#

it is... whatever this is

fiery bolt Sep 6, 2024, 4:34 AM

#

primal shadow Also do you have a link to your github? I lost it

https://github.com/SparkyPotato/radiance/


primal shadow Sep 6, 2024, 4:37 AM

#

fiery bolt it is... whatever this is

uhh

#

I mean it's kinda dragon shaped

#

sure

#

Managed not to turn into a sphere

fiery bolt Sep 6, 2024, 4:38 AM

#

probably because meshopt doesn't generate vertices

primal shadow Sep 6, 2024, 4:39 AM

#

You went back to your own simplifier?

#

Also hey, at some point I'd appreciate an explanation on how to implement subpixel SW raster

#

The stuff I found online never made sense to me

#

And it seems like you implemented it

fiery bolt Sep 6, 2024, 4:43 AM

#

primal shadow And it seems like you implemented it

nah i haven't yet

#

but i will soon

#

i'll let you know when i do

fiery bolt Sep 6, 2024, 4:43 AM

#

primal shadow You went back to your own simplifier?

nah i'm using meshopt

primal shadow Sep 6, 2024, 4:48 AM

#

fiery bolt probably because meshopt doesn't generate vertices

Oh nvm you're saying it didn't turn into a sphere because of this

fiery bolt Sep 6, 2024, 4:48 AM

#

yeah

primal shadow Sep 6, 2024, 4:49 AM

#

fiery bolt nah i haven't yet

Ah gotcha, nvm yeah looking at your sw shader I see it was something else. Thanks!

fiery bolt Sep 6, 2024, 4:49 AM

#

my sw rasterizer is actually really bad

#

doing everything in hardware is very slightly faster lmao

primal shadow Sep 6, 2024, 4:55 AM

#

Really? I saw way faster speeds with SW raster

#

Did you have mesh shaders already though?

fiery bolt Sep 6, 2024, 4:58 AM

#

primal shadow Did you have mesh shaders already though?

yeah it's all mesh shader based

#

and i do backface culling in the mesh shader

primal shadow Sep 6, 2024, 4:59 AM

#

That explains it. Nanite was started before mesh shaders, and mesh shaders give a lot of similar speedup.

fiery bolt Sep 6, 2024, 5:00 AM

#

i think with a bit of optimization and async compute overlap i can eke out a fair bit of perf with sw tbh

#

mainly the async compute overlap

#

the mesh shaders are completely bottlenecked on hw so they have terrible util

primal shadow Sep 6, 2024, 5:01 AM

#

Yeah I don't have mesh shaders or async overlap 😭

fiery bolt Sep 6, 2024, 5:05 AM

#

impl mesh shaders into wgpu + naga and use them on native froge_evil

primal shadow Sep 6, 2024, 5:11 AM

#

Too much work... I hate touching wgpu/naga

#

Not my kind of thing

fiery bolt Sep 6, 2024, 5:12 AM

#

naga isn't terrible

#

i rewrote/restructured it for module-level scoping long ago

#

wasn't too bad

#

no idea how the code is today though

wicked notch Sep 6, 2024, 9:30 AM

#

fiery bolt 7.2 trillion triangles at 60 fps

insane

ebon ruin Sep 10, 2024, 10:31 PM

#

Question about Nanite/what you guys do

#

Meshlets are merged as the LOD gets lower to prevent edge cruft

#

Doesn’t this rely on the LOD of both meshlets decreasing? So what would happen if you need to lower one meshlet’s LOD, but another meshlet must stay higher?

fiery bolt Sep 10, 2024, 11:27 PM

#

LOD isn't decided at meshlet level, but at meshlet group level

#

and you only lock the boundary of the group

#

and after simplification, you split the group into meshlets again

primal shadow Sep 10, 2024, 11:29 PM

#

fiery bolt and you only lock the boundary of the group

Lock the boundary of the group, *except for where it intersects the mesh border
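The locking rule described above can be sketched on the CPU side. This is a minimal illustration, not the code from either renderer: the function name `group_boundary_locks` and its inputs are hypothetical, and it assumes vertices are already welded by position so that shared vertices have the same index. A vertex is locked only when triangles from more than one group use it, so vertices that sit on the open mesh border but inside a single group stay free to move.

```rust
/// Marks the vertices that must stay fixed while a group is simplified.
/// A vertex is locked when triangles from two or more groups reference it.
/// Vertices on the open mesh border that belong to one group stay unlocked.
fn group_boundary_locks(
    vertex_count: usize,
    triangles: &[[u32; 3]],
    triangle_group: &[u32],
) -> Vec<bool> {
    // First group seen for each vertex; None until a triangle touches it.
    let mut first_group: Vec<Option<u32>> = vec![None; vertex_count];
    let mut locked = vec![false; vertex_count];
    for (tri, &group) in triangles.iter().zip(triangle_group) {
        for &v in tri {
            let v = v as usize;
            let seen = first_group[v];
            match seen {
                None => first_group[v] = Some(group),
                Some(g) if g != group => locked[v] = true,
                Some(_) => {}
            }
        }
    }
    locked
}

fn main() {
    // A strip of four triangles over six vertices.
    // Triangles 0 and 1 are in group 0, triangles 2 and 3 are in group 1.
    let triangles: [[u32; 3]; 4] = [[0, 1, 2], [1, 3, 2], [2, 3, 4], [3, 5, 4]];
    let groups: [u32; 4] = [0, 0, 1, 1];
    let locks = group_boundary_locks(6, &triangles, &groups);
    // Only vertices 2 and 3 are shared between the two groups.
    println!("{:?}", locks);
}
```

The resulting flags could then be handed to a simplifier that accepts per-vertex locks, in place of a blanket "lock every border" option.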

fiery bolt Sep 10, 2024, 11:30 PM

#

yeah

#

i think that part of my code is broken actually

#

i see cracks

ebon ruin Sep 11, 2024, 12:20 AM

#

i see

#

thank you all for your time

primal shadow Sep 11, 2024, 5:09 AM

#

I hate tangents

#

Idk why my code isn't correct but it's not

fiery bolt Sep 11, 2024, 5:14 AM

#

compute analytic derivatives instead of using ddx/ddy perhaps

#

@primal shadow oh yeah i also found out that my border vertex detection was completely wrong, fixed it and pushed to my branch (PRed to your bevy repo)

#

might wanna test it out again lol

primal shadow Sep 11, 2024, 5:17 AM

#

fiery bolt compute analytic derivatives instead of using ddx/ddy perhaps

I am, my ddx/ddy aren't from the ddx/ddy() functions, I compute them as part of the visbuffer resolve

fiery bolt Sep 11, 2024, 5:17 AM

#

ah

#

no idea then KEKW

primal shadow Sep 11, 2024, 5:17 AM

#

fiery bolt @primal shadow oh yeah i also found out that my border vertex detection w...

Oh, thank you, I appreciate it. I am extremely short on time this week, but please do remind me if I don't get to it by Saturday.

fiery bolt Sep 11, 2024, 5:17 AM

#

will do if i remember

#

lmao i fixed edge classification which led to fixing cracks which led to significant occlusion culling improvements which now means i can do 7.2 trillion triangles in under 3 ms

#

froge_love

#

4.3 +- 0.3 if i lock clocks to base

wispy spear Sep 11, 2024, 10:36 AM

#

!remindme 3d remind jasmine about the thing

vivid boughBOT Sep 11, 2024, 10:36 AM

#

New Reminder | ID:72591537

Alright deccer, I'll remind you in 3 days about:

remind jasmine about the thing

fiery bolt Sep 11, 2024, 3:47 PM

#

mobile drivers coming in clutch (words you'd never thought you'd ever see)

wispy spear Sep 11, 2024, 4:12 PM

#

do you mean crutches?

fiery bolt Sep 12, 2024, 9:07 PM

#

@languid vector methinks i have found a solution

#

https://jcgt.org/published/0002/02/05/

#

the original sphere projection algo

#

that takes near clipping into account

#

project the full bounds to the screen, and scale the error using that

languid vector Sep 12, 2024, 9:08 PM

#

pog thanks!

fiery bolt Sep 12, 2024, 9:09 PM

#

idk if it works yet

languid vector Sep 12, 2024, 9:36 PM

#

fiery bolt lmao i fixed edge classification which led to fixing cracks which led to signifi...

what was wrong?

#

let me guess - you classified edges without generating shadow index buffer with geometry-only data?

#

the only thing I wonder is how the hell you manage to do 7.2 trillion triangles in 4ms

#

Is hierarchy this major optimization?

fiery bolt Sep 12, 2024, 9:39 PM

#

languid vector let me guess - you classified edges without generating shadow index buffer with ...

nah I was unlocking edges that were mesh borders but not group borders, but did it completely wrong bleaker_kekw

#

so it was unlocking group borders

#

leading to cracks

fiery bolt Sep 12, 2024, 9:39 PM

#

languid vector Is hierarchy **this** major optimization?

yeah it's really useful

#

you can get rid of a bunch of instances early

#

and never even consider more than 1 or 2 meshlets for them

#

you also do frustum and occlusion culling inside hierarchy traversal

#

so you can save a bunch of bandwidth too

fiery bolt Sep 12, 2024, 9:41 PM

#

fiery bolt you also do frustum and occlusion culling inside hierarchy traversal

atleast, I do because it's literally just a conventional bvh with error stuck on top

languid vector Sep 12, 2024, 9:41 PM

#

I've only done the frustum culling so far

#

I can't really understand what I am bound by since not a single metric in Nsight is loaded more than 80%

fiery bolt Sep 12, 2024, 9:41 PM

#

memory

languid vector Sep 12, 2024, 9:42 PM

#

though "warp can't launch" is insanely high

fiery bolt Sep 12, 2024, 9:42 PM

#

or raster

fiery bolt Sep 12, 2024, 9:42 PM

#

languid vector though "warp can't launch" is insanely high

which shader type

languid vector Sep 12, 2024, 9:42 PM

#

fiery bolt or raster

solved with sw raster?

languid vector Sep 12, 2024, 9:42 PM

#

fiery bolt which shader type

task/mesh pipeline

fiery bolt Sep 12, 2024, 9:42 PM

#

what's ISBE alloc stalled at

languid vector Sep 12, 2024, 9:42 PM

#

I am not sure I understand what ISBE is 😅

fiery bolt Sep 12, 2024, 9:43 PM

#

idk either

#

it's just a metric

languid vector Sep 12, 2024, 9:43 PM

#

kekw

fiery bolt Sep 12, 2024, 9:43 PM

#

I think it's memory allocation for mesh shader outputs

#

if your rasterizer is cooked you'll stall at that because it's still processing other tris

languid vector Sep 12, 2024, 9:43 PM

#

I also do nanite-style quantization so it should have saved a bit of bandwidth

#

I mean, it is nearly x4 compression

fiery bolt Sep 12, 2024, 9:43 PM

#

I have zero compression lol

#

it's not about mesh shader read bandwidth, it's about how fast the rasterizer can chug triangles

#

you're raster bound

languid vector Sep 12, 2024, 9:44 PM

#

I have never thought I will be raster bound bleakekw

#

and I guess the only way to solve it is occlusion culling + sw raster?

#

afaik occlusion culling is a massive optimization

#

really massive

fiery bolt Sep 12, 2024, 9:47 PM

#

languid vector afaik occlusion culling is a massive optimization

yes

#

how large is your test scene

languid vector Sep 12, 2024, 9:48 PM

#

fiery bolt how large is your test scene

25x25 grid of happy buddha meshes :>

fiery bolt Sep 12, 2024, 9:50 PM

#

how many tris is that

#

you should use the xyzrgb dragon froge_love

#

it's got 7.2 million tris in it

#

or Lucy froge_love

languid vector Sep 12, 2024, 9:55 PM

#

fiery bolt how many tris is that

1.1m

fiery bolt Sep 12, 2024, 9:55 PM

#

27 mil iirc

languid vector Sep 12, 2024, 9:55 PM

#

fiery bolt or Lucy froge_love

yeah lucy is fat ass mesh

fiery bolt Sep 12, 2024, 9:55 PM

#

languid vector 1.1m

which gpu

languid vector Sep 12, 2024, 9:55 PM

#

1660 Ti

fiery bolt Sep 12, 2024, 9:55 PM

#

yeah you need occlusion cull

languid vector Sep 12, 2024, 9:56 PM

#

I guess so kekw

fiery bolt Sep 12, 2024, 9:56 PM

#

languid vector yeah lucy is fat ass mesh

it takes 17 min to build for me lmao

languid vector Sep 12, 2024, 9:56 PM

#

fiery bolt it takes 17 min to build for me lmao

holy cow

fiery bolt Sep 12, 2024, 9:56 PM

#

on a 12900k

#

full multithreaded

languid vector Sep 12, 2024, 9:56 PM

#

bleakekw

fiery bolt Sep 12, 2024, 9:56 PM

#

still better than unreal

languid vector Sep 12, 2024, 9:56 PM

#

unreal takes longer?

fiery bolt Sep 12, 2024, 9:57 PM

#

I let it run for an hour once and it froze

#

so I killed it

#

if this shit works I can finally start streaming froge_love

languid vector Sep 12, 2024, 10:05 PM

#

fiery bolt if this shit works I can finally start streaming froge_love

"this shit" is a new sphere projection method?

fiery bolt Sep 12, 2024, 10:05 PM

#

yeah

languid vector Sep 12, 2024, 10:06 PM

#

I was just about to ask you if you already have implementation kekw

#

my dumb ass can't handle the maths now

fiery bolt Sep 12, 2024, 10:07 PM

#

I'm making coffee so I can consume copious amounts of caffeine to understand wtf the paper does

languid vector Sep 12, 2024, 10:08 PM

#

I personally can't understand the meaning of "conservativeness" in all these papers

fiery bolt Sep 12, 2024, 10:09 PM

#

languid vector I personally can't understand the meaning of "conservativeness" in all these pap...

every choice you make must overestimate error, not underestimate

languid vector Sep 12, 2024, 10:09 PM

#

fiery bolt every choice you make must overestimate error, not underestimate

Ahh, it makes sense now

fiery bolt Sep 12, 2024, 10:09 PM

#

your bounding box must be larger than the object, never smaller

#

because otherwise you will cull too much

languid vector Sep 12, 2024, 10:10 PM

#

yess, it makes sense now. thanks!

#

alr, it has 3 out variables:

out vec2 perpendicularDirection, out vec2 U, out vec2 L

now need to figure out what tf to do with it

fiery bolt Sep 12, 2024, 10:30 PM

#

uhhhhhhhhhhh error is decreasing as i get closer

languid vector Sep 12, 2024, 10:31 PM

#

fiery bolt uhhhhhhhhhhh error is decreasing as i get closer

so it aint working

fiery bolt Sep 12, 2024, 10:32 PM

#

i've done something wrong

languid vector Sep 12, 2024, 10:32 PM

#

so as I understand, it takes view and sphere and outputs a polygon with N sides with coordinates in screen-space? I guess I only need AABB from the sphere

fiery bolt Sep 12, 2024, 10:33 PM

#

the core algorithm takes a sphere in view space and tells you the min, max along a specific axis

languid vector Sep 12, 2024, 10:33 PM

#

so I simply build an AABB and then compute its area that I use as error estimation?

#

"simply build" using this algorithm I mean

fiery bolt Sep 12, 2024, 10:34 PM

#

no just one axis

#

and you scale error with that

languid vector Sep 12, 2024, 10:34 PM

#

ah I see

#

so phi is always 0?

fiery bolt Sep 12, 2024, 10:35 PM

#

i'm doing y axis so it's always pi / 2

languid vector Sep 12, 2024, 10:35 PM

#

yup, makes sense

#

does it output size in screen-space?

#

or I need to project it further

fiery bolt Sep 12, 2024, 10:36 PM

#

it outputs in view space

#

everything is in view space

languid vector Sep 12, 2024, 10:37 PM

#

I see

#

I guess you are better at maths than me, so if you don't mind, can I come back with questions if I have some after reading the paper?

fiery bolt Sep 12, 2024, 10:38 PM

#

i don't think i am KEKW

#

but sure

languid vector Sep 12, 2024, 10:38 PM

#

thanks!

languid vector Sep 12, 2024, 11:00 PM

#

noice

#

hold on, I am just stupid

fiery bolt Sep 12, 2024, 11:04 PM

#

same

languid vector Sep 12, 2024, 11:11 PM

#

no, it indeed returns nans and I can't understand why

languid vector Sep 12, 2024, 11:48 PM

#

@fiery bolt so I figured out the issue. I have inverse Z camera and was passing inverted clip planes to the algorithm 🤡

#

tho haven't figured out how to use U and L for projection yet

fiery bolt Sep 12, 2024, 11:48 PM

#

you should be doing it in view space though thonk

#

inverse Z shouldn't matter

languid vector Sep 12, 2024, 11:49 PM

#

I mean, near plane was further than far plane

#

which produced tons of NaNs due to sqrt of a negative value

fiery bolt Sep 12, 2024, 11:49 PM

#

ah lmao

languid vector Sep 12, 2024, 11:50 PM

#

did u figure out how to use U and L?

fiery bolt Sep 12, 2024, 11:54 PM

#

that's the screenspace bounds

languid vector Sep 12, 2024, 11:54 PM

#

projected to the far plane?

fiery bolt Sep 12, 2024, 11:54 PM

#

no, projected to the screen

#

which is the near plane

languid vector Sep 12, 2024, 11:54 PM

#

amazing, so no need to project it?

#

if it's already projected to the screen

fiery bolt Sep 12, 2024, 11:55 PM

#

i think no

#

actually no, the sample code they've given projects

languid vector Sep 12, 2024, 11:57 PM

#

Though it doesn't take FOV into account

#

which is kinda strange

fiery bolt Sep 12, 2024, 11:57 PM

#

yeah you need to project

languid vector Sep 12, 2024, 11:58 PM

#

how do I project projected stuff bleakekw

fiery bolt Sep 12, 2024, 11:58 PM

#

idk, the sample code does it

#

just project it frog_dum

languid vector Sep 12, 2024, 11:58 PM

#

Alr I see, I will have a look at sample code

#

so yeah, it works relatively awful without reprojection

fiery bolt Sep 13, 2024, 12:37 AM

#

i think i have something that sorta works?

languid vector Sep 13, 2024, 12:55 AM

#

fiery bolt i think i have something that sorta works?

how did u fix the issue when error gets smaller if you get closer to the sphere?

fiery bolt Sep 13, 2024, 12:56 AM

#

i rewrote the code a few times

#

and projected

languid vector Sep 13, 2024, 12:56 AM

#

    vec2 parent_U, parent_L;
    GetBoundsForPhiLengyel(0.0f, parent_projected_bounding_sphere.center, parent_projected_bounding_sphere.radius, camera_data.near_clip_distance, camera_data.far_clip_distance, parent_U, parent_L);

    vec4 parent_projected_points[2];
    parent_projected_points[0] = camera_data.proj * vec4(parent_U.x, 0.0f, camera_data.near_clip_distance, 1.0f);
    parent_projected_points[1] = camera_data.proj * vec4(parent_L.x, 0.0f, camera_data.near_clip_distance, 1.0f);

    const float parent_result_error = (parent_projected_points[1].x / parent_projected_points[1].w) - (parent_projected_points[0].x / parent_projected_points[0].w);

this is how I do it

fiery bolt Sep 13, 2024, 12:57 AM

#

    public f32 project_error(f32x4 bounds, f32 error) {
        let center = mul(this.mv, f32x4(bounds.xyz, 1.f)).xyz;
        let radius = bounds.w * this.scale;
        let err_frac = error / bounds.w;
        
        if ((center.z + radius) <= this.near) return 0.f;

        let dist2 = dot(center, center);
        let a = sqrt(dist2 - center.z * center.z);
        let t2 = dist2 - radius * radius;
        let t = sqrt(max(t2, 0.f));
        let in_sphere = t2 <= 0.f;

        f32x2 bounds[2];
        // cos(theta) = t / dist
        // sin(theta) = r / dist
        // T = (rotate(theta) * (a, z) / dist) * t,
        // removing the dist divide in cos, sin
        // ncos(theta) = t
        // nsin(theta) = r
        // rotate(theta) == rotate(ntheta) / dist
        // therefore, T = (rotate(ntheta) * (a, z) / dist2) * t
        // saving us two divides and a sqrt!
        var v = in_sphere ? f32x2(0.f) : f32x2(t, radius);
        let clip_sphere = (center.z + radius) >= this.near;
        let off = this.near - center.z;
        var k = sqrt(radius * radius - off * off);

        [unroll]
        for (int i = 0; i < 2; i++) {
            if (!in_sphere) 
                bounds[i] = mul(f32x2x2(v.x, v.y, -v.y, v.x), f32x2(a, center.z)) * v.x / dist2;
            let clip_bound = in_sphere || (bounds[i].y < this.near);
            if (clip_sphere && clip_bound)
                bounds[i] = f32x2(a + k, this.near);
            v.y = -v.y;
            k = -k;
        }

        let ndc_size = abs(bounds[0].x / bounds[0].y - bounds[1].x / bounds[1].y) * this.h;
        // NDC size has a range of [0, 2] mapping to [0, height], 
        // but don't divide by 2 because the error is divided by 2 at build-time.
        return ndc_size * err_frac * this.screen.y;
    }

languid vector Sep 13, 2024, 12:59 AM

#

wait

#

why ur code is in rust

fiery bolt Sep 13, 2024, 1:00 AM

#

slang

languid vector Sep 13, 2024, 1:00 AM

#

got it

fiery bolt Sep 13, 2024, 1:00 AM

#

my cpu code is in rust tho

#

froge_love

languid vector Sep 13, 2024, 1:01 AM

#

Alr, I will get back to the code tomorrow, gotta sleep. it is 3 AM in my country kekw

fiery bolt Sep 13, 2024, 1:01 AM

#

you still have 3 hours till bedtime!

wicked notch Sep 13, 2024, 1:03 AM

#

make that 6

primal shadow Sep 13, 2024, 1:15 AM

#

Time to debug tangents again

primal shadow Sep 13, 2024, 1:16 AM

#

fiery bolt public f32 project_error(f32x4 bounds, f32 error) { let cent...

Btw after further thought, projecting the LOD sphere based on the culling sphere makes no sense to me

fiery bolt Sep 13, 2024, 1:16 AM

#

why not?

primal shadow Sep 13, 2024, 1:17 AM

#

you'd have different projections for different parts of the same group, which makes no sense

fiery bolt Sep 13, 2024, 1:17 AM

#

how would you?

primal shadow Sep 13, 2024, 1:17 AM

#

and when you do a BVH, it's based on the group, not individual meshlets

fiery bolt Sep 13, 2024, 1:17 AM

#

you're using the group LOD sphere

primal shadow Sep 13, 2024, 1:17 AM

#

wait what culling sphere do you use?

fiery bolt Sep 13, 2024, 1:18 AM

#

a merged sphere of all lower lods' group lod sphere

#

to do BVH you need a bounding sphere or else traversal won't be monotonic
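A merged sphere of that kind can be built by repeatedly enclosing pairs of spheres. The sketch below is an assumption about how such a merge might look, not the actual build code: `Sphere` and `merge` are hypothetical names. The point is that a parent sphere produced this way fully contains each child sphere, which is what lets a test against the parent stay conservative for everything beneath it.

```rust
#[derive(Clone, Copy, Debug)]
struct Sphere {
    center: [f32; 3],
    radius: f32,
}

/// Smallest sphere that encloses both `a` and `b`.
fn merge(a: Sphere, b: Sphere) -> Sphere {
    let d = [
        b.center[0] - a.center[0],
        b.center[1] - a.center[1],
        b.center[2] - a.center[2],
    ];
    let dist = (d[0] * d[0] + d[1] * d[1] + d[2] * d[2]).sqrt();
    // One sphere already contains the other, so it is the answer.
    if dist + b.radius <= a.radius {
        return a;
    }
    if dist + a.radius <= b.radius {
        return b;
    }
    // Otherwise the enclosing sphere spans from the far side of `a`
    // to the far side of `b` along the line joining the centers.
    let radius = (dist + a.radius + b.radius) * 0.5;
    // `dist` is non-zero here, because equal centers hit an early return.
    let t = (radius - a.radius) / dist;
    Sphere {
        center: [
            a.center[0] + d[0] * t,
            a.center[1] + d[1] * t,
            a.center[2] + d[2] * t,
        ],
        radius,
    }
}

fn main() {
    let a = Sphere { center: [0.0, 0.0, 0.0], radius: 1.0 };
    let b = Sphere { center: [4.0, 0.0, 0.0], radius: 1.0 };
    // Expect a sphere centered at (2, 0, 0) with radius 3.
    println!("{:?}", merge(a, b));
}
```

Folding `merge` over all child spheres gives an enclosing sphere, though not necessarily the tightest one for three or more inputs.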

fiery bolt Sep 13, 2024, 3:47 AM

#

oooooooook even this doesn't work

#

time to bust out the pen and paper

languid vector Sep 13, 2024, 12:30 PM

#

fiery bolt you still have 3 hours till bedtime!

@fiery bolt so we have an answer for "how to project the sphere" question, but do we have an answer for "where tf to place the sphere" question? kekw

#

I mean if camera is inside the sphere

#

Just snap to camera position?

dull oyster Sep 13, 2024, 2:06 PM

#

I still don't understand why the sphere should not just be at the center of the group

languid vector Sep 13, 2024, 3:58 PM

#

dull oyster I still don't understand why the sphere should not just be at the center of the ...

So test is conservative

#

You should never underestimate error, you can only overestimate

#

here are 2 groups that have the same error. Obviously the left one will have perceptually bigger error because it is bigger by itself, so it will be closer to camera

fiery bolt Sep 13, 2024, 4:58 PM

#

languid vector @fiery bolt so we have an answer for "how to project the sphere" quest...

I ended up doing something different that seems to work

#

but it's rendering way too much

#

like, 2 million meshlets

#

my culling queues are filling up bleaker_kekw

languid vector Sep 13, 2024, 5:59 PM

#

fiery bolt like, 2 million meshlets

ideally u want 128 times less for 1080p monitor kekw

fiery bolt Sep 13, 2024, 6:00 PM

#

agonyfrog

#

debubbing time

dull oyster Sep 13, 2024, 6:01 PM

#

languid vector So test is conservative

The test is made so that the error is not (in theory) perceptible, wouldn't making it more conservative just use more VRAM when you could use a lower fidelity lod for the same visual result?

fiery bolt Sep 13, 2024, 6:01 PM

#

the test only guarantees the error isn't perceptible when you use bounds

dull oyster Sep 13, 2024, 6:01 PM

#

I understand that it keeps higher fidelity lods more, but I'm not sure it's really necessary

fiery bolt Sep 13, 2024, 6:01 PM

#

if you just use the center of the group it can't guarantee that

dull oyster Sep 13, 2024, 6:02 PM

#

I may be missing the point completely but it bothers me that I don't see the issue ^^'

languid vector Sep 13, 2024, 6:02 PM

#

dull oyster I understand that it keeps higher fidelity lods more, but I'm not sure it's real...

it is not required but it is kinda more valid

fiery bolt Sep 13, 2024, 6:03 PM

#

dull oyster I may be missing the point completely but it bothers me that I don't see the iss...

1. assuming error at the center is 0.999 px, the error for all triangles in the group closer than the center (which should be about half) will be more than a pixel
2. when you do BVH, you need the outermost node's projected error to always bound that of its children, so if you just use the center of the node, groups closer than that (again, about half), might have a higher projected error than the BVH node itself
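The difference between measuring at the center and measuring at the nearest point of the bounds can be shown with a small calculation. This is a simplified sketch under stated assumptions, not the shader posted in this thread: it uses plain distance rather than the clipped sphere projection from the paper, `h` is assumed to be `1 / tan(fov_y / 2)`, and the function names are made up for illustration.

```rust
/// Size in pixels of an object-space `error` seen at view-space `distance`.
/// `h` is assumed to be 1 / tan(fov_y / 2); NDC height 2 maps to `screen_height`.
fn projected_error_px(error: f32, distance: f32, screen_height: f32, h: f32) -> f32 {
    error * h / distance * screen_height * 0.5
}

/// Distance to the closest point of the bounding sphere, clamped to the near plane.
/// Using this instead of the center distance can only increase the estimate.
fn conservative_distance(center_distance: f32, radius: f32, near: f32) -> f32 {
    (center_distance - radius).max(near)
}

fn main() {
    let (error, screen_height, h, near) = (0.018_f32, 1080.0_f32, 1.0_f32, 0.1_f32);
    let (center_distance, radius) = (10.0_f32, 4.0_f32);

    let at_center = projected_error_px(error, center_distance, screen_height, h);
    let at_nearest = projected_error_px(
        error,
        conservative_distance(center_distance, radius, near),
        screen_height,
        h,
    );
    // The center estimate stays under one pixel while the nearest point exceeds it.
    println!("center: {at_center:.3} px, nearest: {at_nearest:.3} px");
}
```

With these example numbers a one-pixel threshold accepts the group when tested at its center, even though the triangles on the near side of the sphere already exceed a pixel.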

languid vector Sep 13, 2024, 6:03 PM

#

fiery bolt 1. assuming error at the center is 0.999 px, the error for all triangles in the ...

this

languid vector Sep 13, 2024, 6:04 PM

#

fiery bolt 1. assuming error at the center is 0.999 px, the error for all triangles in the ...

btw, do u clamp the error sphere position to camera position if camera is inside the group sphere?

fiery bolt Sep 13, 2024, 6:05 PM

#

i was sending my code but discord seems to be blocking it bleaker_kekw

languid vector Sep 13, 2024, 6:05 PM

#

💀

fiery bolt Sep 13, 2024, 6:06 PM

#

i assume it's being blocked for spam

#

    // 2D Polyhedral Bounds of a Clipped, Perspective-Projected 3D Sphere (Michael Mara, Morgan McGuire).
    // We get the projected bounds on the axis that is the longest upon projection (need to be conservative!),
    // which is the one from (0, 0) to the sphere's center.
    public f32 perceptible_error_distance(f32x4 bounds) {
        let center = mul(this.mv, f32x4(bounds.xyz, 1.f)).xyz;
        let radius = bounds.w * this.scale;

        if (center.z + radius <= this.near)
            return 0.f;

        let dist2 = dot(center, center);
        let a = sqrt(dist2 - center.z * center.z);
        let proj_center = f32x2(a, center.z);
        let t2 = dist2 - radius * radius;
        var t = sqrt(max(t2, 0.f));
        let in_sphere = t2 < 0.f;

        // cos(theta) = t / dist
        // sin(theta) = r / dist
        // T = t * rotate(theta) * proj_center / dist,
        // removing the dist divide in cos, sin
        // ncos(theta) = t
        // nsin(theta) = r
        // rotate(theta) == rotate(ntheta) / dist
        // therefore, T = t * rotate(ntheta) * proj_center / dist2
        // saving us two divides and a sqrt!
        let ncos = t;
        let nsin = radius;
        let wt_z = dot(f32x2(-nsin, ncos), proj_center) / dist2;
        var t_z = t * wt_z;

        if (in_sphere || t_z < this.near) {
            // let off = this.near - center.z;
            // let k = sqrt(radius * radius - off * off);
            // let t = f32x2(a + k, this.near);
            t_z = this.near;
        }

        return t_z;
    }

#

    public f32 error_perceptible_at(f32 error) {
        // Don't divide by 2 because the error is already divided by 2 during build.
        return this.screen.y * this.h * this.min_scale * error;
    }

    public bool should_visit_bvh(f32x4 lod_bounds, f32 parent_error) {
        return this.perceptible_error_distance(lod_bounds) <= this.error_perceptible_at(parent_error);
    }

    public bool should_render(f32x4 lod_bounds, f32 error) {
        return this.perceptible_error_distance(lod_bounds) > this.error_perceptible_at(error);
    }

#

there

#

idk why i'm returning t_z

languid vector Sep 13, 2024, 6:07 PM

#

fiery bolt idk why i'm returning t_z

lol

#

and it works?

fiery bolt Sep 13, 2024, 6:07 PM

#

well it doesn't have holes

#

or overlapping meshlets

languid vector Sep 13, 2024, 6:07 PM

#

fiery bolt well it doesn't have holes

already better

#

but rendering too much?

fiery bolt Sep 13, 2024, 6:08 PM

#

yeah it does that

languid vector Sep 13, 2024, 6:08 PM

#

we can always increase the threshold tho kekw

fiery bolt Sep 13, 2024, 6:08 PM

#

idk how correct it is

jagged valve Sep 13, 2024, 7:02 PM

#

fiery bolt my cpu code is in rust tho

How do you handle your parameter binding business with a rust/slang setup?

fiery bolt Sep 13, 2024, 7:05 PM

#

bindless everything

#

struct defs are duplicated though froge_sad

jagged valve Sep 13, 2024, 7:07 PM

#

fiery bolt bindless everything

ngl I had no idea that was even possible

fiery bolt Sep 14, 2024, 2:51 AM

#

why is removing frustum culling reducing the number of meshlets drawn

#

but increasing the number of bvh nodes traversed (as it should)

languid vector Sep 14, 2024, 4:53 PM

#

fiery bolt why is removing frustum culling reducing the number of meshlets drawn

task failed successfully

primal shadow Sep 14, 2024, 7:17 PM

#

@fiery bolt ok I'm taking a look at your PR today

#

If i pass ptr::null() for vertex_locks, and add back SimplifyOptions::LockBorder, it should be equivalent to the old code right? For comparison purposes

#

So uhh it's a bit... aggressive

#

📎 message.txt

#

231 -> 4 meshlets is... a choice 😅

#

let me try this on the cliff instead of bunnies

#

Cliffs:

#

📎 message.txt

#

Something's a bit off

#

Anyways back to compression

fiery bolt Sep 14, 2024, 7:47 PM

#

primal shadow If i pass ptr::null() for vertex_locks, and add back SimplifyOptions::LockBorder...

probably? but not sure

#

are these numbers with the edge detection or without?

primal shadow Sep 14, 2024, 8:08 PM

#

fiery bolt are these numbers with the edge detection or without?

Wdym? Expand the results, I showed on main/your PR

fiery bolt Sep 14, 2024, 8:20 PM

#

primal shadow Wdym? Expand the results, I showed on main/your PR

like, is this after replacing vertex locks with a nullptr or without

primal shadow Sep 14, 2024, 9:08 PM

#

I used your vertex locks for your pr

#

And used nullptr for mine

fiery bolt Sep 14, 2024, 9:46 PM

#

hmmmmm

fiery bolt Sep 14, 2024, 9:47 PM

#

primal shadow 231 -> 4 meshlets is... a choice 😅

i'm assuming this is the simplification queue length?

#

so it doesn't mean that the whole mesh was 231 meshlets and it's now 4

#

what's the threshold at which you reject a simplified group?

fiery bolt Sep 14, 2024, 10:37 PM

#

my frustum culling has been wrong all along...

#

lmfao

#

@languid vector 7.2 trillion tris in 1.46 +- 0.29 ms

languid vector Sep 14, 2024, 10:38 PM

#

fiery bolt @languid vector 7.2 trillion tris in 1.46 +- 0.29 ms

holy crap

#

I am rendering 500m tris at 30ms

#

:c

fiery bolt Sep 14, 2024, 10:39 PM

#

occlusion culling

languid vector Sep 14, 2024, 10:39 PM

#

but without occlusion culling

#

yeah

#

and without hierarchy

#

and without sw raster

fiery bolt Sep 14, 2024, 10:39 PM

#

occlusion cull is a bigger win than hierarchy

#

i'm gonna tune my sw raster thresholds rn

#

it's not that much of a difference

#

around 20% boost

languid vector Sep 14, 2024, 10:48 PM

#

what was the best boost for your virtual geom renderer?

#

I mean runtime performance

fiery bolt Sep 14, 2024, 10:49 PM

#

wdym

primal shadow Sep 14, 2024, 11:25 PM

#

fiery bolt i'm assuming this is the simplification queue length?

Yeah, amount of meshlets at each level

primal shadow Sep 14, 2024, 11:26 PM

#

fiery bolt so it doesn't mean that the whole mesh was 231 meshlets and it's now 4

It does though

primal shadow Sep 14, 2024, 11:26 PM

#

fiery bolt what's the threshold at which you reject a simplified group?

I allow simplifying at least 5% (i.e. 95% the same). Maybe I should change that and then test your changes again

primal shadow Sep 14, 2024, 11:26 PM

#

fiery bolt it's not that much of a difference

(if you have mesh shaders :P)

fiery bolt Sep 14, 2024, 11:35 PM

#

primal shadow It does though

no, because some groups would've been rejected and will not be simplified ever again

primal shadow Sep 14, 2024, 11:35 PM

#

also true

fiery bolt Sep 14, 2024, 11:36 PM

#

primal shadow I allow simplifying at least 5% (i.e. 95% the same). Maybe I should change that ...

yeah

#

i do 60% iirc

primal shadow Sep 14, 2024, 11:37 PM

#

Tried 65%

LOD: 0, meshlet count: 15616, meshlet occupancy counts: {128: 15615, 88: 1, }
LOD: 1, meshlet count: 8066, meshlet occupancy counts: {128: 6406, 127: 1114, 64: 352, 63: 156, 126: 29, 62: 7, 43: 1, 125: 1, }
LOD: 2, meshlet count: 4719, meshlet occupancy counts: {128: 2925, 64: 507, 63: 465, 127: 222, 32: 157, 31: 133, 62: 98, 96: 62, 95: 59, 126: 27, 30: 20, 94: 17, 61: 13, 29: 5, 125: 3, 93: 2, 46: 1, 28: 1, 66: 1, 85: 1, }
LOD: 3, meshlet count: 1358, meshlet occupancy counts: {128: 811, 32: 11, 111: 9, 64: 9, 16: 9, 48: 9, 11: 9, 113: 8, 109: 8, 9: 8, 118: 8, 10: 8, 79: 8, 95: 8, 23: 7, 47: 7, 120: 7, 122: 7, 29: 7, 63: 7, 73: 7, 94: 7, 6: 7, 127: 7, 93: 7, 106: 7, 99: 6, 34: 6, 45: 6, 117: 6, 70: 6, 12: 6, 75: 6, 97: 6, 33: 6, 55: 6, 5: 5, 98: 5, 19: 5, 53: 5, 7: 5, 112: 5, 74: 5, 100: 5, 14: 5, 22: 5, 65: 5, 101: 5, 110: 5, 25: 5, 28: 5, 49: 5, 27: 5, 68: 5, 61: 4, 51: 4, 56: 4, 119: 4, 126: 4, 44: 4, 13: 4, 24: 4, 60: 4, 102: 4, 71: 4, 15: 4, 123: 4, 18: 4, 76: 4, 39: 4, 50: 4, 124: 4, 26: 4, 115: 4, 116: 4, 17: 4, 30: 4, 91: 4, 31: 4, 80: 4, 2: 4, 96: 4, 37: 4, 52: 3, 46: 3, 43: 3, 62: 3, 121: 3, 125: 3, 4: 3, 90: 3, 89: 3, 72: 3, 20: 3, 85: 3, 84: 3, 114: 3, 35: 3, 40: 2, 83: 2, 8: 2, 1: 2, 41: 2, 3: 2, 103: 2, 21: 2, 108: 2, 81: 2, 82: 2, 57: 2, 42: 2, 36: 2, 78: 2, 88: 2, 38: 2, 104: 1, 58: 1, 77: 1, 107: 1, 69: 1, 86: 1, }

fiery bolt Sep 14, 2024, 11:38 PM

#

is that the cliff?

#

huh i get
561, 284, 152, 80, 40, 19, 10, 6, 3, 2
meshlets per lod

#

for the bunny

primal shadow Sep 14, 2024, 11:40 PM

#

fiery bolt is that the cliff?

Yes

fiery bolt Sep 14, 2024, 11:41 PM

#

hm that's weird

#

primal shadow Sep 15, 2024, 12:33 AM

#

Idk. I'm done with trying to improve DAG building atm.

#

Next todos are compress meshlet data and then BVH-based persistent culling

fiery bolt Sep 15, 2024, 12:36 AM

#

primal shadow Next todos are compress meshlet data and then BVH-based persistent culling

you can't do persistent queues with wgpu atm

#

wgsl doesn't support coherency

primal shadow Sep 15, 2024, 12:36 AM

#

Hmm, do you need it?

fiery bolt Sep 15, 2024, 12:37 AM

#

yeah coherency is required for any non-atomic writes to be visible to other workgroups in the same dispatch

#

same reason it's needed for SPD

primal shadow Sep 15, 2024, 12:37 AM

#

Which part needs those?

fiery bolt Sep 15, 2024, 12:37 AM

#

primal shadow Which part needs those?

the whole persistent queue shtick?

primal shadow Sep 15, 2024, 12:37 AM

#

Right I know what it does, I'm curious what needs it though for persistent queues

#

Oh wait

fiery bolt Sep 15, 2024, 12:38 AM

#

you're writing to a queue lol

primal shadow Sep 15, 2024, 12:38 AM

#

you can atomic increment the queue counter

#

but your write won't be visible lol

fiery bolt Sep 15, 2024, 12:38 AM

#

yep

primal shadow Sep 15, 2024, 12:38 AM

#

yeah I see

fiery bolt Sep 15, 2024, 12:38 AM

#

KEKW

#

I do dependent dispatches

primal shadow Sep 15, 2024, 12:38 AM

#

Isn't it literally just adding coherent (glsl) / globallycoherent (hlsl) to the buffer declaration though?

fiery bolt Sep 15, 2024, 12:38 AM

#

and so does nanite on PC

fiery bolt Sep 15, 2024, 12:38 AM

#

primal shadow Isn't it literally just adding coherent (glsl) / globallycoherent (hlsl) to the ...

yep

primal shadow Sep 15, 2024, 12:38 AM

#

Ok I'll patch naga, easy enough

fiery bolt Sep 15, 2024, 12:39 AM

#

oh yeah you also need forward progress guarantees

primal shadow Sep 15, 2024, 12:39 AM

#

That, or spirv passthrough in bevy

fiery bolt Sep 15, 2024, 12:39 AM

#

yeah that works

primal shadow Sep 15, 2024, 12:39 AM

#

fiery bolt oh yeah you also need forward progress guarantees

Yeah ik about that part

#

let me find the spirv thing

fiery bolt Sep 15, 2024, 12:39 AM

#

metal on M series fails that

primal shadow Sep 15, 2024, 12:39 AM

#

https://registry.khronos.org/SPIR-V/specs/unified1/SPIRV.html#Decoration coherent decoration

fiery bolt Sep 15, 2024, 12:39 AM

#

so you have to fall back to dependent dispatches for apple silicon

loud crag Sep 15, 2024, 12:40 AM

#

fiery bolt metal on M series fails that

fails what

primal shadow Sep 15, 2024, 12:40 AM

#

fiery bolt metal on M series fails that

Naga is already broken on metal when it comes to atomic<u64> anyways so idc

fiery bolt Sep 15, 2024, 12:40 AM

#

loud crag fails what

you don't get forward progress on metal

loud crag Sep 15, 2024, 12:40 AM

#

what’s that

fiery bolt Sep 15, 2024, 12:40 AM

#

it'll just keep spinning

fiery bolt Sep 15, 2024, 12:41 AM

#

loud crag what’s that

guarantees that workgroups will get switched out eventually even if they're not waiting for a mem access

loud crag Sep 15, 2024, 12:41 AM

#

primal shadow Naga is already broken on metal when it comes to atomic<u64> anyways so idc

pretty sure 64-bit atomics exist

fiery bolt Sep 15, 2024, 12:41 AM

#

no API actually gives that to you

primal shadow Sep 15, 2024, 12:41 AM

#

loud crag pretty sure 64-bit atomics exist

They do, but naga's MSL backend is bugged on them

loud crag Sep 15, 2024, 12:41 AM

#

fiery bolt guarantees that workgroups will get switched out eventually even if they're not ...

why the hell would you want that

fiery bolt Sep 15, 2024, 12:41 AM

#

but it mostly works on nvidia and amd

fiery bolt Sep 15, 2024, 12:41 AM

#

loud crag why the hell would you want that

spinlocks

loud crag Sep 15, 2024, 12:42 AM

#

primal shadow They do, but naga's MSL backend is bugged on them

skill issue use a better api

primal shadow Sep 15, 2024, 12:42 AM

#

loud crag why the hell would you want that

https://arxiv.org/pdf/2109.06132

fiery bolt Sep 15, 2024, 12:42 AM

#

fiery bolt no API actually gives that to you

so nanite doesn't use them on PC

#

dependent dispatches work well enough™️

#

I might do persistent queues after streaming

#

maybe

wicked notch Sep 15, 2024, 12:44 AM

#

I'm pretty sure cuda does define forward progress guarantees

#

so that's why it mostly works on NV

fiery bolt Sep 15, 2024, 12:44 AM

#

amd too

#

idk about intel

wicked notch Sep 15, 2024, 12:44 AM

#

PT workloads and dynamic parallelism is pretty big in cuda land

#

intel who?

loud crag Sep 15, 2024, 12:44 AM

#

primal shadow https://arxiv.org/pdf/2109.06132

ahhh i understand

fiery bolt Sep 15, 2024, 12:44 AM

#

wicked notch intel who?

those CPUs that crash a lot

#

should I do the whole fixed size page thing or half-ass dynamic allocation thonk

wicked notch Sep 15, 2024, 1:01 AM

#

fixed page all the way

#

why would you even want dynamic alloc

#

just do tlsf on gpu

#

it's just a couple of fls and ffs
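For context on the "fls and ffs" remark, here is a rough sketch of the two bit tricks a TLSF-style allocator is built on, written as plain CPU Rust. `SL_LOG2`, `mapping` and `find_first_level` are illustrative names and choices, not taken from any allocator mentioned in this chat.

```rust
/// Index of the highest set bit ("find last set"), or None for zero.
pub fn fls(x: u32) -> Option<u32> {
    if x == 0 { None } else { Some(31 - x.leading_zeros()) }
}

/// Index of the lowest set bit ("find first set"), or None for zero.
pub fn ffs(x: u32) -> Option<u32> {
    if x == 0 { None } else { Some(x.trailing_zeros()) }
}

/// log2 of the number of second-level bins per size class (assumed value).
pub const SL_LOG2: u32 = 4;

/// Maps a block size to (first-level, second-level) bin indices.
/// Assumes `size >= 1 << SL_LOG2`, so the shift amount below cannot underflow.
pub fn mapping(size: u32) -> (u32, u32) {
    let fl = fls(size).expect("size must be non-zero");
    // The SL_LOG2 bits right below the top bit pick the second-level bin.
    let sl = (size >> (fl - SL_LOG2)) & ((1 << SL_LOG2) - 1);
    (fl, sl)
}

/// Finds the smallest non-empty first-level class at or above `fl`,
/// given a bitmap with one bit per class that has free blocks.
pub fn find_first_level(fl_bitmap: u32, fl: u32) -> Option<u32> {
    ffs(fl_bitmap & (u32::MAX << fl))
}
```

On a GPU the same operations would map to the shading language's find-MSB / find-LSB intrinsics; the free-list updates around them are where the lock mentioned next would come in.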

#

you might need a lock tho

#

actually yeah no binning is bad KEKW

#

do standard gpu malloc

fiery bolt Sep 15, 2024, 1:04 AM

#

wicked notch why would you even want dynamic alloc

because I don't have to do the goofy group part nonsense

primal shadow Sep 15, 2024, 1:09 AM

#

@languid vector I have some more questions on the global mesh compression when you get a chance

1. I don't get the idea behind the function you sent me a bit ago to calculate the bitrate/step_size for the global/mesh quantization grid. Should this not be a fixed value used for all of your meshes? Also, what's even the point of the global quantization, since you store your meshlet centers in a full vec3<f32> anyways, no?
2. Given that each meshlet stores a bitstream of vertex positions, for meshlet X with triangle index Y, how do you read the vertex position data? With fixed-size positions, i.e. one vec4<f32> per vertex, I can just store one u32 per meshlet pointing to the start of the meshlet's vertices in the large array of vertices, and each triangle index can be a single u8 pointing to an offset off of that starting position. But I'm not sure how to structure things with a bitstream.
3. When quantizing, I'm not entirely sure how to handle signedness (i.e. negative or positive). For meshlets, I guess I could map -radius..radius to 0..diameter, and then do ceil2(log2(diameter)) to determine the bitrate. For meshes/global grid, I guess I just store the meshlet centers as a full vec3<f32> still? (but quantized to the grid instead of absolute coordinates)
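A small sketch of the signedness idea in the last question, assuming positions are already integers in grid steps: bias by the radius so everything is non-negative, then count bits. One detail worth noting is that storing every integer in 0..=d takes ceil(log2(d + 1)) bits, which `bits_needed` computes via leading zeros. All names here are hypothetical.

```rust
/// Number of bits needed to store every integer in 0..=max_value.
pub fn bits_needed(max_value: u32) -> u32 {
    u32::BITS - max_value.leading_zeros()
}

/// Maps a signed offset in -radius..=radius to 0..=2*radius.
pub fn bias_to_unsigned(offset: i32, radius: u32) -> u32 {
    (offset as i64 + radius as i64) as u32
}

/// Bits per channel for a meshlet whose offsets span -radius..=radius.
/// Assumes `radius` is small enough that `2 * radius` does not overflow.
pub fn bits_for_radius(radius: u32) -> u32 {
    bits_needed(2 * radius)
}
```

For example, a radius of 100 grid steps gives a diameter of 200, which fits in 8 bits per channel.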

wicked notch Sep 15, 2024, 1:10 AM

#

fiery bolt because I don't have to do the goofy group part nonsense

but goofy groupy thingies are cool

fiery bolt Sep 15, 2024, 1:11 AM

#

wicked notch but goofy groupy thingies are cool

how do you check if all parts of a group are loaded agonyfrog

#

without Yet More Indirection

wicked notch Sep 15, 2024, 1:11 AM

#

literally who cares

#

you're already bw limited

#

one more buffer

#

and maybe another one after that

fiery bolt Sep 15, 2024, 1:12 AM

#

no but then I have to deal with those buffers

wicked notch Sep 15, 2024, 1:12 AM

#

wym deal

#

alloc the buffer done

fiery bolt Sep 15, 2024, 1:13 AM

#

dynamic gpu malloc

wicked notch Sep 15, 2024, 1:13 AM

#

ye

#

gabe has an impl of gpu malloc

#

you can go take inspiration

fiery bolt Sep 15, 2024, 1:13 AM

#

froge_love

#

gonna take a lot of inspiration

wicked notch Sep 15, 2024, 1:13 AM

#

me when I commit theft

ebon ruin Sep 15, 2024, 1:14 AM

#

wicked notch and maybe another one after that

The Khronos group when creating the Vulkan specification thought about you specifically when creating descriptor sets

fiery bolt Sep 15, 2024, 1:14 AM

#

oh yeah it's also more work when building

wicked notch Sep 15, 2024, 1:14 AM

#

https://tenor.com/view/gato-gato-lamiendo-gato-paleta-lamiendo-gif-15858928605675053640


fiery bolt Sep 15, 2024, 1:14 AM

#

ebon ruin The Khronos group when creating the Vulkan specification thought about you speci...

what the fuck is a descriptor set

wicked notch Sep 15, 2024, 1:14 AM

#

ebon ruin The Khronos group when creating the Vulkan specification thought about you speci...

what's a descriptor set

ebon ruin Sep 15, 2024, 1:14 AM

#

fiery bolt what the fuck is a descriptor set

watch your language

wicked notch Sep 15, 2024, 1:14 AM

#

never heard of er

fiery bolt Sep 15, 2024, 1:15 AM

#

me neither

wicked notch Sep 15, 2024, 1:15 AM

#

do you mean pointers maybe

ebon ruin Sep 15, 2024, 1:15 AM

#

I thought you were a vulkan man

wicked notch Sep 15, 2024, 1:15 AM

#

I am yes

fiery bolt Sep 15, 2024, 1:15 AM

#

I just put a Tex2D<f32> in my push constants

wicked notch Sep 15, 2024, 1:15 AM

#

I am a vulkan 1.3 man

ebon ruin Sep 15, 2024, 1:15 AM

#

are you pulling my leg

fiery bolt Sep 15, 2024, 1:15 AM

#

and it just works

wicked notch Sep 15, 2024, 1:16 AM

#

ebon ruin are you pulling my leg

yes we all use bindless and pointers here

ebon ruin Sep 15, 2024, 1:16 AM

#

thats so hot

#

i wish i learned that

wicked notch Sep 15, 2024, 1:16 AM

#

descriptor sets are a bad dream that don't exist anymore

ebon ruin Sep 15, 2024, 1:16 AM

#

now im not so scared of bulkan

#

thanks guys

fiery bolt Sep 15, 2024, 1:16 AM

#

we have vulkaned someone else froge_love

wicked notch Sep 15, 2024, 1:17 AM

#

fiery bolt we have vulkaned someone else froge_love

an honest day's work

#

we need more fresh blood for john khronos 🙏

ebon ruin Sep 15, 2024, 1:17 AM

#

not yet

fiery bolt Sep 15, 2024, 1:17 AM

#

ebon ruin not yet

sorry you cannot resist

ebon ruin Sep 15, 2024, 1:17 AM

#

i am a GL guy

wicked notch Sep 15, 2024, 1:17 AM

#

not for long

ebon ruin Sep 15, 2024, 1:18 AM

#

yes for long

#

vulkan offers me nothing

fiery bolt Sep 15, 2024, 1:18 AM

#

pointers

wicked notch Sep 15, 2024, 1:18 AM

#

pointers

fiery bolt Sep 15, 2024, 1:18 AM

#

lol

ebon ruin Sep 15, 2024, 1:18 AM

#

useless

fiery bolt Sep 15, 2024, 1:18 AM

#

no

wicked notch Sep 15, 2024, 1:18 AM

#

fiery bolt pointers

https://tenor.com/view/clown-clowntoclown-conversation-telepathy-clowns-gif-25167584


#

me 🤝 you

ebon ruin Sep 15, 2024, 1:18 AM

#

meaningless

fiery bolt Sep 15, 2024, 1:18 AM

#

nanite makes you a real clown

ebon ruin Sep 15, 2024, 1:18 AM

#

its all meaningless

wicked notch Sep 15, 2024, 1:18 AM

#

fiery bolt nanite makes you a real clown

true and real

fiery bolt Sep 15, 2024, 1:19 AM

#

ebon ruin its all meaningless

ok time for you to impl nanite

wicked notch Sep 15, 2024, 1:19 AM

#

watch out for edge cases

ebon ruin Sep 15, 2024, 1:19 AM

#

the only thing desirable is hw rt but everytime i mention it L says “No.”

ebon ruin Sep 15, 2024, 1:19 AM

#

fiery bolt ok time for you to impl nanite

I did on Scratch

fiery bolt Sep 15, 2024, 1:19 AM

#

ok now make it fast

ebon ruin Sep 15, 2024, 1:19 AM

#

no

wicked notch Sep 15, 2024, 1:19 AM

#

you did nanite on scratch but you're scared of vulkan

#

that doesn't compute

fiery bolt Sep 15, 2024, 1:20 AM

#

28 trillion trongles in 1.5ms

ebon ruin Sep 15, 2024, 1:20 AM

#

i’m not scared of vulkan its just useless to me

wicked notch Sep 15, 2024, 1:20 AM

#

it maybe is

#

but pointers

fiery bolt Sep 15, 2024, 1:20 AM

#

hit that and we'll let you not use vulkan

ebon ruin Sep 15, 2024, 1:20 AM

#

pointing to what?

wicked notch Sep 15, 2024, 1:20 AM

#

memory

ebon ruin Sep 15, 2024, 1:20 AM

#

I have not even implemented PBR yet

#

and by the looks of it

fiery bolt Sep 15, 2024, 1:20 AM

#

if you can't do 25 trillion triangles in 1.5ms you have to use vulkan

ebon ruin Sep 15, 2024, 1:20 AM

#

neither have you

fiery bolt Sep 15, 2024, 1:20 AM

#

exactly!

#

I even deleted my brdf.hlsl when porting to slang

wicked notch Sep 15, 2024, 1:21 AM

#

rendering equation?

#

do you mean triangle rasterization equations?

fiery bolt Sep 15, 2024, 1:21 AM

#

wicked notch rendering equation?

is that the error projection equation

#

or edge equations

wicked notch Sep 15, 2024, 1:22 AM

#

quadric error metrics maybe perhaps

fiery bolt Sep 15, 2024, 1:22 AM

#

my error projection makes no sense

ebon ruin Sep 15, 2024, 1:22 AM

#

wicked notch memory

ill stick with IDs thanks

wicked notch Sep 15, 2024, 1:22 AM

#

fiery bolt my error projection makes no sense

real

fiery bolt Sep 15, 2024, 1:22 AM

#

I just did the screen space projection thing and then multiplied two things

wicked notch Sep 15, 2024, 1:22 AM

#

I literally just mul the radius with the projection

#

and scale it

fiery bolt Sep 15, 2024, 1:22 AM

#

because just using t led to holes

wicked notch Sep 15, 2024, 1:22 AM

#

best error function ever

ebon ruin Sep 15, 2024, 1:22 AM

#

this is like the dark wizards trying to convince me to use dark magic

fiery bolt Sep 15, 2024, 1:23 AM

#

yes

#

and you know you want it

#

you need it

wicked notch Sep 15, 2024, 1:23 AM

#

show him

ebon ruin Sep 15, 2024, 1:23 AM

#

https://tenor.com/view/guy-unfunny-dent-head-gif-20625569


wicked notch Sep 15, 2024, 1:23 AM

#

the slang push const

fiery bolt Sep 15, 2024, 1:23 AM

#

wicked notch Sep 15, 2024, 1:23 AM

#

give him a taste of slang's ultimate power when combined with vk13

ebon ruin Sep 15, 2024, 1:23 AM

#

https://tenor.com/view/drake-drake-computer-kill-your-self-gif-25432024


wicked notch Sep 15, 2024, 1:24 AM

#

(don't tell him about the driver bugs and invalid spirv and out of spec optimizations)

fiery bolt Sep 15, 2024, 1:24 AM

#

fuck nvidia's driver

wicked notch Sep 15, 2024, 1:24 AM

#

all my homies hate nv

ebon ruin Sep 15, 2024, 1:24 AM

#

me too

wicked notch Sep 15, 2024, 1:24 AM

#

I've found like 3 nvidia bugs when using slang

#

it's crazy

ebon ruin Sep 15, 2024, 1:24 AM

#

i shouldve gone with rx

fiery bolt Sep 15, 2024, 1:24 AM

#

fucking useless thing crashes with a misaligned addr if i use groupshared mem in my software rasterizer

wicked notch Sep 15, 2024, 1:24 AM

#

I have zero idea how they say slang is "production ready"

fiery bolt Sep 15, 2024, 1:25 AM

#

because slang is

#

their driver isn't

wicked notch Sep 15, 2024, 1:25 AM

#

actually true

#

garbage driver

fiery bolt Sep 15, 2024, 1:25 AM

#

i've looked through the entire spirv slang shits out

wicked notch Sep 15, 2024, 1:25 AM

#

"release" "driver"

fiery bolt Sep 15, 2024, 1:25 AM

#

it's perfectly fine

fiery bolt Sep 15, 2024, 1:25 AM

#

fiery bolt fucking useless thing crashes with a misaligned addr if i use groupshared mem in...

so i wrote it in hlsl

#

works now

#

now i have three definitions of my gpu scene structs bleaker_kekw

wicked notch Sep 15, 2024, 1:26 AM

#

fiery bolt so i wrote it in hlsl

what's the diff

#

how is dxc's output any different

fiery bolt Sep 15, 2024, 1:26 AM

#

the only thing i can see is that dxc uses unsigned array lengths

#

slang uses signed

wicked notch Sep 15, 2024, 1:26 AM

#

bruh

fiery bolt Sep 15, 2024, 1:27 AM

#

this was in a minimized diff that just wrote zeroes to the arr

#

still crashed with slang so i assume that's the issue

wicked notch Sep 15, 2024, 1:27 AM

#

nv driver btw

#

quality software™️

fiery bolt Sep 15, 2024, 1:28 AM

#

however, using groupshared arrays in my stolen port of SPD works

#

https://tenor.com/bwFVL.gif


wicked notch Sep 15, 2024, 1:28 AM

#

ye that probably just "happens" to work

#

idk

fiery bolt Sep 15, 2024, 1:28 AM

#

and using groupshared arrays in my mesh shader also works

wicked notch Sep 15, 2024, 1:28 AM

#

the whole thing is ub

fiery bolt Sep 15, 2024, 1:28 AM

#

fiery bolt and using groupshared arrays in my mesh shader also works

but atomics don't!

#

slang default InterlockedAdd uses device scope for everything so i wrote my own spirv asm thingy for VMM (and all barriers etc)

#

it still died

#

do you think novideo will hire me if i tell them i'll fix their shit shader compiler

#

frog_dum

#

oh yeah VMM make available | make visible doesn't work either for some reason so i had to split my dispatches

wicked notch Sep 15, 2024, 1:52 AM

#

seek god

#

actually seek grass

#

then god

fiery bolt Sep 15, 2024, 2:38 AM

#

wicked notch actually seek grass

what is that

faint crane Sep 15, 2024, 2:52 AM

#

Haven't implemented that yet.

primal shadow Sep 15, 2024, 7:50 AM

#

primal shadow @languid vector I have some more questions on the global mesh compression ...

I guess to answer my own question for 2., each meshlet can store the bit offset where its vertex positions start within the bitstream, as well as its quantization factor. The quantization factor can then be used to calculate how many bits each vertex position uses, which is fixed per meshlet, giving you random access within the meshlet.
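A rough CPU-side sketch of that random access, assuming three channels per vertex that all use the same bit width and an LSB-first bit order. Both are assumptions made for the example, and none of these function names come from bevy.

```rust
/// Reads `bit_count` (at most 32) bits starting at absolute bit `bit_offset`, LSB first.
pub fn read_bits(stream: &[u8], bit_offset: usize, bit_count: u32) -> u32 {
    let mut value = 0u32;
    for i in 0..bit_count as usize {
        let bit = bit_offset + i;
        let b = (stream[bit / 8] >> (bit % 8)) & 1;
        value |= (b as u32) << i;
    }
    value
}

/// Appends the low `bit_count` bits of `value` to the stream, LSB first.
/// `bit_len` tracks how many bits have been written so far.
pub fn write_bits(stream: &mut Vec<u8>, bit_len: &mut usize, value: u32, bit_count: u32) {
    for i in 0..bit_count {
        if *bit_len % 8 == 0 {
            stream.push(0);
        }
        let b = ((value >> i) & 1) as u8;
        let last = stream.len() - 1;
        stream[last] |= b << (*bit_len % 8);
        *bit_len += 1;
    }
}

/// Bit offset of vertex `vertex_id` inside a meshlet that starts at `start_bit`.
/// Every vertex takes exactly 3 * bits_per_channel bits, so this is one multiply-add.
pub fn vertex_bit_offset(start_bit: usize, bits_per_channel: u32, vertex_id: u8) -> usize {
    start_bit + vertex_id as usize * 3 * bits_per_channel as usize
}

/// Reads the three quantized channels of one vertex without touching the others.
pub fn read_vertex(stream: &[u8], start_bit: usize, bits_per_channel: u32, vertex_id: u8) -> [u32; 3] {
    let base = vertex_bit_offset(start_bit, bits_per_channel, vertex_id);
    let w = bits_per_channel as usize;
    [
        read_bits(stream, base, bits_per_channel),
        read_bits(stream, base + w, bits_per_channel),
        read_bits(stream, base + 2 * w, bits_per_channel),
    ]
}
```

The triangle indices can stay as u8 offsets local to the meshlet, just like in the fixed-size layout; only the address computation changes.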

buoyant summit Sep 15, 2024, 2:33 PM

#

@wicked notch in cuda and the likes you use a special dispatch for forward progress

#

and the api requires that your dispatch size is under some limit

#

for forward progress guarantees to hold

#

and the limit varies device to device

#

and yes that would be useful in vk no less, I'm just saying it's a bit tricky to use

#

it's not super straightforward

wicked notch Sep 15, 2024, 2:34 PM

#

ye it's flimsy

wispy spear Sep 15, 2024, 4:12 PM

#

@primal shadow did you do the thing?

#

#

i think it was about a/the PR

primal shadow Sep 15, 2024, 5:00 PM

#

wispy spear @primal shadow did you do the thing?

Yeah yesterday

primal shadow Sep 15, 2024, 6:47 PM

#

pub struct MeshletMesh {
    /// Bitstream-packed vertex positions.
    pub vertex_positions: Arc<[u8]>,
    /// Octahedral compressed normals and uncompressed texture coordinates for vertices.
    pub vertex_attributes: Arc<[u8]>,
    /// Triangle indices for meshlets.
    pub indices: Arc<[u8]>,
    /// The list of meshlets making up this mesh.
    pub meshlets: Arc<[Meshlet]>,
    /// Spherical bounding volumes.
    pub bounding_spheres: Arc<[MeshletBoundingSpheres]>,
}

/// A single meshlet within a [`MeshletMesh`].
#[repr(C)]
pub struct Meshlet {
    /// The bit offset within the parent mesh's [`MeshletMesh::vertex_positions`] buffer where the vertex positions for this meshlet begin.
    pub start_vertex_position_bit: u32,
    /// The offset within the parent mesh's [`MeshletMesh::vertex_attributes`] buffer where the vertex attributes for this meshlet begin.
    pub start_vertex_attribute_id: u32,
    /// The offset within the parent mesh's [`MeshletMesh::indices`] buffer where the indices for this meshlet begin.
    pub start_index_id: u32,
    /// The amount of vertices in this meshlet.
    pub vertex_count: u8,
    /// The amount of triangles in this meshlet.
    pub triangle_count: u8,
    /// Number of bits used to quantize vertex positions within this meshlet.
    pub bits_per_vertex_position: u8,
    /// Unused. (TODO: Get rid of this in the disk representation?)
    pub padding: u8,
}

Ok, got this so far.

wicked notch Sep 15, 2024, 7:00 PM

#

why is everything Arc nervous

loud crag Sep 15, 2024, 7:01 PM

#

what's Arc anyway

#

I only know it from objc's Automatic Reference Counting

frank sail Sep 15, 2024, 7:05 PM

#

I thought it was atomic ref counted

#

Idk what the ref countedness means in the context of a single uint

wicked notch Sep 15, 2024, 7:12 PM

#

it's just shared_ptr

primal shadow Sep 15, 2024, 7:14 PM

#

wicked notch why is everything Arc nervous

Reasons internal to how bevy's renderer works to avoid copying the data across threads

wicked notch Sep 15, 2024, 7:14 PM

#

damn

#

can't you just make MeshletMesh arc?

primal shadow Sep 15, 2024, 7:20 PM

#

wicked notch can't you just make MeshletMesh arc?

Yeah, but I also need some sort of smart pointer inside anyways to store unbounded arrays [u8]

#

It was either Arc (shared_ptr) or Box (unique_ptr)
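To illustrate the trade-off with standard library types only: cloning an `Arc<[u8]>` bumps a reference count and shares the bytes, so a handle can move to another thread without copying the buffer, whereas cloning a `Box<[u8]>` would duplicate it. `Blob` is a hypothetical stand-in, not bevy's actual asset type.

```rust
use std::sync::Arc;

/// Hypothetical stand-in for an asset holding a large byte buffer.
#[derive(Clone)]
pub struct Blob {
    pub bytes: Arc<[u8]>,
}

/// Builds a blob from owned bytes.
pub fn make_blob(data: Vec<u8>) -> Blob {
    // Converting a Vec<u8> into Arc<[u8]> copies the bytes once
    // into a single reference-counted allocation.
    Blob { bytes: Arc::from(data) }
}

/// True if both blobs point at the same allocation (no copy was made).
pub fn shares_storage(a: &Blob, b: &Blob) -> bool {
    Arc::ptr_eq(&a.bytes, &b.bytes)
}
```

Wrapping the whole struct in one `Arc` would also work for sharing, but as noted above the unsized `[u8]` fields still need some owning pointer inside.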

fiery bolt Sep 15, 2024, 7:40 PM

#

or you can do a custom DST cutecatNE

primal shadow Sep 15, 2024, 7:41 PM

#

?

fiery bolt Sep 15, 2024, 7:43 PM

#

you can make your own DST with a header and an unsized tail of bytes bleakekw
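A minimal sketch of what such a custom DST could look like in safe Rust, using the unsizing coercion that works when the last field is generic. `WithHeader` and its fields are made-up names. As far as I know, building one from a length chosen at run time needs unsafe allocation code or a helper crate, which is part of why this is more hassle than `Arc<[u8]>`.

```rust
/// A header followed by a tail whose size is not known to the type system.
/// Generic over the tail so a sized value can be coerced to the unsized form.
#[repr(C)]
pub struct WithHeader<T: ?Sized> {
    pub vertex_count: u32,
    pub bytes: T,
}

/// The dynamically sized form: header and bytes live in one allocation.
pub type PackedBytes = WithHeader<[u8]>;

/// Builds the sized form, then lets the compiler coerce
/// Box<WithHeader<[u8; 4]>> into Box<WithHeader<[u8]>> at the return site.
pub fn make_packed() -> Box<PackedBytes> {
    let sized: Box<WithHeader<[u8; 4]>> = Box::new(WithHeader {
        vertex_count: 2,
        bytes: [1, 2, 3, 4],
    });
    sized
}
```

The resulting `Box<PackedBytes>` is a fat pointer carrying the tail length, so there is one allocation instead of a struct plus a separately allocated byte array.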

primal shadow Sep 15, 2024, 7:49 PM

#

Ehhh maybe some other time if it becomes an issue lol

primal shadow Sep 19, 2024, 6:57 AM

#

Compression continues to frustrate me greatly

#

I still have not seen good explanations for how half of it works

#

Mostly the purpose of the global grid

wide shadow Sep 19, 2024, 8:32 AM

#

primal shadow Mostly the purpose of the global grid

The global grid is there to avoid cracks in the model between different meshes. If every mesh would have its own grid then there would be cracks because each mesh aligned differently

faint crane Sep 19, 2024, 8:34 AM

#

https://x.com/BenSimsTech/status/1836630148986388641

Ben Sims (@BenSimsTech) on X

What are people's guidelines for minimum mesh vertex counts for instancing? I understand you want at least 64 to fill a threadgroup, but various quotes online seem to indicate a few hundred is better. (In this situation, it's an instanced quadtree where I can control vert count)

fiery bolt Sep 19, 2024, 11:48 AM

#

wide shadow The global grid is there to avoid cracks in the model between different meshes. ...

yeah this is required for kitbashing or any sort of modular design

primal shadow Sep 19, 2024, 3:04 PM

#

wide shadow The global grid is there to avoid cracks in the model between different meshes. ...

I don't understand why

#

Nor do I think that's right

#

I think nanite was saying you need the same grid to avoid cracks between objects you would get from different grids

#

But unclear why they have one in the first place

glass sphinx Sep 19, 2024, 4:11 PM

#

primal shadow I think nanite was saying you need the same grid to avoid cracks between objects...

but isnt that what lukasino said?

primal shadow Sep 19, 2024, 4:25 PM

#

Right, but it's missing some context

#

Yes if you do different grid sizes per mesh there would be issues

#

But what's the purpose of the grid in the first place?

#

Like you could also solve the same problem by... Not quantizing anything to a grid

glass sphinx Sep 19, 2024, 4:41 PM

#

how would you quantize it

fiery bolt Sep 19, 2024, 5:00 PM

#

without quantizing you need the full 12 bytes for position

#

and it won't byte-compress very well I think

primal shadow Sep 19, 2024, 5:15 PM

#

But you can compress with the per-cluster encoding, which afaik(?) is lossless(?)

#

So it'll compress to a more compact bitstream regardless

#

I think the global quantizing step beforehand is just to reduce the precision, in order to need less bits for the second step? Idk for sure

#

And then it makes a bunch of other things more complicated

#

There's two steps. The quantization with a fixed step size for all meshes, and then encoding vertices per-cluster

#

I think the purpose of the first is to reduce excess precision (so less bits are needed), and the second is just a more compact encoding of the same data
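The two steps described above could look roughly like this for a single channel. This is a sketch under assumptions: the grid step is taken to be 2^-quantization_bits and the per-cluster encoding is a plain min-plus-offset scheme; it is not Nanite's or bevy's actual code. In this sketch step 1 is where precision is dropped, and step 2 round-trips the quantized integers exactly.

```rust
/// Step 1: snap a coordinate to a grid with step 2^-quantization_bits.
/// The same step is meant to be used for every mesh.
pub fn quantize(coord: f32, quantization_bits: u8) -> i32 {
    let scale = (1u32 << quantization_bits) as f32;
    (coord * scale).round() as i32
}

/// Step 2: per-cluster encoding of one channel as (min, offsets from min, bits per offset).
pub fn encode_channel(quantized: &[i32]) -> (i32, Vec<u32>, u32) {
    let min = quantized.iter().copied().min().unwrap_or(0);
    let offsets: Vec<u32> = quantized.iter().map(|&q| (q - min) as u32).collect();
    let max_offset = offsets.iter().copied().max().unwrap_or(0);
    // Bits needed to store every value in 0..=max_offset.
    let bits = u32::BITS - max_offset.leading_zeros();
    (min, offsets, bits)
}

/// Inverse of both steps for one value.
pub fn decode(min: i32, offset: u32, quantization_bits: u8) -> f32 {
    (min + offset as i32) as f32 / (1u32 << quantization_bits) as f32
}
```

With 4 quantization bits, the coordinates 1.25, 1.5 and 2.0 become 20, 24 and 32, which encode as min 20 plus offsets 0, 4 and 12 at 4 bits each instead of 32 bits per float.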

glass sphinx Sep 19, 2024, 5:32 PM

#

if your compression doesnt yield the exact same vertex positions for border vertices you get cracks

#

i guess the world grid helps keep the quantization consistent

#

i dont think the quantization is lossless
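To make the crack argument concrete, the sketch below compares a made-up "local" scheme, where each meshlet quantizes across its own bounding range, against snapping to one shared grid. The local scheme is a strawman invented for the comparison, not something anyone here is using.

```rust
/// Local scheme (strawman): quantize to 2^bits - 1 steps across the meshlet's own [lo, hi] range,
/// then reconstruct. The result depends on which meshlet the vertex is stored in.
pub fn roundtrip_local(coord: f32, lo: f32, hi: f32, bits: u32) -> f32 {
    let levels = ((1u32 << bits) - 1) as f32;
    let q = ((coord - lo) / (hi - lo) * levels).round();
    lo + q / levels * (hi - lo)
}

/// Shared scheme: snap to a grid with step 2^-bits that every meshlet uses.
/// The result depends only on the coordinate itself.
pub fn roundtrip_shared(coord: f32, bits: u32) -> f32 {
    let scale = (1u32 << bits) as f32;
    (coord * scale).round() / scale
}
```

Take a border vertex at x = 3.3 shared by one meshlet spanning 0..5 and another spanning 3..10, with 4 bits: the local scheme reconstructs it at roughly 3.33 in one and 3.47 in the other, which is a visible gap, while the shared grid gives 3.3125 in both.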

wide shadow Sep 19, 2024, 5:45 PM

#

it isnt going to be the easiest thing to do

#

also encountered slang bug...

wispy spear Sep 19, 2024, 5:49 PM

#

looks like caldera hotel

wicked notch Sep 19, 2024, 5:51 PM

#

wide shadow it isnt going to be the easiest thing to do

I'm so cooked

#

why do I recognize that this is bistro

wide shadow Sep 19, 2024, 5:52 PM

#

its bistro + sponza 😉

#

shoot its very lossless...

wispy spear Sep 19, 2024, 5:55 PM

#

oh hehe

wicked notch Sep 19, 2024, 6:00 PM

#

wide shadow shoot its very lossless...

no loss of data

#

ship it

#

KEKW

wheat haven Sep 19, 2024, 6:09 PM

#

wicked notch why do I recognize that this is bistro

I did too, don't feel bad 😄

primal shadow Sep 19, 2024, 6:39 PM

#

glass sphinx i guess the world grid helps keep the quantization consistent

Why though? How does that work? I haven't figured out why it helps

fiery bolt Sep 19, 2024, 11:22 PM

#

primal shadow Why though? How does that work? I haven't figured out why it helps

because within a cluster you may just happen to have vertices at 0, 2, 4, 6, 8.0000000901, 10

wispy spear Sep 22, 2024, 8:27 PM

#

@wicked notch i might have posted it somewhere before, im not sure, but it mentions mesh decimation/remeshing etc not sure if its useful or was useful as an alternative or better working meshlet thingy https://github.com/pmp-library/pmp-library but ill link it anyway

GitHub

GitHub - pmp-library/pmp-library: The Polygon Mesh Processing Library

The Polygon Mesh Processing Library. Contribute to pmp-library/pmp-library development by creating an account on GitHub.

wicked notch Sep 22, 2024, 8:37 PM

#

very nice, I'll check it out in a bit

primal shadow Sep 23, 2024, 9:58 PM

#

Interesting issue I've found, shadows look too bad atm in bevy for meshlets

#

It's choosing too low of a lod I'm guessing, and the error is very visible when viewed by the main camera

#

I probably need to add a lod bias to shadow views
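One way such a bias could be wired in, as a hedged sketch: project the cluster's simplification error to pixels and scale it by a per-view factor before comparing against the threshold. The formula assumes a perspective view (an orthographic shadow view would not divide by distance), and `lod_bias`, `threshold_pixels` and both function names are hypothetical rather than bevy's API.

```rust
/// Approximate on-screen size in pixels of a world-space error at a given view distance,
/// for a perspective projection with vertical field of view `fov_y` (radians).
pub fn projected_error_pixels(error: f32, distance: f32, fov_y: f32, screen_height: f32) -> f32 {
    let proj_scale = screen_height / (2.0 * (fov_y * 0.5).tan());
    error * proj_scale / distance.max(f32::EPSILON)
}

/// True if a cluster's error is small enough to draw it at this LOD.
/// `lod_bias` above 1.0 inflates the error, so the view picks more detailed LODs;
/// below 1.0 it accepts coarser ones.
pub fn lod_is_acceptable(
    error: f32,
    distance: f32,
    fov_y: f32,
    screen_height: f32,
    threshold_pixels: f32,
    lod_bias: f32,
) -> bool {
    projected_error_pixels(error, distance, fov_y, screen_height) * lod_bias <= threshold_pixels
}
```

With a 90 degree field of view at 1080 pixels tall, an error of 0.01 at distance 10 projects to about 0.54 pixels: acceptable at a 1 pixel threshold with a bias of 1.0, rejected with a bias of 4.0.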

wicked notch Sep 23, 2024, 9:59 PM

#

Unreal fixes this using SSS btw

#

they mentioned that in the original talk

primal shadow Sep 23, 2024, 9:59 PM

#

Ironically nanite does the same, but they bias towards a less accurate lod, as VSM is so high res anyways it's never a problem

#

Mhmm also true

wicked notch Sep 23, 2024, 10:00 PM

#

VSM is too powerful

primal shadow Sep 23, 2024, 10:03 PM

#

Nah ray tracing is

#

VSM is ok

wicked notch Sep 23, 2024, 10:03 PM

#

fake

#

blatant RT propaganda

#

try again

primal shadow Sep 23, 2024, 10:05 PM

#

Rasterization is a hack and does not work for non-primary views

#

And I won't pretend otherwise

frank sail Sep 23, 2024, 10:06 PM

#

Banned

wicked notch Sep 23, 2024, 10:06 PM

#

banned

finite yacht Sep 23, 2024, 10:10 PM

#

Promoted to Admin

frank sail Sep 23, 2024, 10:15 PM

#

not you too

loud crag Sep 23, 2024, 10:25 PM

#

primal shadow Rasterization is a hack and does not work for non-primary views

writing a raster renderer just feels like strapping hacks upon hacks together

fiery bolt Sep 24, 2024, 12:21 AM

#

don't waste your 0.25 rays per pixel on shadows, use them for GI and real specular smh

primal shadow Sep 25, 2024, 5:29 AM

#

Ok after much research, I finally understand nanite's vertex quantization now

primal shadow Sep 25, 2024, 6:03 AM

#

/// A single meshlet within a [`MeshletMesh`].
#[derive(Copy, Clone, Pod, Zeroable)]
#[repr(C)]
pub struct Meshlet {
    /// The bit offset within the parent mesh's [`MeshletMesh::vertex_positions`] buffer where the vertex positions for this meshlet begin.
    pub start_vertex_position_bit: u32,
    /// The offset within the parent mesh's [`MeshletMesh::vertex_normals`] and [`MeshletMesh::vertex_uvs`] buffers
    /// where non-position vertex attributes for this meshlet begin.
    pub start_vertex_attribute_id: u32,
    /// The offset within the parent mesh's [`MeshletMesh::indices`] buffer where the indices for this meshlet begin.
    pub start_index_id: u32,
    /// The amount of vertices in this meshlet.
    pub vertex_count: u8,
    /// The amount of triangles in this meshlet.
    pub triangle_count: u8,
    /// Number of bits used to quantize vertex positions within this meshlet.
    pub quantization_bits: u8,
    /// Number of bits used to store the X channel of vertex positions within this meshlet.
    pub bits_per_vertex_position_channel_x: u8,
    /// Number of bits used to store the Y channel of vertex positions within this meshlet.
    pub bits_per_vertex_position_channel_y: u8,
    /// Number of bits used to store the Z channel of vertex positions within this meshlet.
    pub bits_per_vertex_position_channel_z: u8,
    /// Unused. (TODO: Get rid of this in the disk representation?)
    pub padding: u16,
    /// Minimum quantized X channel value of vertex positions within this meshlet.
    pub min_vertex_position_channel_x: f32,
    /// Minimum quantized Y channel value of vertex positions within this meshlet.
    pub min_vertex_position_channel_y: f32,
    /// Minimum quantized Z channel value of vertex positions within this meshlet.
    pub min_vertex_position_channel_z: f32,
}

#

The perfect 256 bits of metadata
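The 256-bit figure can be checked with a compile-time assertion. The sketch below copies the field layout from the struct above but drops the `Pod` and `Zeroable` derives so it builds with the standard library alone; `MESHLET_BITS` is a name made up for the example.

```rust
use std::mem::size_of;

/// Same field layout as the `Meshlet` struct above.
#[derive(Copy, Clone)]
#[repr(C)]
pub struct Meshlet {
    pub start_vertex_position_bit: u32,
    pub start_vertex_attribute_id: u32,
    pub start_index_id: u32,
    pub vertex_count: u8,
    pub triangle_count: u8,
    pub quantization_bits: u8,
    pub bits_per_vertex_position_channel_x: u8,
    pub bits_per_vertex_position_channel_y: u8,
    pub bits_per_vertex_position_channel_z: u8,
    pub padding: u16,
    pub min_vertex_position_channel_x: f32,
    pub min_vertex_position_channel_y: f32,
    pub min_vertex_position_channel_z: f32,
}

/// Size of one meshlet record in bits.
pub const MESHLET_BITS: usize = size_of::<Meshlet>() * 8;

// Fails the build if a field change ever breaks the 32-byte layout:
// 3 * 4 bytes + 6 * 1 byte + 2 bytes + 3 * 4 bytes = 32 bytes.
const _: () = assert!(size_of::<Meshlet>() == 32);
```

Because the explicit `padding: u16` lands on a 2-byte boundary and the floats on a 4-byte one, `repr(C)` adds no hidden padding here.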

wide shadow Sep 25, 2024, 8:44 AM

#

primal shadow ```rust /// A single meshlet within a [`MeshletMesh`]. #[derive(Copy, Clone, Pod...

Do you have duplicated vertex positions for each meshlet? Meaning each vertex had its own "vertices"

ebon ruin Sep 25, 2024, 3:51 PM

#

i am uh

#

making Nanite

#

on Scratch

fiery bolt Sep 25, 2024, 4:02 PM

#

good

primal shadow Sep 25, 2024, 5:12 PM

#

wide shadow Do you have duplicated vertex positions for each meshlet? Meaning each vertex ha...

I assume you meant each meshlet has its own set of vertices, and yes I do. I was skeptical at first, but it allows streaming and much better compression, and vertex data memory usage is a large bottleneck.

wide shadow Sep 25, 2024, 5:26 PM

#

primal shadow I assume you meant each meshlet has its own set of vertices, and yes I do. I wa...

Yes, thats what I meant. I am just checking 😄

primal shadow Sep 26, 2024, 3:17 AM

#

Update: It's been taking a while due to learning and then being sick, but my fever finally broke and I finished all the CPU-side changes for compressed per-meshlet vertices.

#

Just need to figure out how to do the GPU bitstream reader

#

I am also using a fixed quantization factor per mesh rn, nanite has an "auto" mode that chooses the best one, I'll have to figure out how they did that.

fiery bolt Sep 26, 2024, 3:37 AM

#

don't all meshes need to have the same quantization factor

#

because it would lead to cracks otherwise

primal shadow Sep 26, 2024, 3:56 AM

#

Maybe, idk. The original nanite presentation said it's user-selectable and has to be the same for different meshes, but unreal has an "auto" option.

#

Choose the precision this mesh should use when generating the Nanite mesh. Auto determines the appropriate precision based on the size of the mesh. The precision can be overridden to improve precision or optimize disk footprint.
