Iris - A Journey through OpenGL and beyond to learn Graphics | Graphics Programming | Page 21

primal shadow · 2024-06-11T22:25:51.422Z

For reference here's the roadmap: * Removing material depth writing from the raster pass * Implicit tangents * Moving to persistent culling over a BVH-per-object of cluster groups instead of the huge flat dispatch over clusters * Software raster and making hardware raster less memory-hungry without mesh shaders (blocked on wgpu texture atomics) * Better mesh conversion workflows and asset processing, and making it faster and higher quality * Compute-based shading and software VRS (long term) * Streaming (long term)

primal shadow Jun 11, 2024, 10:25 PM

#

For reference here's the roadmap:

Removing material depth writing from the raster pass
Implicit tangents
Moving to persistent culling over a BVH-per-object of cluster groups instead of the huge flat dispatch over clusters
Software raster and making hardware raster less memory-hungry without mesh shaders (blocked on wgpu texture atomics)
Better mesh conversion workflows and asset processing, and making it faster and higher quality
Compute-based shading and software VRS (long term)
Streaming (long term)

left jacinth Jun 12, 2024, 5:14 PM

#

@wicked notch I copied the compute rasterization function you had, and modified it to render quads instead of triangles, and of course I know your code is directly ported from UE5 source, which I can see correlates very highly. I was wondering if you had thought at all about the licensing of UE5's source

#

but also they say this

#

you can't copy paste UE code without being subject to royalties, but you can "learn" from UE code...

#

At what point does it count as copying

#

especially if the code in question is a software triangle rasterizer, which was understood and modified to instead be a software quad rasterizer

left jacinth Jun 12, 2024, 5:18 PM

#

left jacinth but also they say this

https://www.unrealengine.com/en-US/faq

wicked notch Jun 12, 2024, 5:19 PM

#

yeah, I was definitely thinking of removing that and replacing it, I guess I'll expedite that

left jacinth Jun 12, 2024, 5:20 PM

#

what would you replace it with?

#

I don't understand copyright law man...

wicked notch Jun 12, 2024, 5:37 PM

#

back in the day I was looking at various softrast methods

#

I dunno what to replace the subpixel calcs with tho

#

I gotta look at something public somehow, iirc AMD has some subpixel AA going on in their FFX repos

faint crane Jun 12, 2024, 5:39 PM

#

Does "it was revealed to me in a dream" not work with Unreal's lawyers?

wicked notch Jun 12, 2024, 5:39 PM

#

I mean my project is purely for personal use

#

so I don't really care KEKW

left jacinth Jun 12, 2024, 5:40 PM

#

^

left jacinth Jun 12, 2024, 6:06 PM

#

is there a place to find the logic for how hardware computes the scan-lines so that you make sure they match?

#

I didn't do too much searching but didn't find anything obvious

#

I don't really even know where to look

wicked notch Jun 20, 2024, 8:28 AM

#

exam was so easy I almost forgot it existed (I was late thank god the prof was too KEKW )

left jacinth Jun 20, 2024, 5:52 PM

#

dang

rocky schooner Jun 21, 2024, 8:21 PM

#

Does the market have any demand for OpenGL?

wispy spear Jun 21, 2024, 8:32 PM

#

the market has demand for people who understand graphics and compute pipelines, the api does not matter much

buoyant summit Jun 21, 2024, 8:55 PM

#

no, I personally don't demand any opengl, tyvm (I'm the market)

frank sail Jun 21, 2024, 9:09 PM

#

rocky schooner Does the market have any demand for OpenGL?

The market doesn't demand any API in particular

#

Customers (of any variety) dgaf about graphics APIs beyond what they may see when launching a game

#

Hmm, imagine launching cod 202x and seeing the choice to use fwog or vuk backends

#

add daxa to the list

wicked notch Jun 21, 2024, 9:14 PM

#

how do you know cod doesn't use daxa already

#

potrick may be pulling some strings there

velvet marsh Jun 21, 2024, 9:15 PM

#

the way I read this is that the market demand is for engineers who don't have problems learning a new graphics api quickly

wicked thorn Jun 22, 2024, 3:23 AM

#

a

#

b

#

c

#

d

#

20000

wispy spear Jun 22, 2024, 9:09 AM

#

damn, italians do talk a lot

#

🤏 🤌

wicked notch Jun 22, 2024, 10:16 AM

#

real

ebon ruin Jun 22, 2024, 3:02 PM

#

rocky schooner Does the market have any demand for OpenGL?

I know i lack context

#

but why ask here of all places?

severe dome Jun 22, 2024, 5:41 PM

#

it is a graphics server

ebon ruin Jun 23, 2024, 12:16 AM

#

but specifically in this thread

severe dome Jun 23, 2024, 11:02 AM

#

oh lol i didn't even notice that lmao

primal shadow Jun 25, 2024, 2:37 AM

#

@dull oyster something I thought of, given that we need to calculate the screen space AABB for occlusion culling anyways, why even bother with calculating the screen space diameter/radius of a cluster using a different formula? I guess it's slightly cheaper, which matters when you have millions of clusters that are likely to be skipped?

primal shadow Jun 25, 2024, 3:56 AM

#

I'm the GPU now: https://github.com/JMS55/bevy/blob/meshlet-sw-raster/crates/bevy_pbr/src/meshlet/software_visibility_buffer_raster.wgsl

wicked notch Jun 25, 2024, 2:40 PM

#

GPU lost GPU rights

#

only compute machine now

pale horizon Jun 25, 2024, 3:23 PM

#

Time to do a software renderer on GPU froge_love

loud crag Jun 25, 2024, 3:25 PM

#

the RTX 60 series will just be a big fat compute processor with nothing else

buoyant summit Jun 25, 2024, 3:32 PM

#

loud crag the RTX 60 series will just be a big fat compute processor with nothing else

we already have these it's called data center GPU

#

AMD doesn't put drawing hw into that at all, neither they put format conversion and sampling stuff

#

NV has some very tiny and weak drawing hw and not sure about sampling etc

primal shadow Jun 25, 2024, 6:19 PM

#

Hmm, I can't figure out how to map the meshoptimizer meshlet triangles/indices into a meshlet-local index

#

let index_ids = meshlet.start_index_id + vec3(triangle_id * 3u) + vec3(0u, 1u, 2u);
let indices = meshlet.start_vertex_id + vec3(get_meshlet_index(index_ids.x), get_meshlet_index(index_ids.y), get_meshlet_index(index_ids.z));
let vertex_ids = vec3(meshlet_vertex_ids[indices.x], meshlet_vertex_ids[indices.y], meshlet_vertex_ids[indices.z]);
let vertex_1 = unpack_meshlet_vertex(meshlet_vertex_data[vertex_ids.x]);
let vertex_2 = unpack_meshlet_vertex(meshlet_vertex_data[vertex_ids.y]);
let vertex_3 = unpack_meshlet_vertex(meshlet_vertex_data[vertex_ids.z]);

#

I have 64 vertices and 64 triangles max per meshlet

#

You can take the triangle_id [0, 64) and map that to an index [0, 192)

#

Then adding that to the start_index_if of the meshlet gives you the position within the larger buffer

#

And then you can do the same thing with vertices

#

But now I have a 64 thread workgroup load 1 vertex per thread into shared memory

#

And then 1 thread per triangle can load those vertices and build the triangle

#

let index_ids = meshlet.start_index_id + (local_invocation_id.x * 3u) + vec3(0u, 1u, 2u);
let index_1 = get_meshlet_index(index_ids[0]) - meshlet.start_vertex_id;
let index_2 = get_meshlet_index(index_ids[1]) - meshlet.start_vertex_id;
let index_3 = get_meshlet_index(index_ids[2]) - meshlet.start_vertex_id;
let vertex_1 = screen_space_vertices[index_1];
let vertex_2 = screen_space_vertices[index_2];
let vertex_3 = screen_space_vertices[index_3];

#

But I'm not actually sure my index calculations are correct here. I don't think subtracting by start_vertex_id works to get the meshlet indices back into the [0, 64) range :/

wide shadow Jun 25, 2024, 7:34 PM

#

Might be interesting. Now you can lock vertices when simplifying and new algorithm for optimizing mesh let's for Nvidia GPUs
https://github.com/zeux/meshoptimizer/releases/tag/v0.21

GitHub

Release v0.21 · zeux/meshoptimizer

This release contains improvements to the meshoptimizer library and many gltfpack enhancements! Notably, the introduction of sparse and absolute error simplification options in meshopt_simplify as ...

primal shadow Jun 25, 2024, 7:38 PM

#

wide shadow Might be interesting. Now you can lock vertices when simplifying and new algorit...

Tested on bevy's virtual geometry renderer 🙂

wispy spear Jun 25, 2024, 7:46 PM

#

lustri got sucked in completely by the education programs

primal shadow Jun 25, 2024, 7:48 PM

#

Zeux isin't in this discord, right? I have a question :/

#

I'll use github discussions

wispy spear Jun 25, 2024, 7:50 PM

#

he used to

#

hes on the vk server iirc

primal shadow Jun 25, 2024, 7:51 PM

#

There's a VK server? til

wispy spear Jun 25, 2024, 7:51 PM

#

-,-

primal shadow Jun 25, 2024, 8:11 PM

#

https://github.com/zeux/meshoptimizer/discussions/711

GitHub

How to map meshlet triangle indices into meshlet-local indexes? · z...

Lets say I take a mesh, and convert it into meshlets with a max of 64 vertices per meshlet [0, 64), and 64 triangles per meshlet [0, 64). The data looks like this: pub struct Meshlets { pub meshlet...

wicked notch Jun 25, 2024, 10:46 PM

#

2/3 exams done

#

last one and I graduate let's goo

wispy spear Jun 25, 2024, 10:48 PM

#

frogapprove

#

good luck with the last one my froggi

primal shadow Jun 25, 2024, 10:53 PM

#

gl

#

Bunnies fight in hell (bugged)

dull oyster Jun 26, 2024, 10:39 AM

#

primal shadow <@163242187084005377> something I thought of, given that we need to calculate th...

won't the error depend on the cluster group's orientation relative to the camera? I am not sure how you could compare the parent group's error to the current group's error with AABBs.

faint crane Jul 6, 2024, 6:01 PM

#

briannaPls

#

Incredible that every single statement is wrong.

dull oyster Jul 6, 2024, 6:56 PM

#

"not compatible with raytracing", not with that attitude that's for sure

#

also "not that useful compared to traditional LOD", artists and coders have been struggling a lot with them, a Nanite-like system more or less makes it automatic

#

well obviously I'm preaching to the choir here
but still

wicked notch Jul 6, 2024, 7:04 PM

#

the cooking's been on hold for a while now sadly, but I've been thinking a while about RT with nanite

wicked notch Jul 6, 2024, 7:05 PM

#

dull oyster "not compatible with raytracing", not with that attitude that's for sure

do you make a BVH per cluster group in your thingy?

dull oyster Jul 6, 2024, 7:06 PM

#

My tests are currently on hold for the GP Direct demo, but that's what I stopped on: a BLAS per cluster group, and an instance inside the TLAS per instance of the cluster group*

wicked notch Jul 6, 2024, 7:07 PM

#

did you ever get around measuring tracing and building perf (for the TLAS)?

#

compared to full LOD base BLASes

primal shadow Jul 6, 2024, 7:07 PM

#

Traverse Research has also done RT with hierchal LODs, but idr if they ever explained how

wicked notch Jul 6, 2024, 7:07 PM

#

primal shadow Traverse Research has also done RT with hierchal LODs, but idr if they ever expl...

they promised a followup post but it's nowhere to be found :(

dull oyster Jul 6, 2024, 7:08 PM

#

I did not attempt to measure the difference between LOD 0 and this BLAS-per-group attempt

#

Partly because it is not finished
Partly because I don't care, I want to stream the cluster in and out of memory, so I will need to be able to do it with raytracing enabled too

primal shadow Jul 6, 2024, 7:08 PM

#

wicked notch they promised a followup post but it's nowhere to be found \:(

Contact them and ask them. I know @ Darius (they're on this server) worked on their GI stuff, so you could ask them, or ask if they can get you in contact with the person who did

wicked notch Jul 6, 2024, 7:09 PM

#

dull oyster Partly because it is not finished Partly because I don't care, I want to stream ...

ye the streaming is a big advantage, coupled with BLAS serialization too

dull oyster Jul 6, 2024, 7:10 PM

#

dull oyster I did not attempt to measure the difference between LOD 0 and this BLAS-per-grou...

though in my tests, tracing looked ok, did not notice that big of a difference
but building is a disaster with my current implementation

dull oyster Jul 6, 2024, 7:17 PM

#

wicked notch ye the streaming is a big advantage, coupled with BLAS serialization too

once my video for GP direct is done, I want to check if BLAS serialization can be a way to "fix" my perf

glass sphinx Jul 6, 2024, 8:38 PM

#

dull oyster "not compatible with raytracing", not with that attitude that's for sure

such a dum opinion

#

everyone uses compressed clusters anyways lmao thats already not compatible with rt

delicate rain Jul 6, 2024, 8:51 PM

#

wicked notch the cooking's been on hold for a while now sadly, but I've been thinking a while...

I'll be doing it as my dp most probably

#

There already is a paper on it

wicked notch Jul 6, 2024, 8:52 PM

#

damn

#

we don't deserve saky

delicate rain Jul 6, 2024, 8:53 PM

#

https://graphics.cs.utah.edu/research/projects/ray-tracing-hw-adaptive-lod/

#

Not done by me

#

Lmao

#

Soon though perhaps 🧙‍♂️

#

I'm kinda cooking the topic while on vacation, exploring directions

wicked notch Jul 6, 2024, 8:54 PM

#

delicate rain <https://graphics.cs.utah.edu/research/projects/ray-tracing-hw-adaptive-lod/>

huh this is actually not nanite

delicate rain Jul 6, 2024, 8:54 PM

#

There are also two papers from adobe, talking about displacement without tessellation for rt

delicate rain Jul 6, 2024, 8:54 PM

#

wicked notch huh this is actually not nanite

It's not?

#

I didn't read yet, but I got the impression of it being nanite

wicked notch Jul 6, 2024, 8:55 PM

#

it doesn't rely on meshlets at the very least

frank sail Jul 6, 2024, 8:55 PM

#

more like nanot

wicked notch Jul 6, 2024, 8:55 PM

#

I'm reading it for the first time rn

delicate rain Jul 6, 2024, 8:56 PM

#

I'll try to read it today or tomorrow

wispy spear Jul 6, 2024, 8:56 PM

#

its from the utah pot people?

wicked notch Jul 6, 2024, 8:56 PM

#

it actually looks like something Brian Karis talked about before going to meshlets

delicate rain Jul 6, 2024, 8:56 PM

#

wispy spear its from the utah pot people?

I think so

delicate rain Jul 6, 2024, 8:57 PM

#

wicked notch it actually looks like something Brian Karis talked about before going to meshle...

They get some nice results, I wonder if exploring meshlets with this might be viable

#

I have a lot of learning ahead of me

wicked notch Jul 6, 2024, 8:57 PM

#

I was looking at this some time ago: https://www.intel.com/content/www/us/en/developer/articles/technical/real-time-ray-tracing-of-micro-poly-geometry.html

#

this is actually good ol' nanite

delicate rain Jul 6, 2024, 8:58 PM

#

Nice nice, thank you!

wicked notch Jul 6, 2024, 8:58 PM

#

Intel suggests creating new BVH formats inside Vulkan/DX12 to make nanite + RT easier

wicked notch Jul 6, 2024, 8:59 PM

#

delicate rain <https://graphics.cs.utah.edu/research/projects/ray-tracing-hw-adaptive-lod/>

yeah ok this one is """"just"""" tessellation but RT

#

very nice

delicate rain Jul 6, 2024, 9:00 PM

#

https://dl.acm.org/doi/pdf/10.1145/3478513.3480535 this one is also cool

#

Regardless of that, I wanted to say that I'll probably be hopping onto the nanite hype

#

Just in rt world

wicked notch Jul 6, 2024, 9:02 PM

#

let's fucking gooo.meme

#

https://tenor.com/view/rage-emoji-rage-gif-2922587064166244739

Tenor

delicate rain Jul 6, 2024, 9:07 PM

#

Btw what are good resources for nanite, still the deep dive into nanite video, or something else?

dull oyster Jul 6, 2024, 9:10 PM

#

The slides from the presentation have a lot of information: https://advances.realtimerendering.com/s2021/Karis_Nanite_SIGGRAPH_Advances_2021_final.pdf

#

If you want to look at actual implementations:
https://jms55.github.io/posts/2024-06-09-virtual-geometry-bevy-0-14/
https://jglrxavpok.github.io/ (shameless plug)

wicked notch Jul 6, 2024, 9:15 PM

#

delicate rain <https://graphics.cs.utah.edu/research/projects/ray-tracing-hw-adaptive-lod/>

@ deccer could you pin this so I know who to blame when I dive deep into this rabbithole

wispy spear Jul 6, 2024, 9:17 PM

#

delicate rain <https://graphics.cs.utah.edu/research/projects/ray-tracing-hw-adaptive-lod/>

#

(you guys can ping me any time, no need for the space :P)

wicked notch Jul 6, 2024, 9:19 PM

#

I found the thing

#

https://youtu.be/NRnj_lnpORU?t=809

YouTube

High-Performance Graphics

HPG 2022 Keynote: The Journey to Nanite - Brian Karis, Epic Games

Plan on attending HPG 2023 in Delft, Netherlands, June 26-28, 2023.
Sign up for conference emails at http://eepurl.com/hZvXb1 .

HPG 2022 In-Person Event Keynote: The Journey to Nanite
Brian Karis, Epic Games
August 7, 2022, Fletcher Challenge Theatre, Harbour Centre, Simon Fraser University
https://www.highperformancegraphics.org/2022/in-person...

▶ Play video

wicked notch Jul 6, 2024, 9:19 PM

#

wicked notch https://youtu.be/NRnj_lnpORU?t=809

@delicate rain this is basically what that paper on RT tessellation is about (except they use a more complicated watertight type of tessellation scheme)

primal shadow Jul 6, 2024, 9:23 PM

#

glass sphinx everyone uses compressed clusters anyways lmao thats already not compatible with...

Can I get the TLDR on cluster compression. Rn I have a very basic scheme of just running the serialized bytes through lz4. I plan on making tangents implicit, and using oct-encoded normals, which brings me down to 32 bytes/vertex, but I don't have any further plans for compression, much less per-cluster.

wispy spear Jul 6, 2024, 9:24 PM

#

wicked notch <@226726721133477888> this is basically what that paper on RT tessellation is ab...

glass sphinx Jul 6, 2024, 9:26 PM

#

primal shadow Can I get the TLDR on cluster compression. Rn I have a _very_ basic scheme of ju...

normal tangent bitangent fit in 32 bit and any other attribute compresses well with delta compression across the meshlet

primal shadow Jul 6, 2024, 9:28 PM

#

Do you save data that way vs refrencing a single set of vertices for all meshlets?

#

I'll have to look into delta position/uv though

#

Iirc nanite makes position relative to the cluster bounding sphere center

#

Not sure if that's only disk, or runtime too, as extra fetches for the bounding sphere data seems bleh

delicate rain Jul 6, 2024, 9:32 PM

#

wicked notch <@226726721133477888> this is basically what that paper on RT tessellation is ab...

Perfect, I will consume the knowledge

wicked notch Jul 6, 2024, 9:32 PM

#

one more wrinkle for the brain

glass sphinx Jul 6, 2024, 9:46 PM

#

primal shadow Do you save data that way vs refrencing a single set of vertices for all meshlet...

stored per meshlet

primal shadow Jul 7, 2024, 4:19 AM

#

@wicked notch you use the screen-space AABB size of the meshlet for determing SW/HW raster right? For the first pass, do you compute one AABB using last frame's transform for culling against last frame's depth pyramid, and a second using the current frame's transform for choosing SW/HW?

wicked notch Jul 7, 2024, 11:14 AM

#

yes

primal shadow Jul 8, 2024, 1:48 AM

#

I have most of software raster prototyped, just need to write the actual triangle raster code 😛

#

Spent some time removing the off-the-shelf serializer I was using in favor of writing the bytes out myself. Meshlet asset loading is ~9x faster now, and probably uses less temporary memory https://github.com/bevyengine/bevy/pull/14193

buoyant summit Jul 8, 2024, 2:22 PM

#

dull oyster "not compatible with raytracing", not with that attitude that's for sure

tbf it's kinda annoying with current AS build APIs

#

but also lods aren't as necessary with rt as they are with rast

#

shorter build times, less memory and only somewhat faster trace

delicate rain Jul 8, 2024, 6:08 PM

#

In the paper I posted earlier they did seem to provide a significant speedup

primal shadow Jul 10, 2024, 4:33 AM

#

Started SW raster: https://github.com/JMS55/bevy/blob/3bd72429675286a05eaf1fa1254645c8799b44e2/crates/bevy_pbr/src/meshlet/visibility_buffer_software_raster.wgsl

#

Not working yet though

wheat haven Jul 10, 2024, 5:03 PM

#

~~stealing from~~ reading Retina code has me questioning some of my long-held C++ habits, how dare you try and make me improve

pale horizon Jul 11, 2024, 9:48 PM

#

~~stole code from~~ trained my brain LLM on 😎

wispy spear Jul 14, 2024, 8:16 PM

#

lustri, i wish you good luck for the eksems

wicked notch Jul 14, 2024, 8:18 PM

#

I've done 16 of them what's one more

wispy spear Jul 14, 2024, 8:20 PM

#

still 🙂

#

what sleep deprivation does to a mf

wicked notch Jul 14, 2024, 8:21 PM

#

real

glass sphinx Jul 14, 2024, 9:03 PM

#

lvstri we need to absorb you into daxa

#

https://tenor.com/view/eatthatcookie-gif-10982703

Tenor

wispy spear Jul 14, 2024, 9:29 PM

#

😄

glass sphinx Jul 14, 2024, 9:55 PM

#

grand unified vulkan-abstraction theory

#

also jaker and martty and everyone else

#

we can all join and make one crippled library instead of many

#

😳

wispy spear Jul 14, 2024, 9:58 PM

#

https://tenor.com/view/rick-and-morty-kiss-the-gif-24878522

Tenor

glass sphinx Jul 14, 2024, 9:59 PM

#

at least i feel like there should be a lib to recommend to people other then using opengl thats more of a c api on vulkan

wispy spear Jul 14, 2024, 9:59 PM

#

webgpu?

glass sphinx Jul 14, 2024, 9:59 PM

#

yea that is ok

#

but nih

#

but we can nih together

wispy spear Jul 14, 2024, 10:00 PM

#

i should be quiet, im totally unqualified for this

glass sphinx Jul 14, 2024, 10:00 PM

#

webgpu also has some old-isms because of safety

#

i think we can be more ghetto

glass sphinx Jul 14, 2024, 10:00 PM

#

wispy spear i should be quiet, im totally unqualified for this

you join too

#

we need an army

pale horizon Jul 14, 2024, 10:20 PM

#

glass sphinx at least i feel like there should be a lib to recommend to people other then usi...

vma + vkb + vkguide = you’re all set

glass sphinx Jul 14, 2024, 10:56 PM

#

vkguide does not have a good vulkan abstraction for regular use tho

#

its tutorial code

wispy spear Jul 14, 2024, 11:02 PM

#

im also not a fan and abandoned it

minor root Jul 15, 2024, 7:22 AM

#

One vulkan lib to rule them all

wheat haven Jul 15, 2024, 7:33 AM

#

wispy spear Jul 15, 2024, 10:30 AM

#

the problem is, that it feels like every other week there is a new way of how one should use vulkan

#

similar to every week a new js framework comes out which every web-dev needs to use

frank sail Jul 15, 2024, 10:31 AM

#

the right way is my way

wispy spear Jul 15, 2024, 10:31 AM

#

: )

#

and then there is the whole shader compiler bs in between which doesnt seem to allow certain things sometimes and you cant use vulkan as intended or something like that

frank sail Jul 15, 2024, 10:32 AM

#

hmmm sounds like fuddery

#

I'm not particularly hindered by using glslang, as far as glsl compilation goes

wispy spear Jul 15, 2024, 10:41 AM

#

good good

pale horizon Jul 15, 2024, 11:00 AM

#

wispy spear the problem is, that it feels like every other week there is a new way of how on...

Only 1.3 changed quite a lot, and it’s much easier than before now
Sort of like GL changed with DSA

frank sail Jul 15, 2024, 11:01 AM

#

except with vulkan it's like, actual hardware tiers rather than moronic API limitations being lifted

wispy spear Jul 15, 2024, 11:25 AM

#

i remember when those tiers came around in dx12, and nobody used them

#

or dx11_2?

wispy spear Jul 15, 2024, 11:25 AM

#

pale horizon Only 1.3 changed quite a lot, and it’s much easier than before now Sort of like ...

yes yes, but even then, when you scroll through our discord, there is not a single opinion on how to use it "properly" 😄

#

perhaps certain features of modern vulkan make sense for specific use cases only, but that was never clear or obvious to me, when reading/following those lose conversations about "what one should use/do" when it comes to bulkan

#

an actual guide would be nice (vkguide is not that)

#

also also dont listen to my random bs too much, i am not even focusing on vk right now, but i will from january onwards! mark my words lol

pale horizon Jul 15, 2024, 11:28 AM

#

99% of what vkuide teaches is all that you need gpAkkoShrug
You can disregard what anyone else says KEKW

wispy spear Jul 15, 2024, 11:28 AM

#

i dont like vkguide

#

its half assed

#

and a similar thing like logl right now - it feels like

frank sail Jul 15, 2024, 11:29 AM

#

so vulkan offers multiple ways to do everything, but each way has genuine pros and cons

wispy spear Jul 15, 2024, 11:29 AM

#

yeah

#

a "guide" (perhaps the wrong word) around those pros and cons for way x y and z could be something to build upon

frank sail Jul 15, 2024, 11:30 AM

#

a simple guide won't cut it if you want to be able to pick the right tool every time

#

you have to read lots of stuff

#

like there's a bunch of ways to upload data, and even though my requirements are pretty narrow it's still hard to choose

wispy spear Jul 15, 2024, 11:31 AM

#

yeah thats the stuff im talking about,

#

those things are mentioned across the server somewhat frequently too

#

but its 20 different people all the time 🙂

frank sail Jul 15, 2024, 11:31 AM

#

like in #1128020727380054046 right now kekwfroggified

wispy spear Jul 15, 2024, 11:31 AM

#

: )

frank sail Jul 15, 2024, 11:32 AM

#

maybe you could make a rube goldberg uploading system like what devsh and co. cooked up

wispy spear Jul 15, 2024, 11:33 AM

#

maybe i cant read, but thats exactly what the pros keep saying

#

but then its person a vs person b "yes you shouldnt to it this way, do it that way" 🙂

frank sail Jul 15, 2024, 11:33 AM

#

at some point you have to decide for yourself based on the information presented

wispy spear Jul 15, 2024, 11:33 AM

#

ofc

#

if you just want to display voxels for a minecraft clone its perhaps very different

#

compared to some open world terrain/planetrender streaming thingy

#

or maybe not, and you could use the same mekanism for both things

frank sail Jul 15, 2024, 11:35 AM

#

it's also way too easy to overthink unless you have a concrete problem in front of you

wispy spear Jul 15, 2024, 11:35 AM

#

yep

#

it would help if you know what you want to make exactly

frank sail Jul 15, 2024, 11:35 AM

#

most of the time it's kinda obvious what you need to do to solve a problem in vulkan

#

like most cases of uploading data

#

don't tell the others I said this, but vkCmdUpdateBuffer works quite well in many cases

wispy spear Jul 15, 2024, 11:36 AM

#

: )

frank sail Jul 15, 2024, 11:40 AM

#

e.g. for a basic model viewer you could get away with vkCmdUpdateBuffer for per-frame data and then a simple duplicated buffer for per-object uniforms

#

or just vkCmdUpdateBuffer if you have few enough objects frognant

distant lodge Jul 15, 2024, 12:08 PM

#

wispy spear i dont like vkguide

rip vblanco

loud crag Jul 15, 2024, 12:21 PM

#

vkguide is my lord and saviour

wicked notch Jul 15, 2024, 12:22 PM

#

vblanco in shambles (it's joever)

#

I just used the khronos samples to learn vk honestly

#

I already knew GL so it was pretty easy froge

#

the GL -> Vk pipeline is real

delicate rain Jul 15, 2024, 12:32 PM

#

Meh imo it doesn't really matter what you use to learn, even if you pick up some not so optimal practices, just listening to people on this server will eradicate most of them anyways

#

The biggest step imo is understanding the API, everything else you can just learn from other nih abstractions

glass sphinx Jul 15, 2024, 12:34 PM

#

https://tenor.com/view/fish-sad-blob-fish-gif-15806450

Tenor

glass sphinx Jul 15, 2024, 12:35 PM

#

minor root One vulkan lib to rule them all

the dream

distant lodge Jul 15, 2024, 2:37 PM

#

I'm glad vkguide/this server got me off of cached command buffers and on pipeline dynamic state

#

I think vulkan-tutorial presents this now too

glass sphinx Jul 15, 2024, 4:55 PM

#

distant lodge I'm glad vkguide/this server got me off of cached command buffers and on pipelin...

niiiiice

delicate rain Jul 17, 2024, 3:07 PM

#

delicate rain <https://graphics.cs.utah.edu/research/projects/ray-tracing-hw-adaptive-lod/>

Okay this is cope

#

It basically all stands on the fact that someone will implement a part of the algorithm in hardware

#

Otherwise it reduces memory bandwidth but compute cost will go nuclear

#

It's also unfit for use with current hw rt acceleration

#

So they basically say "currently this will only work in software but it will be slow. If someone adds a new hardware unit and makes it a part of the API it will be fast"

#

Sadge

buoyant summit Jul 17, 2024, 3:16 PM

#

delicate rain It's also unfit for use with current hw rt acceleration

this bit is fine it just needs AS build API adjustments

#

tbf

delicate rain Jul 17, 2024, 3:18 PM

#

Interesting, I did not know that, but still I was kinda talking from the user perspective

buoyant summit Jul 17, 2024, 3:18 PM

#

yes

delicate rain Jul 17, 2024, 3:18 PM

#

As in as of right now it's basically useless for me 😅

buoyant summit Jul 17, 2024, 3:18 PM

#

ok nvm

#

yes

#

that stuff in paper requires hw changes because it affects

#

but

#

I think global LOD is still useful

#

and global LOD doesn't affect traversal

#

with global LOD you'd just swap in differently detailed BLAS depending hit feedback or whatever

#

this is arguably more useful for something like a game as it lets you use less memory for the scene than needing full LOD in memory at all times

delicate rain Jul 17, 2024, 3:20 PM

#

They mentioned that in the introduction - I thought you already could do that with the API (I know close to nothing about the API though)

buoyant summit Jul 17, 2024, 3:20 PM

#

yes you could but

#

for very high tri meshes build cost is oof

#

and that's the bit that would be nice to address

delicate rain Jul 17, 2024, 3:21 PM

#

I see, I need to experiment and build some intuition

buoyant summit Jul 17, 2024, 3:21 PM

#

otherwise yes just do global LODs with rt

#

I think LOD decisions at traversal time are kinda whatever

delicate rain Jul 17, 2024, 3:21 PM

#

There is also the paper from Intel about micro poly rt stuff, guess I'll read that next

buoyant summit Jul 17, 2024, 3:21 PM

#

delicate rain There is also the paper from Intel about micro poly rt stuff, guess I'll read th...

that one you also can't do in current apis

delicate rain Jul 17, 2024, 3:21 PM

#

But I'm kinda worried

buoyant summit Jul 17, 2024, 3:21 PM

#

but it only modifies build AFAIU

delicate rain Jul 17, 2024, 3:22 PM

#

buoyant summit that one you also can't do in current apis

Ah right

buoyant summit Jul 17, 2024, 3:22 PM

#

so the hw can do it

#

just the api kinda can't (just for now, hopefully)

#

well it can but not in a very useful capacity I mean

#

you'll succumb to AS build costs with the kind of meshes they're working with

delicate rain Jul 17, 2024, 3:23 PM

#

I wonder how sophisticated the hw BVHs are

buoyant summit Jul 17, 2024, 3:23 PM

#

@hallow cedar

hallow cedar Jul 17, 2024, 3:23 PM

#

what

buoyant summit Jul 17, 2024, 3:23 PM

#

delicate rain I wonder how sophisticated the hw BVHs are

this

delicate rain Jul 17, 2024, 3:23 PM

#

It seems like that's 90% of all the rt research now - more efficient bvh (aabb-kdop-obb)

hallow cedar Jul 17, 2024, 3:24 PM

#

ah

#

nvidia i think quite a bit more than amd, intel potentially too

#

amd is a pretty standard BVH4, nvidia was doing CWBVH-y stuff I heard somewhere but I'm really not sure

delicate rain Jul 17, 2024, 3:25 PM

#

I imagine it's all secret, especially the details

hallow cedar Jul 17, 2024, 3:25 PM

#

well amd's not really

delicate rain Jul 17, 2024, 3:25 PM

#

But AMD's is slow no?

hallow cedar Jul 17, 2024, 3:25 PM

#

https://gitlab.freedesktop.org/mesa/mesa/-/blob/main/src/amd/vulkan/bvh/bvh.h here you go

hallow cedar Jul 17, 2024, 3:25 PM

#

delicate rain But AMD's is slow no?

that's the catch frog_pregnant

buoyant summit Jul 17, 2024, 3:26 PM

#

all vendors likely use a simple bvh of some kind

#

but some vendors have more interesting hw bits for traversal

hallow cedar Jul 17, 2024, 3:26 PM

#

speaking of i still have next to no idea about rdna4 rt and that's kinda weird to me

buoyant summit Jul 17, 2024, 3:27 PM

#

rdna4 removes RT hw mindblown

hallow cedar Jul 17, 2024, 3:27 PM

#

most accurate hw leak

delicate rain Jul 17, 2024, 3:28 PM

#

buoyant summit but some vendors have more interesting hw bits for traversal

It's sad, as I'd like to do something with hw rt but it seems all is either proposals or closed door stuff not really open to academia

buoyant summit Jul 17, 2024, 3:28 PM

#

bro

#

get amd

delicate rain Jul 17, 2024, 3:28 PM

#

So if you really wanna publish/research in that field, you're bound to software

buoyant summit Jul 17, 2024, 3:29 PM

#

(or intel if you like extra pain)

#

hw rt is there

#

the open drivers are there

#

buy steam deck

#

you can hack the vk driver in whatever way you want

#

do whatever

#

with hw

#

directly

delicate rain Jul 17, 2024, 3:29 PM

#

Uh

buoyant summit Jul 17, 2024, 3:29 PM

#

nv rt in nvk does not yet exist and who knows when will

#

but the setup for rt on nv is a bit complex so it will be some time

#

I doubt you'd be up for REing nv prop blob

delicate rain Jul 17, 2024, 3:31 PM

#

Yeah no and even if I were, there's a bunch of external limitations forced on me so it wouldn't be viable anyways

buoyant summit Jul 17, 2024, 3:31 PM

#

ye

buoyant summit Jul 17, 2024, 3:31 PM

#

buoyant summit you can hack the vk driver in whatever way you want

this bit is very easy

#

very easy

delicate rain Jul 17, 2024, 3:31 PM

#

Guess there is still the adobe displacement stuff that looked interesting though

pale horizon Jul 17, 2024, 3:32 PM

#

buoyant summit rdna4 removes RT hw <:mindblown:608763537900437524> <:mindblown:6087635379004375...

good

buoyant summit Jul 17, 2024, 3:33 PM

#

pale horizon good

bro go back to your thread before I remove u from lyf

#

I swear on me gpu

hallow cedar Jul 17, 2024, 3:34 PM

#

be careful with that

#

it might leave you hanging

pale horizon Jul 17, 2024, 3:35 PM

#

They should remove TAA hw next

buoyant summit Jul 17, 2024, 3:35 PM

#

what if there's no taa hw 😳

#

bro wants to remove the entire gpu

hallow cedar Jul 17, 2024, 3:36 PM

#

CUs can do TAA with shaders -> remove CUs

#

only command processor pls thxbai

buoyant summit Jul 17, 2024, 3:36 PM

#

computing on command processor be like frog_turtle

pale horizon Jul 17, 2024, 3:36 PM

#

Retvrn to software rendering

hallow cedar Jul 17, 2024, 3:36 PM

#

but what if software taa

#

maybe we will have to go outside

buoyant summit Jul 17, 2024, 3:37 PM

#

pale horizon Retvrn to software rendering

we have that already it's called datacenter gpu

pale horizon Jul 17, 2024, 3:37 PM

#

Remove TAA CPU instruction

buoyant summit Jul 17, 2024, 3:37 PM

#

those don't have drawing hw

#

or format conversion, image load/store, sampling support

#

just software

pale horizon Jul 17, 2024, 3:37 PM

#

buoyant summit those don't have drawing hw

Good

buoyant summit Jul 17, 2024, 3:37 PM

#

just compute

#

only CUs

#

no drawing

hallow cedar Jul 17, 2024, 3:37 PM

#

but what if our eyes do taa internally
your eyes do taa get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out get them out

buoyant summit Jul 17, 2024, 3:37 PM

#

bro

#

https://tenor.com/view/spray-bottle-cat-spray-bottle-spray-bottle-meme-loop-gif-25594440

Tenor

pale horizon Jul 17, 2024, 3:38 PM

#

Me when I turn my head in game with TAA fr fr

pale horizon Jul 17, 2024, 3:40 PM

#

hallow cedar but what if our eyes do taa internally your eyes do taa get them out get them ou...

My vision is kinda blurry when I turn my head fast
It’s joever 😦

frank sail Jul 17, 2024, 9:33 PM

#

delicate rain But AMD's is slow no?

Same order of magnitude tbf

delicate rain Jul 17, 2024, 9:35 PM

#

I was mostly meming

#

Although I've heard some hate from Patrick

frank sail Jul 17, 2024, 9:37 PM

#

Oh, AMD's rt acceleration is indeed a lot slower, but it's still very usable

delicate rain Jul 17, 2024, 9:39 PM

#

Last I've heard it's 4-5x faster than software, but maybe that is outdated

#

I want to do hw rt so much, but it's hard to find something to wrap it with (as in a topic for uni)

finite quartz Jul 17, 2024, 9:43 PM

#

buoyant summit https://tenor.com/view/spray-bottle-cat-spray-bottle-spray-bottle-meme-loop-gif-...

https://cdn.discordapp.com/emojis/1157634614857314394.gif?size=80&quality=lossless

fiery bolt Jul 19, 2024, 3:05 AM

#

frank sail Same order of magnitude tbf

~~jaker you don't need to shill amd anymore~~

primal shadow Jul 20, 2024, 5:28 PM

#

@wicked notch What's the theory behind the math for this? How did you get this equation? https://github.com/LVSTRI/IrisVk/blob/master/shaders/0.1/rasterizer.comp#L225

wispy spear Jul 20, 2024, 5:30 PM

#

stolen from ue sources

wicked notch Jul 20, 2024, 5:52 PM

#

you can see that in AMD's small triangle culling code

#

I don't feel qualified to explain that (also because it's been so long) but AMD should have everything you need

primal shadow Jul 20, 2024, 6:09 PM

#

wicked notch you can see that in AMD's small triangle culling code

What do they call the project? Looking up small triangle culling code amd did not find anything

twin musk Jul 20, 2024, 6:15 PM

#

primal shadow What do they call the project? Looking up small triangle culling code amd did no...

geometryfx possibly

primal shadow Jul 20, 2024, 6:16 PM

#

I didn't find anything there

#

Also https://github.com/GPUOpen-LibrariesAndSDKs/WorkGraphComputeRasterizer/tree/main seems empty, and has no shaders?? idk

GitHub

GitHub - GPUOpen-LibrariesAndSDKs/WorkGraphComputeRasterizer: A com...

A compute/workgraph workload running inside the Cauldron framework - GPUOpen-LibrariesAndSDKs/WorkGraphComputeRasterizer

#

Somewhere here, maybe https://github.com/GPUOpen-Effects/GeometryFX/blob/02c4139eff3ee9a1c180a25fa4a6020a477f9fee/amd_geometryfx/src/Shaders/AMD_GeometryFX_Filtering.hlsl#L165

GitHub

GeometryFX/amd_geometryfx/src/Shaders/AMD_GeometryFX_Filtering.hlsl...

DirectX 11 library that provides convenient access to compute-based triangle filtering (CTF) - GPUOpen-Effects/GeometryFX

wispy spear Jul 22, 2024, 7:36 PM

#

finish it first

faint ruin Jul 22, 2024, 7:41 PM

#

oh wait I didn’t even realize Elias was the one that wrote that vulkan article about how he learned wowza

#

sorry my mind goes a million miles a second sometimes

wicked notch Jul 22, 2024, 7:49 PM

#

elias made it

#

insane

faint ruin Jul 22, 2024, 7:52 PM

#

wicked notch elias made it

I wasn’t paying attention to the names until after reading the messages :(((

#

I’m a little dense sometimes

wicked notch Jul 22, 2024, 7:54 PM

#

it's ok

#

don't worry about it chief

faint ruin Jul 22, 2024, 7:55 PM

#

I’ll finish Vkguide up first
I’ve been super happy with it this far, probably the best thing to do before asking further questions

wispy spear Jul 22, 2024, 7:55 PM

#

doesnt hurt to take notes about things which make no sense yet

#

and there is a post (something something vkguide) in #1019722539116802068 from vblanco, the author of that guide

faint ruin Jul 22, 2024, 7:57 PM

#

thank ya thank ya

pale horizon Jul 22, 2024, 8:11 PM

#

What was the deleted post about? 😅

faint ruin Jul 22, 2024, 8:15 PM

#

pale horizon What was the deleted post about? 😅

Knee jerk reaction to delete it buuuut
My initial question was essentially asking once one finishes vkguide, compared to say Vulkan-tutorial, it’s much more applied in that you’re making an engine. The question was essentially in regards to making a smaller renderer inspired by vkguide to better fit a different use case, or if basing a new project off what I did in vkguide makes sense where things do not see

#

Like, if I wanted to make a simulation application or a tool that lets me test different rendering techniques or run different demos of implemented graphics papers. Or I want to make a nice terrain generator. How can I take what I learn in vkguide into these tutorials in a way that tailors it to my use cases?

pale horizon Jul 22, 2024, 8:18 PM

#

The renderer in vkguide is already quite minimal, and you’ll likely need much more for a “small” renderer, tbh
You can drop “drawing transparent objects” part if you don’t need it.

Also you don’t need “GPU driven” part most likely, unless you plan to render 1000s of objects

wispy spear Jul 22, 2024, 8:19 PM

#

i would read the whole thing anyway

faint ruin Jul 22, 2024, 8:21 PM

#

wispy spear i would read the whole thing anyway

That’s my plan to at least read up to and through gpu driven rendering (maybe not before actually iterating on ideas), cause I have time to learn and wanna improve my understanding and application of the API

pale horizon Jul 22, 2024, 8:21 PM

#

wispy spear i would read the whole thing anyway

Yeah, you can read the last chapter, but just READ don’t implement, haha

faint ruin Jul 22, 2024, 8:23 PM

#

I have ideas on projects I wanna take on, it’s just a matter of reasonably converting what I have here to my needs.

I also feel a certain way of “cheating” using sample code and tutorials like this without actually understanding the underlying API (which happens through reading and actually getting your hands dirty)

#

But that’s a personal disposition I’m working on discarding

#

it makes no sense to have that especially if I am new to Vulkan.

wispy spear Jul 22, 2024, 8:24 PM

#

you also better create a post in #1019722539116802068 about the projects you have in mind, like the other frogs did/do too

#

https://tenor.com/view/or-else-wednesday-addams-family-cut-neck-gif-5060381

Tenor

Or else

▶ Play video

faint ruin Jul 22, 2024, 8:26 PM

#

oh no, I will not be subject to the threats of those who came before me
post my projects I shall

wispy spear Jul 22, 2024, 8:26 PM

#

hehe

faint ruin Jul 22, 2024, 8:27 PM

#

Vulkan has been fun, learning it has been exhausting but I like the challenge
it definitely helps to have done stuff with other APIs too
I’m hyped, thanks guys

pale horizon Jul 22, 2024, 8:34 PM

#

faint ruin I have ideas on projects I wanna take on, it’s just a matter of reasonably conve...

You should copy it once
Then you can re-read it again, but start organizing things more to your liking.
Also, the best understanding happens when you debug something, or implement some technique not described in the tutorial.

E.g. shadow mapping, post-processing effects, PBR stuff and so on

#

I can also highly recommend getting into bindless textures ASAP
Descriptor sets are not that good and bindless + buffer direct access allows you to mostly get rid of them

faint ruin Jul 22, 2024, 8:42 PM

#

pale horizon You should copy it once Then you can re-read it again, but start organizing thin...

Hard hard agree. That’s where I’m most excited cause I know I’m gonna learn the most through the struggles.
But are descriptor sets really not that good to you?

pale horizon Jul 22, 2024, 8:44 PM

#

It’s not like they’re “bad”. It’s just managing/allocating/defining them is PITA when with bindless you just pass your textures with push constants and that’s it

faint ruin Jul 22, 2024, 8:45 PM

#

I see

#

Push constants seems to be something I might wanna use later on

#

But yeah, the process for setting everything up with the sets and layouts does seem rather tedious

pale horizon Jul 22, 2024, 8:50 PM

#

faint ruin Push constants seems to be something I might wanna use later on

No, you need them right at the start
And vkguide uses them extensively

faint ruin Jul 22, 2024, 8:50 PM

#

wispy spear finish it first

you know this guy made a good point to me to finish it lol

#

seems like I’ll keep digging through then

faint ruin Jul 22, 2024, 8:51 PM

#

pale horizon No, you need them right at the start And vkguide uses them extensively

I was considering stopping cause I felt I was escaping the fundamentals of VULKAN and dealing with engine architecture rather than Vulkan itself, hence why I supplement with the specs and the Wien lectures

pale horizon Jul 22, 2024, 8:52 PM

#

Push constants are fundamentals of vk 😅
But yeah, it’s good to reference multiple resources. Also TU Wien lectures are quite good indeed.

#

Don’t worry about “not getting” it - you’ll understand more with practice and when you solve real problems and add code on your own.

#

I think I know like 10% of vk and still I managed to make something presentable

faint ruin Jul 22, 2024, 8:56 PM

#

pale horizon I think I know like 10% of vk and still I managed to make something presentable

That makes me feel better about going forward on this route of learning

#

Not necessarily the 10%, but the knowledge that leaning into the fundamentals and just… working with it. You don’t need to learn everything at once

#

It’s like a fighting game. You don’t need to learn every combo at the beginning

pale horizon Jul 22, 2024, 8:58 PM

#

Yeah. You’ll rewrite it 10 times anyways, lmao

faint ruin Jul 22, 2024, 9:00 PM

#

LOL probably true

faint ruin Jul 22, 2024, 9:01 PM

#

pale horizon Yeah. You’ll rewrite it 10 times anyways, lmao

Your article gave me the kick to just go in and get started so I appreciate you talking to me about this. Major appreciate and a follow from me :)

wicked notch Jul 24, 2024, 2:46 PM

#

man

#

you ever try to sit down and study

#

but while you read your eyes go out of focus

#

I think I'm going to die soon

hallow cedar Jul 24, 2024, 2:47 PM

#

while you read
wish I'd get that far

#

i know the "going out of focus" stuff stuff though

#

it usually happens when I'm tired

wicked notch Jul 24, 2024, 2:49 PM

#

it's barely 5pm tho agonyfrog

#

the 32°C of hellfire in my room doesn't help

hallow cedar Jul 24, 2024, 2:50 PM

#

ah you too

#

i had 35°C earlier this week

#

death

delicate rain Jul 24, 2024, 2:59 PM

#

wicked notch but while you read your eyes go out of focus

Did you just start?

#

Usually happens for me the first hour or so that I start

#

Once I get into it properly it's usually fine

#

And then once I get tired it comes back yeah

buoyant summit Jul 24, 2024, 3:37 PM

#

wicked notch the 32°C of hellfire in my room doesn't help

no AC?

#

today was my first moderately productive day in a while actually

#

did some CTS work that I can't talk about details of

#

god VK-GL-CTS is a chonker

hallow cedar Jul 24, 2024, 3:41 PM

#

it's insane

#

on my old laptop I had to basically compile until I run OOM, then compile with lower thread-count (singlethreaded at time) for a bit, then abort and continue with higher thread count

#

because -j1 is incredibly dog slow but -j6 (yes it's a hexcore 🐸) makes me run oom

buoyant summit Jul 24, 2024, 3:43 PM

#

buy chungus ram

#

why are there no laptops with option for 64G

#

or 128G

#

pls

#

ayymd how can I survive off 32G with a 12c SMT2 part

#

my current work laptop is actually a

#

4c SMT2 16G thing

#

which is...

#

well..

#

it works

hallow cedar Jul 24, 2024, 3:44 PM

#

i was considering retrofitting 64g into my laptop

buoyant summit Jul 24, 2024, 3:44 PM

#

with swap it's actually fine, so browser etc can get swapped out while important stuff gets to use ram

hallow cedar Jul 24, 2024, 3:44 PM

#

went with 32g for now and it's ok

buoyant summit Jul 24, 2024, 3:44 PM

#

you should

#

32G felt a bit uncomfortable on my 5900X PC

hallow cedar Jul 24, 2024, 3:44 PM

#

yes on pc i'm 64G

#

on laptop i'm 8c16t32gb

#

but mediumterm I kinda want to look at alternatives

buoyant summit Jul 24, 2024, 3:45 PM

#

I just want a nice reasonable laptop

hallow cedar Jul 24, 2024, 3:45 PM

#

when I get a non-shit internet connection I'm considering just having a remote power switch for my pc and doing remote-y dev

buoyant summit Jul 24, 2024, 3:45 PM

#

where 5p69e cpu pls

hallow cedar Jul 24, 2024, 3:46 PM

#

I suppose my case is sorta special though

buoyant summit Jul 24, 2024, 3:46 PM

#

ye that sounds nice

hallow cedar Jul 24, 2024, 3:46 PM

#

I need chonker gpu for rt, but I've come to hate gamer bricks

hallow cedar Jul 24, 2024, 3:49 PM

#

hallow cedar when I get a non-shit internet connection I'm considering just having a remote p...

(the idea would be like a raspi that operates some digital power/reset button in order to do power cycling and so on)

#

though I'd like to do boot/kernel selection perhaps

#

hmmmm

#

anyway I should probably ramble about this in my own thread instead 🐸 sorry

glass sphinx Jul 24, 2024, 4:23 PM

#

wicked notch but while you read your eyes go out of focus

slam stimulants

wicked notch Jul 24, 2024, 4:26 PM

#

buoyant summit no AC?

'tis but a temporary relief

#

btw it's two hours later and the situation has barely improved

#

here's a conversation between me & my friends about the situation at hand (the exam is tomorrow)

#

first message is "how is it going"

loud crag Jul 24, 2024, 4:32 PM

#

L

wicked notch Jul 24, 2024, 4:33 PM

#

loud crag L

real

velvet marsh Jul 24, 2024, 4:38 PM

#

it's going to be ok, unless it won't

buoyant summit Jul 24, 2024, 4:59 PM

#

wicked notch 'tis but a temporary relief

wym temporary

#

tbh heat is also temporary

frank sail Jul 24, 2024, 6:50 PM

#

life is also temporary

velvet marsh Jul 24, 2024, 6:54 PM

#

do you all need a hug today

wicked notch Jul 24, 2024, 7:01 PM

#

I need to pass this godforsaken exam

wheat haven Jul 24, 2024, 7:01 PM

#

what subject

wicked notch Jul 24, 2024, 7:01 PM

#

image processing and neural networks

#

it's 6 credits but it should be 1000

#

there's like 5 books we had as material

#

my notes are 600k words and a 400 pages

#

it's insane

velvet marsh Jul 24, 2024, 7:23 PM

#

sounds over the top and unnecessary

wicked notch Jul 25, 2024, 9:58 AM

#

lads

#

it is

#

finally

#

(for real this time)

#

over

faint crane Jul 25, 2024, 10:00 AM

#

Is it over, or are we so back?

wicked notch Jul 25, 2024, 10:01 AM

#

we are truly back

#

exam time is finally over

delicate rain Jul 25, 2024, 10:02 AM

#

Ez clap

#

Time to shill some gp

wicked notch Jul 25, 2024, 10:08 AM

#

ngl I was so lucky bleakekw

#

I knew nothing about morphological processing (had to skip because it was impossible to understand all that garbage in a few days)

#

but they didn't ask me about it so

frank sail Jul 25, 2024, 10:09 AM

#

you did it laddie

frank sail Jul 25, 2024, 10:10 AM

#

wicked notch my notes are 600k words and a 400 pages

damn son

delicate rain Jul 25, 2024, 10:13 AM

#

Wtf

#

I think I took like 500 words worth of notes accumulated across all my years studying lol

frank sail Jul 25, 2024, 10:15 AM

#

I was so stupid I would take "notes" in math lectures just copying what the prof wrote, hoping it would make me magically understand the topic better

#

normally I'd barely write down anything though

delicate rain Jul 25, 2024, 10:16 AM

#

I did that a little the first year

#

But yeah just listen and try to understand

#

Much better spent time than copying and not catching what he says

frank sail Jul 25, 2024, 10:16 AM

#

my math prof was foreign and had an accent that I could hardly understand (my bad hearing didn't help either) froge_bleak

delicate rain Jul 25, 2024, 10:17 AM

#

Oh god, yeah that does not sound good bleakekw

frank sail Jul 25, 2024, 10:17 AM

#

anyway I somehow pulled through

delicate rain Jul 25, 2024, 10:17 AM

#

Having pretty much all lectures on YouTube kinda allowed me to get away with almost no notes too btw

pale horizon Jul 25, 2024, 10:26 AM

#

LVSTRI gonna come up with novel waifu generators next

wispy spear Jul 25, 2024, 11:48 AM

#

wb lustri, time to bikeshed UE6 clone

loud crag Jul 25, 2024, 2:36 PM

#

@wicked notch where did you get this piece of code from?
https://github.com/LVSTRI/Retina/blob/1152267563683bb6a8a6731ce31b8bc07ffe88c4/src/Retina/Sandbox/Shaders/Visbuffer.mesh.glsl#L99

    const vec3 v0 = sh_ClipVertices[indices.x];
    const vec3 v1 = sh_ClipVertices[indices.y];
    const vec3 v2 = sh_ClipVertices[indices.z];
    const float det = determinant(mat3(v0, v1, v2));

    gl_MeshPrimitivesEXT[currentIndex].gl_CullPrimitiveEXT = det > 0.0;

#

since idk why this works, I'm a little confused

#

because that does backface culling, right?

#

since it does det > 0.0

#

however in my Metal renderer that does frontface culling, and I need to do det < 0.0 instead

#

is this related to Y+ being up instead of down?

wide shadow Jul 25, 2024, 2:48 PM

#

the code is from here I guess and if you do proj_mat[1][1] *= -1.0f to flip Y axis then you do frontface culling instead of backface culling
https://zeux.io/2023/04/28/triangle-backface-culling/

Fine-grained backface culling

Backface culling is something we take for granted when rendering triangle meshes on the GPU. In general, an average mesh is expected to have about 50% of its triangles facing away from the camera. Unless you forget to set appropriate render states in your favorite graphics API, the hardware will reject these triangles as early in the rasterizati...

wicked notch Jul 25, 2024, 4:28 PM

#

loud crag is this related to Y+ being up instead of down?

yes this is related to your coordinate system

loud crag Jul 25, 2024, 5:02 PM

#

ok thank

velvet marsh Jul 25, 2024, 5:39 PM

#

a negative determinant indicates a reflection

#

so it's not just a up thing, but a handedness thing

#

you could have a different up axis and still have a positive determinant

#

and you could have the same up axis and a negative determinant

primal shadow Jul 25, 2024, 10:31 PM

#

Help me brainstorm names that are better than meshlets, but less verbose than virtual geometry 😅
I was thinking maybe MicroMesh, but that's kinda a thing already (RT technique)

wicked notch Jul 25, 2024, 10:32 PM

#

meshlet = cluster

#

if that's what you want

wispy spear Jul 25, 2024, 10:32 PM

#

micromesh is a thing already

primal shadow Jul 25, 2024, 10:32 PM

#

No a name for the user-facing feature in bevy. Something catchy like nanite.

wispy spear Jul 25, 2024, 10:33 PM

#

i linked the papers somewhere in frogfood a week ago or so

primal shadow Jul 25, 2024, 10:33 PM

#

wicked notch meshlet = cluster

In my code meshlet = from asset, cluster = instance of meshlet

wicked notch Jul 25, 2024, 10:33 PM

#

that's weird

#

could've just done cluster and clusterInstance innit?

#

or is that reserved somewhere else

primal shadow Jul 25, 2024, 10:34 PM

#

Ehh that's a bit verbose with how often I use the word cluster 😅

wicked notch Jul 25, 2024, 10:35 PM

#

this is a matter of preference ofc

#

personally I'd rather a little bit of verbosity if that means not using two synonymous words in different contexts

primal shadow Jul 25, 2024, 10:36 PM

#

It's subject to change, I need to clean all my code up, it's on the TODO. I'll think about it.

faint crane Jul 25, 2024, 10:36 PM

#

primal shadow Jul 25, 2024, 10:36 PM

#

I want to try and come up with a better name for the feature though since it's confusing that it's called meshlet, which is also used everywhere in the code 😛

wispy spear Jul 25, 2024, 10:37 PM

#

who cares whether rust goes into nodejs? (im sorry if thats you cody)

wicked notch Jul 25, 2024, 10:37 PM

#

primal shadow I want to try and come up with a better name for the feature though since it's c...

you could reuse what unreal calls HLOD 😈

faint crane Jul 25, 2024, 10:37 PM

#

Nah, alcohol as a naming scheme.

wispy spear Jul 25, 2024, 10:37 PM

#

ah

wicked notch Jul 25, 2024, 10:38 PM

#

(in UE HLOD is a very different thing from virtual geometry)

primal shadow Jul 25, 2024, 10:38 PM

#

We have HLODs already as a seperate thing, although I think we call them VisibilityRange

wispy spear Jul 25, 2024, 10:38 PM

#

just keep the same naming scheme jasmine, everyone doing it knows what x and y means, and you cooking up some new terminology will be conchfusing

#

unless you want to coin those new shit for some weird reason in the bevy world, to stand out or something idk

primal shadow Jul 25, 2024, 10:39 PM

#

Ok forget the variable naming

#

I need a new name for the rendering feature itself, like Nanite

wispy spear Jul 25, 2024, 10:39 PM

#

Bevvite 😉

primal shadow Jul 25, 2024, 10:40 PM

#

Currently it's just "MeshletPlugin" and "MeshletMesh"

#

Which are terrible

wispy spear Jul 25, 2024, 10:40 PM

#

Meshlette

#

MeshlettifiedMesh

primal shadow Jul 25, 2024, 10:40 PM

#

Those are so much worse 😅

wispy spear Jul 25, 2024, 10:40 PM

#

ey 😄

#

why not just Mesh

#

and the engine decides whether it needs to meshlettify it or not, via some switch

primal shadow Jul 25, 2024, 10:41 PM

#

We have a Mesh, but it's an entirely seperate thing, and would be very confusing to users who don't use the feature

wispy spear Jul 25, 2024, 10:41 PM

#

remove it then

wicked notch Jul 25, 2024, 10:41 PM

#

do you fancy acronyms? KEKW

primal shadow Jul 25, 2024, 10:41 PM

#

Rip transparency and animated meshes I guess 😛

#

Sure, acrynyoms are fine

wispy spear Jul 25, 2024, 10:42 PM

#

and make Mesh 2.0 the new Mesh

#

and add animations back in heh

wicked notch Jul 25, 2024, 10:42 PM

#

CBVHLOD (cluster based virtualized hierarchical level of detail bleakforg )

primal shadow Jul 25, 2024, 10:42 PM

#

wispy spear and make Mesh 2.0 the new Mesh

Also remove half of bevy's userbase... Even my brand new laptop dosen't support Int64 atomics (thanks intel)

wicked notch Jul 25, 2024, 10:42 PM

#

intel skill issue

wispy spear Jul 25, 2024, 10:42 PM

#

thats what you get for targeting peasant hardware

primal shadow Jul 25, 2024, 10:43 PM

#

wicked notch CBVHLOD (cluster based virtualized hierarchical level of detail <:bleakforg:1259...

Oh, I'm like I don't actually do the BVH part yet, but it dosen't stand for that lmao

wicked notch Jul 25, 2024, 10:43 PM

#

primal shadow Oh, I'm like I don't actually do the BVH part yet, but it dosen't stand for that...

yes yes, the confusion is part of a good acronym

primal shadow Jul 25, 2024, 10:43 PM

#

wicked notch intel skill issue

Arc is annoyingly stuck at SM 6.5 and not 6.6

wicked notch Jul 25, 2024, 10:43 PM

#

the more confusing the better the acronym

primal shadow Jul 25, 2024, 10:43 PM

#

I think I'll just go with VirtualMeshPlugin and VirtualMesh 😛

wicked notch Jul 25, 2024, 10:44 PM

#

but yeah naming is hard

primal shadow Jul 25, 2024, 10:44 PM

#

What makes a mesh "virtual"? Idk it's a meaningless term, but it sounds good

wicked notch Jul 25, 2024, 10:45 PM

#

I think the virtual is mostly referring to virtual texturing, as nanite author explains it the logical transition was "virtual texturing" (decoupling memory budget and texture sizes from rendering) to "virtual geometry" (again, removing concerns about poly budgets and automating LODs)

primal shadow Jul 25, 2024, 10:45 PM

#

No I know, but for like non-rendering people, virtual is such a meaningless term lol

wicked notch Jul 25, 2024, 10:45 PM

#

o

#

yeah idk how to explain it to non rendering people

primal shadow Jul 25, 2024, 10:50 PM

#

The other annoying thing with VirtualMesh is that it has mesh in the name

#

So now I'm going to have dumb variables like virtual_mesh_meshlets

#

Which I mean, ig it's fine

#

Best I can come up with

wicked notch Jul 25, 2024, 10:52 PM

#

tell your users "you better pray I don't call my next variables x and y"

#

make them appreciate smart

velvet marsh Jul 25, 2024, 10:52 PM

#

primal shadow No a name for the user-facing feature in bevy. Something catchy like nanite.

LLM's are pretty good at coming up with names imo, at least sparking ideas

#

vmeshes is my contribution

primal shadow Jul 25, 2024, 10:53 PM

#

VMesh is not bad, thanks

frank sail Jul 25, 2024, 11:01 PM

#

primal shadow Help me brainstorm names that are better than meshlets, but less verbose than vi...

nangons

#

nanogons

#

minimesh

wicked notch Jul 25, 2024, 11:02 PM

#

nanogoon

frank sail Jul 25, 2024, 11:03 PM

#

geographs

faint crane Jul 25, 2024, 11:10 PM

#

nanot

buoyant summit Jul 26, 2024, 12:34 AM

#

primal shadow Help me brainstorm names that are better than meshlets, but less verbose than vi...

megageometry

#

megametry

#

ur welcome

buoyant summit Jul 26, 2024, 12:34 AM

#

wicked notch nanogoon

bro I swear one more and your walls will collapse

primal shadow Jul 26, 2024, 8:48 PM

#

@wicked notch do you understand nanite's scanline code? I don't get how it's solving for the passing X-interval

#

It's also not working for me 😦

primal shadow Jul 27, 2024, 6:01 AM

#

Nvm think I got it

#

Idk why I swear I've tried this code before, but now it's suddenly working...

#

Mostly working, some pixels are slightly off still, but close!

primal shadow Jul 27, 2024, 7:12 AM

#

Haven't finished the heuristics for SW/HW render switching, but only getting maybe a ~10% speedup vs HW only... Nowhere near what Nanite is quoting.

#

On the plus side I save a lot of memory having to only allocate data per-cluster instead of per-triangle, as I don't have access to mesh shaders to make HW raster fast

wispy spear Jul 27, 2024, 8:21 AM

#

time to extend webgpu to support mesh shaders

primal shadow Jul 27, 2024, 9:46 PM

#

wispy spear time to extend webgpu to support mesh shaders

The wgpu side stuff wouldn't be so bad, I don't look forward to the naga changes needed though :/

#

I suppose I could use spirv passthrough and write it in glsl instead ig..

wispy spear Jul 27, 2024, 9:47 PM

#

what is naga? the shader middleware?

primal shadow Jul 27, 2024, 9:53 PM

#

Shader transpiler. Wgsl -> msl/hlsl/spirv/glsl

wispy spear Jul 27, 2024, 9:54 PM

#

ah

fiery bolt Jul 28, 2024, 12:00 AM

#

primal shadow Haven't finished the heuristics for SW/HW render switching, but only getting may...

possibly has to do with the heuristics, but also shader micro-optimization and triangle density you're testing on thonk

primal shadow Jul 28, 2024, 12:04 AM

#

fiery bolt possibly has to do with the heuristics, but also shader micro-optimization and t...

I'm guessing it's the triangle density yeah. There's not really much left to optimize in the shader.

fiery bolt Jul 28, 2024, 12:19 AM

#

try out some raw megascans assets perhaps

wispy spear Jul 28, 2024, 8:54 PM

#

@primal shadow https://youtu.be/xT_-oSo-w-E?t=601 that reminded me of your shtuff

YouTube

High-Performance Graphics

HPG 2024: Day 3 - July 28th

Full program: https://www.highperformancegraphics.org/2024/program/

▶ Play video

primal shadow Jul 28, 2024, 9:05 PM

#

wispy spear <@145540119141679105> https://youtu.be/xT_-oSo-w-E?t=601 that reminded me of you...

Thanks, will take a look in a bit

#

Trying to figure out why tf triangles wrap around to the other side of the screen when doing scanline...

primal shadow Jul 28, 2024, 9:46 PM

#

wispy spear <@145540119141679105> https://youtu.be/xT_-oSo-w-E?t=601 that reminded me of you...

Not much that applies to me idt, but early-depth test (do non-atomic load and depth test, before atomic write) seems interesting, I can't actually test it though, as wgsl dosen't allow mixing atomic/non-atomic ops on the same buffer.

#

Also so many interesting RTGI papers from HPG this years, mainly https://www.highperformancegraphics.org/slides24/hpg24_oscgi.pdf, https://www.highperformancegraphics.org/posters24/Salmi - Fast Local Neural Regression For Low-Cost Global Illumination Denoising.pdf, and https://www.highperformancegraphics.org/posters24/Kawala - Work Graphs based Denoising for Real-Time Ray Tracing.pdf

#

I don't have any time to experiment though, as I need to focus on meshlets :((

wispy spear Jul 28, 2024, 9:47 PM

#

ah oki

primal shadow Jul 28, 2024, 9:47 PM

#

And my sutpid fucking shader won't work and renderdoc is giving me different results when viewing different threads

#

Any idea why renderdoc says thread 2 writes a "good" value to LDS, but then threads 0 reads garbage at index 2??

wispy spear Jul 28, 2024, 9:48 PM

#

primal shadow And my sutpid fucking shader won't work and renderdoc is giving me different res...

schrodingers triangle

frank sail Jul 28, 2024, 9:49 PM

#

primal shadow Any idea why renderdoc says thread 2 writes a "good" value to LDS, but then thre...

I don't think RenderDoc simulates shared memory

#

Well, interactions between multiple threads

primal shadow Jul 28, 2024, 9:49 PM

#

Really? Ahh

#

Well then idk how to even debug this tbh

frank sail Jul 28, 2024, 9:50 PM

#

Ok, it doesn't run other threads at all

To debug a compute thread simply go to the compute shader section of the pipeline state viewer and enter the group and thread ID of the thread you would like to debug. This thread will be debugged in isolation with no other threads in the group running.

This means there can be no synchronisation with any other compute thread running and the debugging will run from start to finish as if no other thread had run.

primal shadow Jul 28, 2024, 9:51 PM

#

Yeahh idk how to even go about trying to figure out to figure out why my triangle is wrapping around the screen when it should be clipped

frank sail Jul 28, 2024, 9:53 PM

#

Make a version of it that doesn't use shared memory

primal shadow Jul 28, 2024, 9:53 PM

#

I guess do all the math on the CPU?

#

Oh, good idea lol

faint crane Jul 28, 2024, 10:00 PM

#

wispy spear <@145540119141679105> https://youtu.be/xT_-oSo-w-E?t=601 that reminded me of you...

Didn’t sign up but I’m at SIGGRAPH if you want to show your frogs.

#

#

Or bunnies apparently.

wispy spear Jul 28, 2024, 10:01 PM

#

unfortunately im not in the gaming/graphics industry and therefore neither at siggraph nor hpg nor any other giraffics related conference : (

faint crane Jul 28, 2024, 10:12 PM

#

I can spread the gospel of VSM or whatever acronym we want to give it.

wispy spear Jul 28, 2024, 10:15 PM

#

please do 🙂

primal shadow Jul 28, 2024, 10:28 PM

#

frank sail Make a version of it that doesn't use shared memory

Ok so I've learned so far that for the X-axis, the triangle's bounding box min > max

#

Which is obviously backwards...

#

How even though?? ahh

#

// Compute triangle bounding box
let min_x = u32(min3(vertex_0.x, vertex_1.x, vertex_2.x));
let min_y = u32(min3(vertex_0.y, vertex_1.y, vertex_2.y));
var max_x = u32(ceil(max3(vertex_0.x, vertex_1.x, vertex_2.x)));
var max_y = u32(ceil(max3(vertex_0.y, vertex_1.y, vertex_2.y)));
max_x = min(max_x, u32(view.viewport.z) - 1u);
max_y = min(max_y, u32(view.viewport.w) - 1u);

#

I think it's because I clamp the max values to be within screen bounds, but not min...

#

https://github.com/LVSTRI/IrisVk/blob/master/shaders/0.1/rasterizer.comp#L94C1-L96C6 stealing @wicked notch 's code should fix it 😛

GitHub

IrisVk/shaders/0.1/rasterizer.comp at master · LVSTRI/IrisVk

Contribute to LVSTRI/IrisVk development by creating an account on GitHub.

wicked notch Jul 28, 2024, 10:30 PM

#

it's UE's so beware KEKW

primal shadow Jul 28, 2024, 10:32 PM

#

Ah ok

#

Still, I figured out the issue myself, and I just wanted to check if it's ok to early-out here or not

#

Which yeah, it's fine

#

Ok so that fixes the weird wrap-around issue

#

Things are still a bit pixelated with the scanline variant though, not sure why

#

might be an off-by-1 pixel somewhere

#

Like something's weird here

primal shadow Jul 30, 2024, 3:30 AM

#

Idk how Nanite quotes 1.1ms for raster time in lumen in the land of nanite

#

That's insane

fiery bolt Jul 30, 2024, 6:40 AM

#

the persistent threads bvh traversal probably saves a bunch of time, as well as whatever micro-opt they did in the compute rasterizer

#

still, wasn't it 2-3 ms?

#

or was that the second demo

primal shadow Jul 30, 2024, 3:40 PM

#

No it was 1.1ms just for the raster, nothing else

#

The only thing I can think of is I use 64 triangle clusters, and tend to have low fill rate. They use 128 triangle clusters (better warp occupancy), and probably do better on fill rate.

fiery bolt Jul 30, 2024, 5:40 PM

#

meshopt asserts at more than 124 tris per meshlet doesn't it KEKW

#

and the simplifier doesn't work all too well (it went from simplifying 14000 -> 144 meshlets once)

#

I need to make my simplifier work

wicked notch Jul 30, 2024, 7:29 PM

#

I'm cookin lads

#

I'm cooking something special

primal shadow Jul 30, 2024, 7:33 PM

#

What of?

wicked notch Jul 30, 2024, 7:38 PM

#

a special sauce

#

the sauce that clusterizes and simplifies stuff

#

that I promised jaker 10 decades ago KEKW

faint crane Jul 30, 2024, 7:44 PM

#

There was a presentation on dynamic LoD and “nanomesh” for mobile at the part 1 of “advancements in real-time graphics in games” course. Slides expected to be out next week.

#

I recall them having a 32 bit visibility buffer which stored cluster and triangle id at 25 and 7 bits. Also an exponential distance based heuristic for selecting a cut and culling.

dull oyster Jul 30, 2024, 7:46 PM

#

an exponential distance based heuristic for selecting a cut
Exactly what I need for selecting clusters for raytracing 👀

faint crane Jul 30, 2024, 7:47 PM

#

They have another presentation Thursday. I’ll have to try my hand at it when slides are out since I couldn’t follow verbally.

#

Results were impressive for high and low end mobile though. Very encouraging.

frank sail Jul 30, 2024, 7:53 PM

#

wicked notch that I promised jaker 10 decades ago <:KEKW:666849321462792234>

let's friggin go

primal shadow Jul 30, 2024, 7:55 PM

#

faint crane There was a presentation on dynamic LoD and “nanomesh” for mobile at the part 1 ...

Slides are already up somewhere, they gave the presentation earlier

#

At gdc iirc

fiery bolt Jul 30, 2024, 8:25 PM

#

wicked notch that I promised jaker 10 decades ago <:KEKW:666849321462792234>

yes pls my metis is giving me completely unconnected meshlets in my partitions

faint crane Jul 30, 2024, 8:26 PM

#

Your Bevy article was referenced on screen.

fiery bolt Jul 30, 2024, 8:55 PM

#

i'm getting trolled by metis

primal shadow Jul 30, 2024, 8:57 PM

#

faint crane Your Bevy article was referenced on screen.

What screen?

faint crane Jul 30, 2024, 9:09 PM

#

Visibility Buffer Rendering during part 2

primal shadow Jul 30, 2024, 9:12 PM

#

Oh, cool! Are the slides posted anywhere?

#

Website doesn't have them yet 😦

faint crane Jul 30, 2024, 9:15 PM

#

Sometime next week. Photography isn’t allowed so I didn’t try.

#

You can have this though.

wide shadow Jul 30, 2024, 9:16 PM

#

might be interesting for benchmarks
https://github.com/activision/caldera?tab=readme-ov-file

GitHub

GitHub - Activision/caldera: Caldera data set from Call of Duty®: W...

Caldera data set from Call of Duty®: Warzone™. Contribute to Activision/caldera development by creating an account on GitHub.

frank sail Jul 30, 2024, 9:19 PM

#

loading it is a pain in the butt I'm finding

#

it's split into a million tiny usd files

fiery bolt Jul 30, 2024, 9:25 PM

#

frank sail it's split into a million tiny usd files

dumb them all into blender, pray it doesn't crash, export as gltf, pray it doesn't crash, repeat until it stops crashing

glass sphinx Jul 30, 2024, 9:29 PM

#

no way blender can handle that

#

prob faster to write a cpp program to do that

fiery bolt Jul 30, 2024, 11:42 PM

#

it finally works froge_love

wispy spear Jul 30, 2024, 11:47 PM

#

is that the atomium sculpture in brussels

fiery bolt Jul 30, 2024, 11:49 PM

#

that's a random table i ~~stole~~ obtained legally from megascans at max quality

wispy spear Jul 30, 2024, 11:50 PM

#

fiery bolt Jul 30, 2024, 11:50 PM

#

fiery bolt that's a random table i ~~stole~~ obtained legally from megascans at max quality

(it is repeated 1536 times in that scene)

#

(for a total of 3 billion tris)

wicked notch Jul 31, 2024, 1:08 AM

#

fiery bolt yes pls my metis is giving me completely unconnected meshlets in my partitions

it is a truly painful lib to work with

#

plus it was never meant for this shit

#

I'll roll my own

#

and then give it to you frogs

buoyant summit Jul 31, 2024, 1:25 AM

#

make sure it's written in rust

#

or I will oxidize the pins on your cpu

#

or pads, depending on which ones your cpu has

wicked notch Jul 31, 2024, 1:27 AM

#

this channel is anti rust

#

I like my iron clean and with no oxygen

fiery bolt Jul 31, 2024, 2:22 AM

#

all my code is rust frog_bath

faint crane Jul 31, 2024, 2:34 AM

#

faint crane Sometime next week. Photography isn’t allowed so I didn’t try.

This was part of the talk BTW. https://x.com/SebAaltonen/status/1818429250908610782

Sebastian Aaltonen (@SebAaltonen) on X

If you spam enough in Twitter, your Tweet might eventually transform into a SIGGRAPH presentation slide :)

Thanks for @FilmicWorlds for a great presentation!

#

"Variable Rate Shading with Visibility Buffer"

primal shadow Jul 31, 2024, 7:09 PM

#

Ok so I peeked at nanite's code and was able to fix my scanline SW raster

#

Adding the "is point in triangle" test to each pixel of the scanline instead of assuming it's covered fixed it

#

Not sure why that's neccesary, I guess some kind of partial coverage/subpixel thing

wicked notch Aug 3, 2024, 12:45 AM

#

man

#

https://tenor.com/view/jeonghan-horse-on-beach-jeonghan-man-jeonghan-jeonghan-silly-horse-on-beach-man-gif-13991646989968971533

Tenor

#

the kitchen is overloaded

#

why are there so many edge cases

#

I've been doing this for hours and I'm nowhere near done

#

and it's 3am

#

guess I'll keep going tomorrow, my eyes are not staying open bleakekw

#

this reminds me of self balancing rbtrees but worse

#

"if the node has no black successors, at least one ancestor is red or if it has no left children and at least one right child or if ..."

ebon ruin Aug 3, 2024, 2:05 AM

#

wicked notch and it's 3am

i had a feeling you dont sleep

#

sleep bro

#

or you’ll become munted and die

fiery bolt Aug 3, 2024, 6:23 AM

#

wicked notch why are there so many edge cases

edges cases for what thonk

primal shadow Aug 3, 2024, 6:25 PM

#

Did a massive refactor of the >1000 lines MeshletGpuScene into separate parts. Took ages, but much cleaner now, minus one of the new subsystems that's basically a copy paste. I still need to clean that up, and then optimize things so that we're not spending ~1ms/frame of CPU time on extract + preparing resources. https://github.com/JMS55/bevy/commit/b8ab371a2566c5c0f2fe743224854f8cad452b12

#

Overall CPU timings

faint crane Aug 5, 2024, 1:14 AM

#

Some stuff on meshlets from HPG: https://gpuopen.com/download/publications/DGF.pdf

#

https://www.youtube.com/watch?v=lx1v-BczaBM&t=19005s

YouTube

High-Performance Graphics

HPG 2024: Day 2 - July 27th

Full program: https://www.highperformancegraphics.org/2024/program/

▶ Play video

#

wicked notch Aug 5, 2024, 1:27 AM

#

pog

#

gotta read this

faint crane Aug 5, 2024, 1:28 AM

#

Also some good stuff for BVH fans. https://gpuopen.com/download/publications/HPLOC.pdf

#

https://www.youtube.com/live/qPmwbp7A3BQ?t=2151s

YouTube

High-Performance Graphics

HPG 2024: Day 1 - July 26th

Full program: https://www.highperformancegraphics.org/2024/program/

▶ Play video

#

I only attended SIGGRAPH, missed this was going on until they held a concluding panel at SIGGRAPH.

#

Just need a METIS killer.

wicked notch Aug 5, 2024, 1:29 AM

#

I'm cooking

#

I only need to figure out testing because spamming node allocations is fun and all but eh kekkedsadge

#

I wonder if someone made some test suite for this algo

faint crane Aug 5, 2024, 1:31 AM

#

Test in prod. PR to Unreal.

#

briannaPls

wicked notch Aug 5, 2024, 1:31 AM

#

you know what

#

you're actually a genius

#

I forgot I had ue cloned and built

#

I'll just drop my shit in there and see what happens KEKW

faint crane Aug 5, 2024, 1:32 AM

#

Cat is in lap. Maybe I gained a brain cell.

#

METIS but with good parameter names would be the most exciting thing out of this last week TBH.

#

I tried porting it once, but saw a bunch of GOTOs and cursed control flow.

primal shadow Aug 5, 2024, 2:56 AM

#

wicked notch gotta read this

Iirc it's proposed for hardware implementation, not (neccesairly) software, although don't quote me on it

faint crane Aug 5, 2024, 2:57 AM

#

Froge computing when?

primal shadow Aug 5, 2024, 2:59 AM

#

This paper seeks to close the gap by defining a block-compressed geometry format that is designed for arbitrary geometry topologies and can be directly consumed by future fixed-function hardware.

#

It's a bit of both, they say you can use it in mesh shaders and stuff, but also suggest it as a future hardware-level representation

primal shadow Aug 5, 2024, 5:40 AM

#

Capped off the weekend by opening up the next meshlet PR and writing up the description for it https://github.com/bevyengine/bevy/pull/14623. Once this is merged, I plan to improve the CPU performance, and then either look into fixing occlusion culling bugs, or do persistent threads style culling and save a large chunk of time + eliminate a large amount of memory allocations.

fiery bolt Aug 5, 2024, 5:42 AM

#

occlusion culling bugs froge_bleak

#

that's what I'm trying to fix (or was anyways until I decided to rewrite all my shaders in slang)

primal shadow Aug 5, 2024, 5:45 AM

#

I'm using SPD for my depth pyramid generation, which is defeinitly not conservative 😛

#

I might just have to give up and write a more complex, slower, multi-dispatch downsampling pass

fiery bolt Aug 5, 2024, 5:46 AM

#

you can set your own reduction tho

primal shadow Aug 5, 2024, 5:46 AM

#

Only the reduction op (e.g. average, max, min, whatever)

#

But the problem is sometimes you need to read more than a 2x2 when downsampling

#

Something like that, I haven't looked into it too much

fiery bolt Aug 5, 2024, 5:47 AM

#

yeah a max (for inverse z) should be conservative

primal shadow Aug 5, 2024, 5:47 AM

#

Read https://miketuritzin.com/post/hierarchical-depth-buffers/

Hierarchical Depth Buffers

Overview A hierarchical depth buffer is a multi-level depth (Z) buffer used as an acceleration structure for depth queries. As with normal texture mip chains, the dimensions of each level are generally successive power-of-2 fractions of the full-resolution buffer’s dimensions. In this article I present two techniques for generating a hierarchica...

fiery bolt Aug 5, 2024, 5:47 AM

#

I was also fixing my SPD before rewriting everything in slang lol

#

dxc infinite looped when I made my 6th mip globallycoherent bleakekw

primal shadow Aug 5, 2024, 5:51 AM

#

Fun. I don't even have access to that, I just split it into 2 dispatches (the second is only needed if you have a large enough initial texture, which is usually no)

fiery bolt Aug 5, 2024, 6:06 AM

#

primal shadow Fun. I don't even have access to that, I just split it into 2 dispatches (the se...

isn't the limit 512x512

primal shadow Aug 5, 2024, 6:12 AM

#

idr off the top of my head

fiery bolt Aug 5, 2024, 6:18 AM

#

it'd be 2^7 or 8 depending on how you generate mips

#

unless you go to quarter res directly

fiery bolt Aug 9, 2024, 7:26 PM

#

@primal shadow i noticed you use the current camera view in the early pass - shouldn't that be the previous frame view?

primal shadow Aug 9, 2024, 7:45 PM

#

fiery bolt <@145540119141679105> i noticed you use the current camera view in the early pas...

So I do, crap. previous_view is imported, but it somehow isin't used anymore... I could've sworn I had fixed this. Thanks for pointing it out.

#

Oh you know what, maybe I bind previous view as the current view? Hmm

fiery bolt Aug 9, 2024, 7:55 PM

#

primal shadow Oh you know what, maybe I bind previous view as the current view? Hmm

then you'd be frustum culling and making lod decisions against the previous view

primal shadow Aug 9, 2024, 7:55 PM

#

Yeah it's not that

#

I really don't know how this got messed up, because I explicitly fixed it in a PR dedicated to this kind of thing 😅

#

Maybe a lost commit? idk

fiery bolt Aug 9, 2024, 7:55 PM

#

lol

primal shadow Aug 9, 2024, 8:06 PM

#

I'm so confused what my code is doing 🤔

#

Oh ok I think I've fixed it

#

This should be it https://github.com/JMS55/bevy/commit/d49854ab521c09b997161a66605a31ab6c55d6d2?diff=unified&w=0

fiery bolt Aug 10, 2024, 9:54 PM

#

hmmm 90ms for 3 billion tris, I should probably get started with the bvh and persistent threads cyberpoonk

#

I'm gonna TDR so much

wispy spear Aug 10, 2024, 9:55 PM

#

increase the TDR timeout think

dull oyster Aug 10, 2024, 9:55 PM

#

render 1 billion tris and temporaly accumulate them

wicked notch Aug 10, 2024, 9:56 PM

#

fiery bolt hmmm 90ms for 3 billion tris, I should probably get started with the bvh and per...

are you actually bound by culling tho

#

or just raster

fiery bolt Aug 10, 2024, 9:57 PM

#

wicked notch or just raster

it shouldn't be raster, my lods and occlusion culling sorta work

wicked notch Aug 10, 2024, 9:57 PM

#

did you profile 🐸

fiery bolt Aug 10, 2024, 9:57 PM

#

and nosight shows me that my task shader has the shittiest occupancy known to man

#

like, 8 warps per SM

#

30% instruction issue rate

wicked notch Aug 10, 2024, 9:58 PM

#

show TS

fiery bolt Aug 10, 2024, 9:58 PM

#

TS?

wicked notch Aug 10, 2024, 9:58 PM

#

task shader

fiery bolt Aug 10, 2024, 9:58 PM

#

ah

#

📎 common.l.hlsl 📎 early.a.hlsl 📎 task.l.hlsl

#

the worst code you're ever gonna read

wicked notch Aug 10, 2024, 10:04 PM

#

I mean you could've done the meshlet emit count thingy with subgroup ops instead of atomicAdd'ing one every time

#

but it's fine otherwise

fiery bolt Aug 10, 2024, 10:05 PM

#

wicked notch I mean you could've done the meshlet emit count thingy with subgroup ops instead...

as in subgroup add, then index 0 does the atomic?

wicked notch Aug 10, 2024, 10:05 PM

#

doesn't need the atomic anymore innit

#

because your group size will be equal to subgroup size

fiery bolt Aug 10, 2024, 10:05 PM

#

my workgroup size isn't subgroup size thonk

wicked notch Aug 10, 2024, 10:06 PM

#

yes

#

you will set it to that though if you want it to work KEKW

#

with subgroup ops that is

fiery bolt Aug 10, 2024, 10:06 PM

#

well I guess

wicked notch Aug 10, 2024, 10:06 PM

#

but it's fine either way

fiery bolt Aug 10, 2024, 10:06 PM

#

yeah

wicked notch Aug 10, 2024, 10:06 PM

#

profile your shader and look which inst is taking up the most cycles

fiery bolt Aug 10, 2024, 10:07 PM

#

wicked notch Aug 10, 2024, 10:07 PM

#

me when LGSB

fiery bolt Aug 10, 2024, 10:09 PM

#

actually, can nosight tell me the number of mesh workgroups dispatched

wicked notch Aug 10, 2024, 10:10 PM

#

update to the beta nosight

fiery bolt Aug 10, 2024, 10:22 PM

#

wait where do you get beta versions

#

am i stupid

wicked notch Aug 10, 2024, 10:22 PM

#

it's actually just the 2024.2 version

fiery bolt Aug 10, 2024, 10:22 PM

#

oh

#

lmao

wicked notch Aug 10, 2024, 10:23 PM

#

nsight is in release only the other stuff is in public beta

#

I'm dumb

fiery bolt Aug 10, 2024, 10:23 PM

#

KEKW

wicked notch Aug 10, 2024, 10:23 PM

#

I just updated too

#

when september comes around I need to see if I can ask my supervisor to sign an NDA with NVIDIA

#

so I can get nsight pro froge_love

fiery bolt Aug 10, 2024, 10:24 PM

#

where do i get the mesh shader stats, i've been using 2024.2 for a bit an never seen them

wicked notch Aug 10, 2024, 10:24 PM

#

it's been a long time since I last used nsight bleakekw

#

(literally only 2 months)

fiery bolt Aug 10, 2024, 10:26 PM

#

i need to get myself a supervisor, i'm just a lowly first year froge_sad

#

well technically second soon

#

top stall 1: LGSB 49% bleaker_kekw

#

i wonder if i can (ab)use mesh shaders to switch between hardware and software raster in the same shader thonk

#

SetMeshOutputCounts(0, 0) and software raster or something

ebon ruin Aug 10, 2024, 10:53 PM

#

What did gp do before virtual geometry lol

fiery bolt Aug 10, 2024, 10:53 PM

#

actually release games

wicked notch Aug 10, 2024, 10:55 PM

#

too real

fiery bolt Aug 10, 2024, 10:56 PM

#

i reduced my mesh shader payload size and went from 60 to 27 ms

#

what the fuck

wicked notch Aug 10, 2024, 10:56 PM

#

you should keep it to less than 104 bytes

#

as per nv's suggestions

fiery bolt Aug 10, 2024, 10:57 PM

#

i was at... 768

#

bleaker_kekw

wicked notch Aug 10, 2024, 10:57 PM

#

incredible

fiery bolt Aug 10, 2024, 10:57 PM

#

now it's 256

#

one index per meshlet

#

how do i make it smaller

wicked notch Aug 10, 2024, 10:58 PM

#

like this https://github.com/LVSTRI/Retina/blob/1152267563683bb6a8a6731ce31b8bc07ffe88c4/src/Retina/Sandbox/Shaders/Visbuffer.task.glsl

fiery bolt Aug 10, 2024, 10:59 PM

#

good idea

#

but my late culling pass has fragmented indices

#

i should get rid of the whole meshlet pointer shit tbh

loud crag Aug 10, 2024, 11:43 PM

#

wicked notch I mean you could've done the meshlet emit count thingy with subgroup ops instead...

my apple gpu hung when i used subgroup ops instead of an atomic

wicked notch Aug 10, 2024, 11:45 PM

#

gpu issue ig

faint crane Aug 10, 2024, 11:50 PM

#

There are frogs in your computer eating up all the threads.

frank sail Aug 10, 2024, 11:51 PM

#

frogs in my computer forgeeep

ebon ruin Aug 11, 2024, 12:08 AM

#

i wonder if apple had a good reason to use their own graphics api

#

it wouldnt surprise me if they did it just to make programmers slightly more suicidal

buoyant summit Aug 11, 2024, 1:12 AM

#

loud crag my apple gpu hung when i used subgroup ops instead of an atomic

apple compiler miscompiles subgroup ops

loud crag Aug 11, 2024, 1:22 AM

#

in what way

#

more details pls

buoyant summit Aug 11, 2024, 1:24 AM

#

cc @craggy shale

buoyant summit Aug 11, 2024, 1:24 AM

#

loud crag in what way

in doing lots of invalid transforms to these ops like moving across control flow, and not using correct tangles (sets of threads that participate in a simd_ op)

loud crag Aug 11, 2024, 1:26 AM

#

so the asahi compiler wouldnt have had the same hang issues?

#

how tf can they not compile correctly for their own hardware

#

also still not eure what you mean

#

too eeepy for this

#

forgeeep

faint crane Aug 11, 2024, 1:37 AM

#

Frogs in your brain.

buoyant summit Aug 11, 2024, 1:39 AM

#

loud crag how tf can they not compile correctly for their own hardware

they use llvm

primal shadow Aug 11, 2024, 2:10 AM

#

wicked notch I mean you could've done the meshlet emit count thingy with subgroup ops instead...

I never tested to see if that was faster, but I'm also switching to persistent threads soon anyways.

fiery bolt Aug 11, 2024, 3:09 AM

#

I don't understand what exactly they use to bound their BVH froge_sad

#

probably need to put some thought into it

#

but first I need to fix my cursed task shader abuse

fiery bolt Aug 11, 2024, 5:19 AM

#

https://github.com/Themaister/Granite/blob/master/assets/shaders/post/hiz.comp

GitHub

Granite/assets/shaders/post/hiz.comp at master · Themaister/Granite

My personal Vulkan renderer. Contribute to Themaister/Granite development by creating an account on GitHub.

loud crag Aug 11, 2024, 1:30 PM

#

buoyant summit they use llvm

is that the reason for the uniform control flow issues

#

i remember a presentation from apple about that in llvm and how they managed that

buoyant summit Aug 11, 2024, 1:30 PM

#

idk

#

you can make llvm actually work

#

NV has done that somehow

#

@craggy shale pls msl simd_ miscompilation examples

loud crag Aug 11, 2024, 1:32 PM

#

the guy from apple i spoke to about this said he had no idea why my shader was causing it by looking at it briefly

#

just blamed it on mesh shaders

craggy shale Aug 11, 2024, 1:32 PM

#

buoyant summit <@263681346872803331> pls msl simd_ miscompilation examples

just what i had on mastodon

buoyant summit Aug 11, 2024, 1:32 PM

#

https://tenor.com/view/honestly-quite-incredible-drinking-water-underwater-cool-gif-22285702

Tenor

primal shadow Aug 11, 2024, 1:38 PM

#

fiery bolt https://github.com/Themaister/Granite/blob/master/assets/shaders/post/hiz.comp

Wait, someone adapted SPD to do conservative downscaling??? This is litterly what I need

loud crag Aug 11, 2024, 1:39 PM

#

havent we all done this now

#

min reduction sampler or with just min() when reading the 4 samples

loud crag Aug 11, 2024, 1:40 PM

#

craggy shale just what i had on mastodon

i cant figure out mastodon search so i can‘t find shit

primal shadow Aug 11, 2024, 1:43 PM

#

loud crag min reduction sampler or with just min() when reading the 4 samples

In a single pass in a compute shader? You can do it with more than 1 pass of a fragment shader, but that's slooow

loud crag Aug 11, 2024, 1:47 PM

#

primal shadow In a single pass in a compute shader? You can do it with more than 1 pass of a f...

ye lvstri does this and ive done this (semi-successfully)

#

with the nvidia spd

buoyant summit Aug 11, 2024, 1:51 PM

#

loud crag i cant figure out mastodon search so i can‘t find shit

https://mastodon.gamedev.place/@gob/109548806532038622

mastodon.gamedev.place

Hugo Devillers

Hugo Devillers (@[email protected])

so far it seems that in metal-land:

we have LLVM-style loop reconvergence with the usual broken consequences (I expected that...)
multiple calls to simd_active_threads_mask end up merged, even if they appear in different blocks (🤡)
I don't know what games simd_first() is playing, maybe it tries to lie about the first point or someth...

#

it's old tho so maybe they have fixed things since

#

gob also had a screenshot somewhere where they cse'd simd_add(1) across control flow

primal shadow Aug 11, 2024, 1:55 PM

#

loud crag with the nvidia spd

As-is, or did you need to modify the algorithm at all?

wicked notch Aug 11, 2024, 2:02 PM

#

no changes required

#

it was very straightforward actually

loud crag Aug 11, 2024, 2:39 PM

#

buoyant summit gob also had a screenshot somewhere where they cse'd simd_add(1) across control ...

i was only using simd_prefix_exclusive_sum and simd_sum

#

oh wait i get it

#

if i was using 32 meshlets per invocation, i had no loop (optimised away) and then it worked fine

#

anything else and it hung, probably from using the simd stuff in the loop

#

might still be related tbf

#

though again im no expert on these compiler shenanigans so no idea what this “loop reconvergence” means

#

mayhaps i could inspect the generated IR and see whats going on

fiery bolt Aug 11, 2024, 6:06 PM

#

primal shadow Wait, someone adapted SPD to do conservative downscaling??? This is litterly wha...

yup

fiery bolt Aug 11, 2024, 6:06 PM

#

loud crag ye lvstri does this and ive done this (semi-successfully)

https://themaister.net/blog/2024/01/

Building the mip-chain in one pass is great for performance, but causes some problems. With NPOT textures and single pass, there is no obvious way to create a functional HiZ, and the go-to shader for this, FidelityFX SPD, doesn’t support that use case.

The problem is that the size of mip-maps round down, so if we have a 7×7 texture, LOD 1 is 3×3 and LOD 2 is 1×1. In LOD2, we will be able to query a 4×4 depth region, but the edge pixels are forgotten.

The “obvious” workaround is to pad the texture to POT, but that is a horrible waste of VRAM. The solution I went with instead was to fold in the neighbors as the mips are reduced. This makes it so that the edge pixels in each LOD also remembers depth information for pixels which were truncated away due to NPOT rounding.

I rolled a custom HiZ shader similar to SPD with some extra subgroup shenanigans because why not (SubgroupShuffleXor with 4 and 8).

primal shadow Aug 11, 2024, 6:18 PM

#

fiery bolt https://themaister.net/blog/2024/01/ > Building the mip-chain in one pass is gre...

Huh, I've read that before, but must have forgot that part

wicked notch Aug 11, 2024, 8:19 PM

#

this refactor is going very well

#

I say to myself as I wonder why I can't keep working on the same thing

fiery bolt Aug 12, 2024, 1:38 AM

#

9.6 ms to lod and cull 210384384 meshlets thonk

#

85% SM utilization

#

i doubt i can get eek out more perf out of this

#

which means it's BVH time

wicked notch Aug 12, 2024, 9:13 PM

#

single ownership rules

glass sphinx Aug 13, 2024, 1:23 AM

#

this is now the meshlet rendering channel

wicked notch Aug 13, 2024, 1:24 AM

#

always has been

fiery bolt Aug 13, 2024, 1:25 AM

#

@primal shadow https://github.com/bevyengine/bevy/blob/6183b56b5d6fd7e2e8cf1f3c85da7fd3dab25ea4/crates/bevy_pbr/src/meshlet/from_mesh.rs#L90
another thing i noticed, shouldn't the fold acc start with 0, not the parent error?

GitHub

bevy/crates/bevy_pbr/src/meshlet/from_mesh.rs at 6183b56b5d6fd7e2e8...

A refreshingly simple data-driven game engine built in Rust - bevyengine/bevy

fiery bolt Aug 13, 2024, 1:25 AM

#

glass sphinx this is now the meshlet rendering channel

nanite channel

#

wish.com nanite channel bleaker_kekw

wicked notch Aug 13, 2024, 1:25 AM

#

our nanite is better

#

by virtue of being open source

#

and that we're all learning from each other froge_love

fiery bolt Aug 13, 2024, 1:26 AM

#

my occlusion culling still decides to cull shit randomly

#

for no reason whatsoever

#

it's so random i think the bounding sphere generation is fucking up somewhere

primal shadow Aug 13, 2024, 1:30 AM

#

fiery bolt <@145540119141679105> https://github.com/bevyengine/bevy/blob/6183b56b5d6fd7e2e8...

Yeah good catch, I think it should :P. Thanks for pointing this out.

fiery bolt Aug 13, 2024, 1:31 AM

#

your performance is now gonna half lol

primal shadow Aug 13, 2024, 1:31 AM

#

Lol we'll see

fiery bolt Aug 13, 2024, 1:32 AM

#

actually no it might improve

primal shadow Aug 13, 2024, 1:32 AM

#

I have a ton of stuff to implement still, there's so much I can improve

fiery bolt Aug 13, 2024, 1:32 AM

#

because it'll switch to the lower LOD earlier

fiery bolt Aug 13, 2024, 1:32 AM

#

primal shadow I have a ton of stuff to implement still, there's so much I can improve

yeah same

#

if only this occlusion culling started to work properly...

primal shadow Aug 13, 2024, 1:33 AM

#

Mine is broken, I need to fix it

fiery bolt Aug 13, 2024, 1:33 AM

#

froge_bleak

#

who's isn't

primal shadow Aug 13, 2024, 1:35 AM

#

The fold might be better written as a map and then a max, hmmm

fiery bolt Aug 13, 2024, 1:41 AM

#

i do map(..).reduce(f32::max).unwrap()

faint crane Aug 13, 2024, 5:22 AM

#

fiery bolt my occlusion culling still decides to cull shit randomly

Stochastic occlusion culling. Eventually it will converge into something which works.

fiery bolt Aug 13, 2024, 5:36 AM

#

unfortunately it doesn't

#

maybe i should let it run for longer

wicked notch Aug 14, 2024, 7:43 PM

#

I now have the the greatest timeline abstraction thanks to Dolkar

glass sphinx Aug 14, 2024, 7:49 PM

#

dolkar is indeed the timeline god

wispy spear Aug 14, 2024, 7:52 PM

#

viriboop

loud crag Aug 14, 2024, 8:00 PM

#

@wicked notch in your idea of sparse vsm, when would you free pages that are not visible? In the normal VSM we have those two buffers for (1) free pages, and (2) cached/invisible pages, and we allocate from free pages before re-using cached pages.
Now the problem I am having is this line in the Metal docs:

If the heap runs out of memory, Metal skips any remaining tiles in the request.
This means I need some way of figuring out when to free cached pages, so that I don't run out of memory... What were you imagining to solve this kind of problem?

wispy spear Aug 14, 2024, 8:01 PM

#

count up the space you consume as you request pages, and if above a certain threshold you flush old pages?

loud crag Aug 14, 2024, 8:02 PM

#

hmm sure i guess i could query the amount of sparse memory I am currently using, and then when I get close to OOM I could just evict a few unused but cached pages

wicked notch Aug 14, 2024, 8:03 PM

#

loud crag <@320895822394818561> in your idea of sparse vsm, when would you free pages that...

"my sparse" is the same as everyone else's, so you just free all pages that are not visible

loud crag Aug 14, 2024, 8:03 PM

#

what about caching though

wicked notch Aug 14, 2024, 8:03 PM

#

we only cache in view pages

loud crag Aug 14, 2024, 8:03 PM

#

oh bruh

#

ok ig that simplifies a lot

wicked notch Aug 14, 2024, 8:04 PM

#

to keep out of view pages cached you can just implement an LRU cache

loud crag Aug 14, 2024, 8:04 PM

#

what's that

wicked notch Aug 14, 2024, 8:04 PM

#

and pop pages as needed before allocating new ones

wicked notch Aug 14, 2024, 8:05 PM

#

loud crag what's that

a cache policy

#

one of many

#

least recently used (LRU)

loud crag Aug 14, 2024, 8:05 PM

#

well that could be something for later then

buoyant summit Aug 14, 2024, 8:10 PM

#

wicked notch I now have the the greatest timeline abstraction thanks to Dolkar

details?

wicked notch Aug 14, 2024, 8:12 PM

#

it's actually the easiest solution but my dumb ass brain couldn't reason about it

#

so you just have one timeline semaphore per queue as usual

#

but the timeline value is shared between them and it's always monotonically increasing

#

to ensure that stuff gets deleted properly, you just collect every queue's last reached value (from the semaphore if needed) and take the min of that

#

then you use that as the value to compare when deleting stuff

#

also if a queue isn't used for a while, you just fast forward its last reached value to the current maximum value (otherwise the deletion queue would get stuck)

#

the source is here https://github.com/Dolkar/Tephra/blob/f488fae39bf97819bb7f3df7007fceaa1bb504c3/src/tephra/device/timeline_manager.cpp

#

I took heavy inspiration (~~to avoid saying straight up copied~~) from Dolkar's abstraction

buoyant summit Aug 14, 2024, 8:15 PM

#

I see

#

cool

fiery bolt Aug 14, 2024, 8:31 PM

#

i just realized, instead of taking LOD error as a sphere with center = centroid of the group, shouldn't it be a sphere with center = closest point of the group bounding sphere thonk

primal shadow Aug 14, 2024, 11:35 PM

#

Wdym closet point?

fiery bolt Aug 15, 2024, 2:21 AM

#

primal shadow Wdym closet point?

closest point to the camera

#

so you'd have to store the group bounding sphere, project that, and then place your 'lod sphere' at center - (0, 0, radius)

delicate rain Aug 15, 2024, 7:10 AM

#

wicked notch we only cache in view pages

I don't 🥸

#

LRU cache is cope, what I'll do is sort based on the cascade index, so that I free the cached pages in the lowest cascade first

#

Since those will have the shortest lifespan anyways

wicked notch Aug 15, 2024, 12:21 PM

#

smart

fiery bolt Aug 18, 2024, 11:44 PM

#

why is meshopt generating meshlets with an average of 42 tris

frank sail Aug 18, 2024, 11:51 PM

#

meshopt knows the meaning of life

wicked notch Aug 18, 2024, 11:54 PM

#

real

#

I'm almost back btw

#

I just need this one last feature I promise

#

just one last feature

fiery bolt Aug 20, 2024, 2:56 AM

#

100 billion trogne

#Iris - A Journey through OpenGL and beyond to learn Graphics