wicked notch Oct 29, 2023, 12:54 PM

#

what's frustumSize here?

#

the width of the projection?

frank sail Oct 29, 2023, 12:54 PM

#

yeah

#

like the params you supply to glm::ortho

#

couldn't figure it out so I'm gonna keep my unhinged inverse thing

delicate rain Oct 29, 2023, 1:06 PM

#

I think you should project the basis from ndc virtual space into world space -> x = offset * unprojected_basis -> translate by x

#

you also need to translate by z in order for the camera to slide along the same plane no?

frank sail Oct 29, 2023, 1:07 PM

#

not in view space

delicate rain Oct 29, 2023, 1:07 PM

#

right but you translate the view matrix that does not mean translating in the view space no?

#

or am I dumb

frank sail Oct 29, 2023, 1:08 PM

#

idk

delicate rain Oct 29, 2023, 1:08 PM

#

translating the view matrix imo means "moving the camera"

#

in world space

frank sail Oct 29, 2023, 1:08 PM

#

o

delicate rain Oct 29, 2023, 1:08 PM

#

thats why you need to unproject from the viewspace of the vsm first

#

or multiply by the vsm view matrix to get it into vsm space translate there and then go back to world space

#

these should be identical

frank sail Oct 29, 2023, 1:09 PM

#

or do glm::inverse(stableProjections[i]) * shiftedProjection * stableViewMatrix; smart

delicate rain Oct 29, 2023, 1:10 PM

#

I have no clue what that does lmao

frank sail Oct 29, 2023, 1:10 PM

#

it does what you described probably

#

the main thing is that it works

delicate rain Oct 29, 2023, 1:10 PM

#

tru

frank sail Oct 29, 2023, 1:11 PM

#

I don't like how translating a view matrix does not translate in view space

#

someone should fix that

delicate rain Oct 29, 2023, 1:13 PM

#

you can do that by doing projection * view_space_translation * view_space * world_space_translation

wicked notch Oct 29, 2023, 1:13 PM

#

bug fix for math.exe

wicked notch Oct 29, 2023, 1:13 PM

#

delicate rain you can do that by doing projection * view_space_translation * view_space * wor...

this translates in view space?

delicate rain Oct 29, 2023, 1:13 PM

#

yes because you first project into view space and then translate

#

which does translation in view space

#

I had the order wrong my bad

wicked notch Oct 29, 2023, 1:18 PM

#

Are glm::ortho's parameters in ndc units or world units?

delicate rain Oct 29, 2023, 1:18 PM

#

world

wicked notch Oct 29, 2023, 1:25 PM

#

Hmmmm

wicked notch Oct 29, 2023, 4:18 PM

#

I have come to the realization that with HWVSM none of the techniques you guys use for caching work, so I'll have to invent my own

#

either that, or I go back to SWVSM

distant lodge Oct 29, 2023, 5:06 PM

#

you should do vulkan 1.2 compatible SWVSM :^^^)

#

that can run on a gtx 760

#

at 60fps on bistro

delicate rain Oct 29, 2023, 5:15 PM

#

wicked notch I have come to the realization that with HWVSM none of the techniques you guys u...

how come?

wicked notch Oct 29, 2023, 5:16 PM

#

In any possible case, I can't simply set gl_Position and write

#

I need to shift the ndc position by some factor and have it wrap around to raster to the correct page

wispy spear Oct 29, 2023, 5:21 PM

#

i took the liberty to acquire a potential new vsm customer in #wip

raven orchid Oct 29, 2023, 5:27 PM

#

wicked notch I have come to the realization that with HWVSM none of the techniques you guys u...

Hpb building should still work though right?

#

Like if your internal marker says the page is cached, don’t include it in hpb

#

Which means hpb culling will delete it frogdelet

wicked notch Oct 29, 2023, 5:28 PM

#

raven orchid Hpb building should still work though right?

ye that's fine

#

the big issue is stable addressing

raven orchid Oct 29, 2023, 5:28 PM

#

Oh yeah

#

I’ve had some success with fractional shifting of render matrices and caching

#

And then for addressing I use the untranslated version

wicked notch Oct 29, 2023, 5:30 PM

#

yeah, my only hope right now is to do some unholy math

#

much more unholy than what Jaker cooked up kekkedsadge

wispy spear Oct 29, 2023, 5:39 PM

#

abusing div by zero incoming 😄

wicked notch Oct 29, 2023, 5:46 PM

#

ok simplest case

#

truncate camera's position

#

time to draw

#

suppose camera position = stable origin = 0

#

then view = stable_view

#

now what happens when I move one unit to the right

#

only god knows (jk I'm computing)

wispy spear Oct 29, 2023, 5:50 PM

#

: )

cold sky Oct 29, 2023, 5:51 PM

#

wicked notch I need to shift the ndc position by some factor and have it wrap around to raste...

yep, you're kinda fucked

#

btw Light Perspective Shadow Mapping used to do some unholy trick with Z/W divide to have shit wraparound with weird perspective

#

but here you need to wraparound by 2 axes

#

in HW the only way to do that is to set up 4 separate viewports and multi-cast/cull your meshlets into them

#

ngl, SW raster (at least SW Raster Output Processing) is sounding nicer and nicer by the day

wicked notch Oct 29, 2023, 5:56 PM

#

ok no matter

#

caching is dead

#

very good

raven orchid Oct 29, 2023, 5:57 PM

#

lvstri for sw raster

#

didn't you say yesterday nanite only needs to call render shadows once?

wicked notch Oct 29, 2023, 5:57 PM

#

yeah I'm switching back to SWVSM

#

goodbye HWVSM, it was fun while it lasted

raven orchid Oct 29, 2023, 5:58 PM

#

found it https://discordapp.com/channels/318590007881236480/1090390868449558618/1167959620145401906

wicked notch Oct 29, 2023, 5:59 PM

#

ye they do it only once

#

multiview ftw

raven orchid Oct 29, 2023, 5:59 PM

#

wow that's extremely powerful

wicked notch Oct 29, 2023, 7:34 PM

#

the HWVSM branch is gone

#

I am not crying

wispy spear Oct 29, 2023, 7:58 PM

#

maybe one day, while sitting on the loo, it will hit you : ) but not today hehe

frank sail Oct 29, 2023, 8:57 PM

#

cold sky in HW the only way to do that is to set up 4 separate viewports and multi-cast/c...

You literally don't need that here because you address pages yourself

#

Even with hw sparse actually

wicked notch Oct 29, 2023, 8:58 PM

#

what do you mean

frank sail Oct 29, 2023, 9:00 PM

#

Idk just felt like saying stuff

#

Actually I'm probably misinfo about hw sparse there but I can't think bc I just woke up

wicked notch Oct 29, 2023, 9:01 PM

#

giving me false hope smh

delicate rain Oct 29, 2023, 9:01 PM

#

wicked notch giving me false hope smh

https://tenor.com/view/team-fortress2-bonk-america-sings-gif-26134549

Tenor

#

NO HARDWARE

wicked notch Oct 29, 2023, 9:03 PM

#

no hardware :(

wicked notch Oct 29, 2023, 9:41 PM

#

I thought I'd take a bit of a detour and do ImGUI

#

so that I can have sexy debug visuals like Jaker and Saky

#

however there is a small issue, to render an image in ImGUI I need to give it a descriptor set with an image bound

#

...How do I make this descriptor set, I don't have access the internal layout ImGUI creates

frank sail Oct 29, 2023, 9:44 PM

#

either guess or read the source

wicked notch Oct 29, 2023, 9:48 PM

#

I need the handle itself tho

#

The VkDescriptorSetLayout inside of ImGUI

#

It's hidden here: https://github.com/ocornut/imgui/blob/master/backends/imgui_impl_vulkan.cpp#L110

#

how the hell do I do this bleakekw

#

I don't wanna create a new backend

distant lodge Oct 29, 2023, 9:49 PM

#

just write your own ba-

#

nvm

wicked notch Oct 29, 2023, 9:51 PM

#

@delicate rain how does daxa handle it

frank sail Oct 29, 2023, 9:51 PM

#

Use OpenGL interop and use the gl backend

#

frog_shrimple le as that

wicked notch Oct 29, 2023, 9:51 PM

#

This is the universe calling me back to GL innit

delicate rain Oct 29, 2023, 9:51 PM

#

I believe we have our own shaders into which we pass the daxa::ImageIds

#

and samplers

wicked notch Oct 29, 2023, 9:52 PM

#

so you have your own backend, epic

#

Ok since I really don't want to spend the night writing a backend for shitty ImGUI

delicate rain Oct 29, 2023, 9:52 PM

#

Yeah

wicked notch Oct 29, 2023, 9:52 PM

#

I'll do something cursed

delicate rain Oct 29, 2023, 9:52 PM

#

Just spend the night porting to Daxa

#

😈😈

wispy spear Oct 29, 2023, 9:54 PM

#

hehe the purple fits

frank sail Oct 29, 2023, 9:55 PM

#

wicked notch Ok since I *really* don't want to spend the night writing a backend for shitty I...

I mean obviously there must be a way to use the premade backend

glass sphinx Oct 29, 2023, 9:55 PM

#

delicate rain Just spend the night porting to Daxa

https://tenor.com/view/nasferatu-rise-creepy-smile-gif-13983854

Tenor

frank sail Oct 29, 2023, 9:55 PM

#

Just find someone's code that uses it

glass sphinx Oct 29, 2023, 9:56 PM

#

wicked notch I don't wanna create a new backend

you make a new backend

#

thats how daxa handles it

#

the imgui backends are not meant to be used really they are mostly examples

#

you should make your own

wicked notch Oct 29, 2023, 9:56 PM

#

sigh

#

I will

glass sphinx Oct 29, 2023, 9:57 PM

#

i love how the devil emoji is daxa colored

frank sail Oct 29, 2023, 9:58 PM

#

glass sphinx the imgui backends are not meant to be used really they are mostly examples

tf

#

it's not impossible to use them though bleakekw

wispy spear Oct 29, 2023, 9:59 PM

#

it should also be fairly simple to make an imgui backend

#

its literally just shoving verticles and indices into a vbo/ibo, loading 2 shrimple shaders, and textures : )

glass sphinx Oct 29, 2023, 10:00 PM

#

they are fine

#

but usually they start to be annoying at some point

#

for daxa it took me a few hours (i had to debug a while cause the stupit in my head)

wicked notch Oct 29, 2023, 10:12 PM

#

Thank god vulkan is well designed

wispy spear Oct 29, 2023, 10:12 PM

#

lol

wicked notch Oct 29, 2023, 10:12 PM

#

as it turns out you don't need to create a descriptor set out of the exact handle, you just need the layouts to be compatible

wispy spear Oct 29, 2023, 10:12 PM

#

that is something i noticed during my early days with bulkan

#

the validator layer didnt always slap me in the face, only occassionally

#

while i was expecting it would at all times i fook something up

#

like having uniforms and shaders not be compatible

#

i think i forgor a part in the shader, while it was described in all the descriptorisms or other way around, it didnt yell at me

delicate rain Oct 29, 2023, 10:26 PM

#

I was recently creating graphics pipeline with zero color and depth attachments and it didn't say anything

#

and still wrote to the first bound output from frag shader

wispy spear Oct 29, 2023, 10:27 PM

#

ah i think i had something like that too

delicate rain Oct 29, 2023, 10:27 PM

#

also does not complain when you fuck up the usage flags - for example when you don't set texture as sampled it will say nothing but the sampler reads will just return nothing

#

very fun to debug haha

wispy spear Oct 29, 2023, 10:27 PM

#

speaking of vkguide, i should also continue : (

delicate rain Oct 29, 2023, 10:28 PM

#

yes!

wispy spear Oct 29, 2023, 10:28 PM

#

memory transfer nonsense : )

frank sail Oct 29, 2023, 10:28 PM

#

delicate rain I was recently creating graphics pipeline with zero color and depth attachments ...

That should be fine though

#

I do that for VSM because I just need image store

delicate rain Oct 29, 2023, 10:29 PM

#

yeah but I was binding two color attachments during begin rendering

#

which should not be compatible with the pipeline no?

frank sail Oct 29, 2023, 10:30 PM

#

Yeah that seems wrong

wicked notch Oct 29, 2023, 10:36 PM

#

pog

#

This is my first time setting up ImGui on Vulkan lol

#

how do I get rid of that shitty border around my window

#

not the viewport, the main window

#

this border

wispy spear Oct 29, 2023, 10:53 PM

#

ImGui_NextWindowItemPos or something

#

or per style

#

one sec

wicked notch Oct 29, 2023, 10:53 PM

#

can you pass your own style somehow

#

or is it copyrighted

frank sail Oct 29, 2023, 10:54 PM

#

Certain imgui calls require registering a license

wispy spear Oct 29, 2023, 10:54 PM

#

: )

#

https://github.com/deccer/EngineKit/blob/main/src/EngineKit/UI/UIRenderer.cs#L540C19-L540C19

#

thats my dark style, i think, the one jaker stole for frogfood too

frank sail Oct 29, 2023, 10:55 PM

#

All of my imguiisms are in Gui.cpp

wispy spear Oct 29, 2023, 10:56 PM

#

was about to say, perhaps steal from frogfood, because of c# isms and whatnot

frank sail Oct 29, 2023, 10:56 PM

#

Including the pirated style

wispy spear Oct 29, 2023, 10:56 PM

#

and dont forget that one little thing

#

style.WindowMenuButtonPosition = ImGuiDir.None;

#

to get rid of that dropdown thingy next to the window "tab"

frank sail Oct 29, 2023, 10:56 PM

#

I don't get the point of that button tbh

wispy spear Oct 29, 2023, 10:57 PM

#

you also need to y offset your fonts, when using fontawesome

#

https://github.com/deccer/EngineKit/blob/main/src/EngineKit/UI/UIRenderer.cs#L219 as reference

frank sail Oct 29, 2023, 10:57 PM

#

I had to use trial and error to find the correct offset agonyfrog

wispy spear Oct 29, 2023, 10:58 PM

#

pobalbyl different per font too somehow

frank sail Oct 29, 2023, 10:58 PM

#

It is

distant lodge Oct 29, 2023, 11:11 PM

#

it's the ascent of the font probably

frank sail Oct 29, 2023, 11:12 PM

#

I'll let you judge
https://github.com/JuanDiegoMontoya/Frogfood/blob/a3271d30930ecc121b3c4728ef4b31a40877cbba/src/Gui.cpp#L53

wicked notch Oct 29, 2023, 11:12 PM

#

sick style

#

but the fix was using those two little shits

ImGui::PushStyleVar(ImGuiStyleVar_WindowBorderSize, 0.0f);
ImGui::PushStyleVar(ImGuiStyleVar_WindowPadding, ImVec2());```

#

now border is no more

frank sail Oct 29, 2023, 11:14 PM

#

Nice

#

Needs FSR 2 though

wicked notch Oct 29, 2023, 11:15 PM

#

it really does

#

this night will be spent implementing basic features

#

like AA KEKW

wispy spear Oct 29, 2023, 11:15 PM

#

wicked notch but the fix was using those two little shits ```cpp ImGui::PushStyleVar(ImGuiSty...

ye those were the ones i mean 😄

#

what is this lvstri, irisvk 2.0?

wicked notch Oct 29, 2023, 11:17 PM

#

irisvk except it's not a VSM only showcase bleakekw

frank sail Oct 29, 2023, 11:17 PM

#

irisvuk

wispy spear Oct 29, 2023, 11:20 PM

#

irisfukall

wicked notch Oct 29, 2023, 11:20 PM

#

and there's no sRGB issues at all!

#

amazing

#

epic

#

there's a severe lack of shadows

#

but I'm too deep into the refactoring now

wispy spear Oct 29, 2023, 11:22 PM

#

noice

#

with all these 4kisms too

distant lodge Oct 29, 2023, 11:25 PM

#

frank sail I'll let you judge <https://github.com/JuanDiegoMontoya/Frogfood/blob/a3271d3093...

trying to decipher the magic in your magic numbers... it sure is magic

frank sail Oct 29, 2023, 11:41 PM

#

It's just a pixel offset

cold sky Oct 30, 2023, 8:25 AM

#

frank sail Even with hw sparse actually

There's no easy way to scroll them, you'd have to unbind them all and bind them scrolled

frank sail Oct 30, 2023, 8:26 AM

#

continue reading the convo

cold sky Oct 30, 2023, 8:26 AM

#

frank sail Actually I'm probably <:misinfo:1073720407452045322> about hw sparse there but I...

This?

frank sail Oct 30, 2023, 8:26 AM

#

yep

wicked notch Oct 30, 2023, 11:09 AM

#

// they are the same image
layout (rgba32f, set = 0, binding = 0) uniform writeonly image2D u_history_storage;
layout (set = 0, binding = 1) uniform sampler2D u_history_sampled;

void main() {
    const vec4 x = textureLod(u_history_sampled, gl_GlobalInvocationID / vec2(textureSize(u_history_sampled, 0)), 0);
    x = do_something(x);
    imageStore(u_history_storage, gl_GlobalInvocationID.xy, x);
}``` is this legal

frank sail Oct 30, 2023, 11:12 AM

#

If u never read after a write to that texel, I think it's fine

#

But I do not think stores are visible to samplers without a pipeline barrier

wicked notch Oct 30, 2023, 11:12 AM

#

Ye I don't want to read again after I write

cold sky Oct 30, 2023, 11:23 AM

#

wicked notch ```glsl // they are the same image layout (rgba32f, set = 0, binding = 0) unifor...

in GENERAL layout, yes

cold sky Oct 30, 2023, 11:23 AM

#

frank sail But I do not think stores are visible to samplers without a pipeline barrier

there's no guarantee, but they might become visible whenever

cold sky Oct 30, 2023, 11:24 AM

#

wicked notch Ye I don't want to read again after I write

this is UB territory

#

there's no way to mark the sampler2D as coherent so any writes to the Storage image have no guarantee of showing up

#

furthermore, you "might" accidentally tap other pixels if you have the wrong sampler set (bilinear interpolation, etc)

#

why aren't you just using the image itself ?

#

like imagLoad and then imageStore to the same location ?

frank sail Oct 30, 2023, 11:27 AM

#

btw writeonly images don't need the format in the layout

cold sky Oct 30, 2023, 11:27 AM

#

cold sky this is UB territory

its literally "barely legal", like maaaybe maaaybe if you do all the following it will work:

no interpolation, literal NEAREST sampler
only draw a single pixel with the pipeline at any location, and have an image barrier before and after
image layout is GENERAL
etc.

cold sky Oct 30, 2023, 11:27 AM

#

frank sail btw writeonly images don't need the format in the layout

in Vulkan they both need, unless you have the WithoutFormat feature enabled

frank sail Oct 30, 2023, 11:27 AM

#

frank sail btw writeonly images don't need the format in the layout

and read images too if you have ext image load formatted

frank sail Oct 30, 2023, 11:27 AM

#

cold sky in Vulkan they both need, unless you have the `WithoutFormat` feature enabled

oof

#

OpenGL best API

cold sky Oct 30, 2023, 11:28 AM

#

but true, 99% of devices support write without format

#

so Nabla requires that feature

#

but still in Vulkan you need to add some stuff to VkStruct when making the image view IIRC

#

its not magically enabled on everything

cold sky Oct 30, 2023, 11:30 AM

#

wicked notch ```glsl // they are the same image layout (rgba32f, set = 0, binding = 0) unifor...

IMHO this fuckery is super "Not Worth It" (TM)

wicked notch Oct 30, 2023, 11:30 AM

#

cold sky why aren't you just using the image itself ?

I wanted to use linear filtering when reading

#

I could do it myself but eh

cold sky Oct 30, 2023, 11:31 AM

#

wicked notch I wanted to use linear filtering when reading

watbulb

wicked notch Oct 30, 2023, 11:31 AM

#

ye this is very ub

cold sky Oct 30, 2023, 11:31 AM

#

how are you going to linearly interpolate ?

#

:dafuq:

wicked notch Oct 30, 2023, 11:32 AM

#

Ye I can't lol

cold sky Oct 30, 2023, 11:32 AM

#

you write the texels with the invocations

wicked notch Oct 30, 2023, 11:32 AM

#

I'll just use two images

cold sky Oct 30, 2023, 11:32 AM

#

and then you want to read at an offset of 0.5

frank sail Oct 30, 2023, 11:32 AM

#

wicked notch ye this is very ub

It's ub if any other thread writes to the sampled footprint

cold sky Oct 30, 2023, 11:32 AM

#

btw, your original image would read texel values 0.5 away from pixel center, and store them to pixel center

#

this basically computes a quasi 1/2 downsample

#

this textureLod(u_history_sampled, gl_GlobalInvocationID / vec2(textureSize(u_history_sampled, 0)), 0); will give you 0.5 pixel less in each axis

#

so when globalinvID is 0,0 you end up tapping pixels {0,0} {-1,0}, {0,-1}, {-1,-1}

wicked notch Oct 30, 2023, 11:34 AM

#

yeah that was just quick way to demonstrate my issue

#

I do this now ```glsl
#version 460

layout (local_size_x = 16, local_size_y = 16) in;

layout (set = 0, binding = 0) uniform sampler2D u_velocity;
layout (set = 0, binding = 1) uniform sampler2D u_color;
layout (set = 0, binding = 2) uniform sampler2D u_history_sampled;
layout (rgba32f, set = 0, binding = 3) uniform image2D u_final_color;

void main() {
const vec2 resolution = vec2(textureSize(u_color, 0));
if (any(greaterThanEqual(gl_GlobalInvocationID.xy, ivec2(resolution)))) {
return;
}

const vec2 uv = (vec2(gl_GlobalInvocationID.xy) + 0.5) / resolution;
const vec2 velocity = textureLod(u_velocity, uv, 0).rg;
const vec2 prev_uv = uv - velocity;

const vec4 current_color = textureLod(u_color, uv, 0);
const vec4 previous_color = textureLod(u_history_sampled, prev_uv, 0);
imageStore(u_final_color, gl_GlobalInvocationID.xy, mix(current_color, previous_color, 0.9));

}```

cold sky Oct 30, 2023, 11:35 AM

#

ah you're doing TAA

wicked notch Oct 30, 2023, 11:35 AM

#

all images are different so no UB

cold sky Oct 30, 2023, 11:35 AM

#

btw use texelFetch

#

or Jaker's FSR2 on GL

#

or ask @hallow umbra for help, as he's poured ungodly amounts of time into TAA

#

maybe got some code to throw at you, esp that you're using visbuffer

frank sail Oct 30, 2023, 11:37 AM

#

cold sky or Jaker's FSR2 on GL

He usin Vulkan doe

#

So even better

cold sky Oct 30, 2023, 11:43 AM

#

FSR3?

#

FSR3, even better than FSR2

frank sail Oct 30, 2023, 11:53 AM

#

real

wispy spear Oct 30, 2023, 11:54 AM

#

FSR4 when

frank sail Oct 30, 2023, 11:58 AM

#

FSR 4 free on gpuopen.com

hallow umbra Oct 30, 2023, 12:28 PM

#

effort spent making good TAA is better spent tweaking your masks and stuff to give FSR/streamline the highest quality inputs you can

wispy spear Oct 30, 2023, 1:00 PM

#

@hallow umbra what happened to your sparkly TAA world btw? its been a while since you posted pics of progress : )

hallow umbra Oct 30, 2023, 1:04 PM

#

nothing, i just did everything interesting that i could think of

wispy spear Oct 30, 2023, 1:04 PM

#

oh oki

#

any other projects going then?

hallow umbra Oct 30, 2023, 1:06 PM

#

nop, i'm just out of ideas for anything graphics related

#

i don't feel motivated making things that someone else already did and better

#

and i investigated all techniques that i thought are underlooked

wispy spear Oct 30, 2023, 1:07 PM

#

you could give virtual shadow maps a try ;P

#

assist lvstri/saky/jaker unlocking its secrets

frank sail Oct 30, 2023, 1:11 PM

#

hallow umbra nop, i'm just out of ideas for anything graphics related

volumetric frog

wicked notch Oct 31, 2023, 9:33 AM

#

uhh

#

I think NV knows

#

In my frag shader, I output a zero motion vector for now

#

And DLAA starts up in "NO_MV_MODE"

#

however as soon as I put some other value in it, NGX immediately switches to "LOWRES_MV_MODE"

#

how the fuck does it know

#

I actually don't even need to put any other value in it, if I do some operation that results in 0, it also switches

#

???

wispy spear Oct 31, 2023, 9:49 AM

#

driver detects access to api, and flips a switch perhaps

wicked notch Oct 31, 2023, 12:02 PM

#

Ok so NV calculates motion vectors like this in their sample

#

void main(
    in float4 i_position : SV_Position,
    in float2 i_uv : UV,
    out float4 o_color : SV_Target0
)
{
    o_color = 0;

#if USE_STENCIL
    uint stencil = t_GBufferStencil[i_position.xy].y;
    if ((stencil & g_TemporalAA.stencilMask) == g_TemporalAA.stencilMask)
        discard;
#endif
    float depth = t_GBufferDepth[i_position.xy].x;
    
    float4 clipPos;
    clipPos.x = i_uv.x * 2 - 1;
    clipPos.y = 1 - i_uv.y * 2;
    clipPos.z = depth;
    clipPos.w = 1;

    float4 prevClipPos = mul(clipPos, g_TemporalAA.reprojectionMatrix);

    if (prevClipPos.w <= 0)
        return;
    
    prevClipPos.xyz /= prevClipPos.w;
    float2 prevUV;
    prevUV.x = 0.5 + prevClipPos.x * 0.5;
    prevUV.y = 0.5 - prevClipPos.y * 0.5;

    float2 prevWindowPos = prevUV * g_TemporalAA.previousViewSize + g_TemporalAA.previousViewOrigin;

    o_color.xy = prevWindowPos.xy - i_position.xy;
}

#

And I gotta say, what the fuck

#

is this

#

What is a reprojection matrix

delicate rain Oct 31, 2023, 12:07 PM

#

Probably last frames clip->world

wicked notch Oct 31, 2023, 12:08 PM

#

Uhh

#

maybe

#

viewReprojection = inverse(view->GetViewMatrix()) * viewPrevious->GetViewMatrix();
reprojectionMatrix = inverse(view->GetProjectionMatrix(false)) * affineToHomogeneous(viewReprojection) * viewPrevious->GetProjectionMatrix(false);```

#

It's whatever this does

delicate rain Oct 31, 2023, 12:09 PM

#

Huuuuuh

#

Wtf is affineToHomogeneous

wicked notch Oct 31, 2023, 12:09 PM

#

template <typename T, int n>
matrix<T, n+1, n+1> affineToHomogeneous(affine<T, n> const & a)
{
    matrix<T, n+1, n+1> result;
    for (int i = 0; i < n; ++i)
    {
        for (int j = 0; j < n; ++j)
            result[i][j] = a.m_linear[i][j];
        result[i][n] = T(0);
    }
    for (int j = 0; j < n; ++j)
        result[n][j] = a.m_translation[j];
    result[n][n] = T(1);
    return result;
}```???????????????????????????

delicate rain Oct 31, 2023, 12:11 PM

#

What's wrong with just doing vertex * prevMVP - vertex*thisMVP in your vert shader?

#

Why do you have to do this cursed solution

wicked notch Oct 31, 2023, 12:11 PM

#

I don't

#

I do that in fact

#

This is the DLSS sample's app code

delicate rain Oct 31, 2023, 12:11 PM

#

https://tenor.com/view/sad-frown-rain-cat-gif-17035614

Tenor

wicked notch Oct 31, 2023, 12:12 PM

#

Thing is, I dunno if it is actually correct, because NV's docs tell me to do this

#

Whatever this means

#

#questions message
Here, check this out

#

In my thing stuff looks very aliased when I move around

delicate rain Oct 31, 2023, 12:14 PM

#

I have no clue what's going on, why do they read the velocity from the texture only if both XY are nonzero?

wispy spear Oct 31, 2023, 12:15 PM

#

perhaps negative/zero velocities need special treatment?

delicate rain Oct 31, 2023, 12:17 PM

#

Isn't negative velocity just moving in the opposite direction of positive velocity? (Aka back positive forward negative or the other way around)

wicked notch Oct 31, 2023, 12:17 PM

#

ye NV's docs don't mention anything about that

#

They just say this

delicate rain Oct 31, 2023, 12:19 PM

#

wicked notch ```cpp template <typename T, int n> matrix<T, n+1, n+1> affineToHomogeneous(affi...

Btw this builds a 4x4 matrix from some weirdo 3x3 matrix + translation I think

#

So the reprojection matrix goes prevFrameClip -> prevFrameView-> prevFrameWorld -> thisFrameView -> thisFrameClip

wicked notch Oct 31, 2023, 12:23 PM

#

btw

#

I am starting to think my derivative calculation isn't accurate enough

#

and my motion vectors are fine

wicked notch Oct 31, 2023, 12:42 PM

#

Other fun fact NVSDK_NGX_VK_Feature_Eval_Params::Sharpness does apparently nothing

#

Maybe it only works for DLSS, I'm using DLAA

wicked notch Oct 31, 2023, 3:35 PM

#

damn DLSS takes longer that it takes for me to rasterize the scene

#

amazing

delicate rain Oct 31, 2023, 7:06 PM

#

Can you just use DLSS I thought FSR is the only one which you can freely use?

wicked notch Oct 31, 2023, 7:07 PM

#

ye you can just clone it and use it

glass sphinx Oct 31, 2023, 7:09 PM

#

its source available?

delicate rain Oct 31, 2023, 7:09 PM

#

I doubt that

wicked notch Oct 31, 2023, 7:09 PM

#

heh, it's a 33MiB DLL

#

zero source whatsoever

glass sphinx Oct 31, 2023, 7:09 PM

#

froge_bleak

#

uuuhm

#

is that allowed?

wicked notch Oct 31, 2023, 7:10 PM

#

is what allowed?

glass sphinx Oct 31, 2023, 7:10 PM

#

to use it

wicked notch Oct 31, 2023, 7:10 PM

#

ye

#

as far as I know

glass sphinx Oct 31, 2023, 7:10 PM

#

do they not even have headers?

wicked notch Oct 31, 2023, 7:10 PM

#

Oh ye they do have headers

glass sphinx Oct 31, 2023, 7:10 PM

#

ah ok

wicked notch Oct 31, 2023, 7:10 PM

#

https://github.com/NVIDIA/DLSS

#

Here

wicked notch Oct 31, 2023, 8:55 PM

#

man

#

AA is so nice

#

but you know what would be nicer

#

Figure out why the fuck I get unstable AA when moving around at the edges of triangles

wicked notch Oct 31, 2023, 9:44 PM

#

Actually not even that

#

what the hell is wrong with DLAA

delicate rain Oct 31, 2023, 9:52 PM

#

mister mister do you have code for your project using the ktx thingy?

frank sail Oct 31, 2023, 9:52 PM

#

Why no fsr2

frank sail Oct 31, 2023, 9:52 PM

#

delicate rain mister mister do you have code for your project using the ktx thingy?

Libktx?

delicate rain Oct 31, 2023, 9:52 PM

#

yes

wicked notch Oct 31, 2023, 9:52 PM

#

wicked notch Oct 31, 2023, 9:52 PM

#

delicate rain mister mister do you have code for your project using the ktx thingy?

I do

delicate rain Oct 31, 2023, 9:52 PM

#

I'm finally writing a scene loader

wicked notch Oct 31, 2023, 9:52 PM

#

https://github.com/LVSTRI/IrisVk/blob/master/src/iris/gfx/texture.cpp#L19 it's all here

wicked notch Oct 31, 2023, 9:53 PM

#

wicked notch

btw look at the top of the building

#

this is with DLAA applied

#

lol

frank sail Oct 31, 2023, 9:53 PM

#

delicate rain yes

Ctrl+f ktx
https://github.com/JuanDiegoMontoya/Frogfood/blob/main/src/SceneLoader.cpp

wicked notch Oct 31, 2023, 9:53 PM

#

frank sail Why no fsr2

That's coming too

#

I have decided that shipping spirv isn't so bad after all

frank sail Oct 31, 2023, 9:54 PM

#

wicked notch

You need a magnifier homie

wicked notch Oct 31, 2023, 9:54 PM

#

I do

frank sail Oct 31, 2023, 9:54 PM

#

wicked notch I have decided that shipping spirv isn't so bad after all

I figured you had changed your mind when you were suddenly okay with shipping a 33mb dll bleakekw

wicked notch Oct 31, 2023, 9:54 PM

#

frank sail I figured you had changed your mind when you were suddenly okay with shipping a ...

yeah bleakekw

wicked notch Oct 31, 2023, 9:55 PM

#

delicate rain mister mister do you have code for your project using the ktx thingy?

btw, I suggest you take my code only as a demo of how to load KTX into Vulkan

frank sail Oct 31, 2023, 9:55 PM

#

wicked notch I do

Where

wicked notch Oct 31, 2023, 9:55 PM

#

For proper KTX management check out Jaker's code, he actually checks whether a texture is supercompressed, needs transcoding, etc.

wicked notch Oct 31, 2023, 9:55 PM

#

frank sail Where

No as in, I do need a magnifier KEKW

delicate rain Oct 31, 2023, 9:56 PM

#

wicked notch btw, I suggest you take my code only as a demo of how to load KTX into Vulkan

I'm looking at both, but is there really not a way to load directly into a staging buffer?

wicked notch Oct 31, 2023, 9:56 PM

#

There is but it's garbage

frank sail Oct 31, 2023, 9:56 PM

#

wicked notch No as in, I do need a magnifier <:KEKW:666849321462792234>

Oh KEKW

wicked notch Oct 31, 2023, 9:56 PM

#

The function is ktxTexture_VkUploadEx

#

But nobody uses it because it's bad

delicate rain Oct 31, 2023, 9:57 PM

#

bruuuh I'm starting to understand handmade ppl

frank sail Oct 31, 2023, 9:57 PM

#

Ktx also has a GL upload function but it sucks too agonyfrog

delicate rain Oct 31, 2023, 9:58 PM

#

I guess it is impossible to just take in a single pointer to a buffer into which we like to load our data

#

nono I AM THE LIBRARY I RETURN THE POINTER

#

ugh

wicked notch Oct 31, 2023, 9:58 PM

#

Just memcpy my boy

delicate rain Oct 31, 2023, 9:58 PM

#

yeah

#

I cope

wicked notch Oct 31, 2023, 9:58 PM

#

https://tenor.com/view/copium-cat-gif-27161395

Tenor

wicked notch Oct 31, 2023, 9:59 PM

#

wicked notch

Jaker can you run FSR2 in AA mode like this? (normals only, bistro)

frank sail Oct 31, 2023, 10:00 PM

#

Yes but it's not optimized for that

#

Fsr2 does a bunch of unnecessary work when you use it for 1x upscale (AA)

wicked notch Oct 31, 2023, 10:01 PM

#

I'm not looking at 🅱️erf

#

Just if a correct impl of FSR2 also suffers from the same aliasing

frank sail Oct 31, 2023, 10:01 PM

#

Are you asking me to test

#

Because I'm away from my PC for the next few days

wicked notch Oct 31, 2023, 10:01 PM

#

oh rip

#

I'll just pull latest frogfood then

frank sail Oct 31, 2023, 10:02 PM

#

Yeeeeeee

#

It's hard for me to tell if your vid shows poopy aliasing or compression artifacts (which are worsened due to discord mobile sucking)

#

Btw you will have to edit this shader to display normals
https://github.com/JuanDiegoMontoya/Frogfood/blob/main/data/shaders/ShadeDeferredPbr.frag.glsl

wicked notch Oct 31, 2023, 10:06 PM

#

Looks like FSR2 is the same

#

is this just a limitation of modern AA

wispy spear Oct 31, 2023, 10:06 PM

#

that video looks neat nonetheless

frank sail Oct 31, 2023, 10:07 PM

#

It's probably worse without any AA

wispy spear Oct 31, 2023, 10:07 PM

#

the tree would probably go bonkers without

wicked notch Oct 31, 2023, 10:09 PM

#

yeah definitely worse without AA

#

welp, I've ran all possible sanity checks, it looks like my impl of DLSS is without errors

#

At least, without obvious errors bleakekw

wicked notch Nov 2, 2023, 12:55 AM

#

here is yet another fun fact about DLSS

#

Here's good ol sponza

#

looks pretty bad innit, well it is performance mode DLSS

#

Now here's NV's sponza

#

Also in performance mode, same resolution

#

wait lemme remove the shadows

#

There it is

#

notice any difference?

#

How in god's name is their sample's app, from which I stole all the code, look so much better

#

what in the everloving fuck

wispy spear Nov 2, 2023, 1:01 AM

#

rename your shiddy.exe to whatevernvused.exe

wicked notch Nov 2, 2023, 1:02 AM

#

actually

#

let me try that

#

I swear to god if something changes I'm pulling DLSS out

wispy spear Nov 2, 2023, 1:02 AM

#

lol

#

im confident it willnt change anything

wicked notch Nov 2, 2023, 1:02 AM

#

yeah

#

on the 0.1% chance it does tho

glass sphinx Nov 2, 2023, 1:03 AM

#

😈

wispy spear Nov 2, 2023, 1:03 AM

#

drivers might use hashes not filenames anyway i suppose

wicked notch Nov 2, 2023, 1:03 AM

#

I'm changing the AppID

#

ok DLSS is safe

#

nothing changed

#

I'll try asking on NV's forums/discord

wispy spear Nov 2, 2023, 1:06 AM

#

was worf a try

frank sail Nov 2, 2023, 3:13 AM

#

wicked notch There it is

Different tonemapper? Lighting model? Post processing stack?

cold sky Nov 2, 2023, 9:13 AM

#

wicked notch How in god's name is their sample's app, from which I stole all the code, look s...

AI denoisers are super fragile and sensitive to "subpixel patterns" in your inputs

#

I've had the OptiX one refuse to work cause Mitsuba splatted samples to multiple pixels with a gaussian kernel

cold sky Nov 2, 2023, 9:14 AM

#

wicked notch notice any difference?

you have a pronounced difference in lighting

#

and quite a lot of moire on your curtains

wicked notch Nov 2, 2023, 9:29 AM

#

I disabled all post processing in the sample app btw

#

What I don't understand rn is the moirè

#

I am using the same LOD bias as they are using

frank sail Nov 2, 2023, 10:25 AM

#

Did you check RenderDoc

#

Compare render and display resolutions, image formats, sampler state, etc

wicked notch Nov 2, 2023, 10:48 AM

#

One thing I am noticing

#

In my stuff, I can only see the edges of objects jittered

#

Even with a 6.0x magnifier

#

In the sample app though I can see everything jittering thonk

frank sail Nov 2, 2023, 10:54 AM

#

Are you jittering your view or projection matrix

wicked notch Nov 2, 2023, 10:59 AM

#

The proj matrix

#

const auto jitter = sample_jitter(_device->frame_counter().current(), _state.dlss.jitter_count);
const auto jitter_translation = glm::vec3(2.0f * jitter / glm::vec2(_state.dlss.render_resolution), 0.0f);
const auto jitter_matrix = glm::translate(glm::mat4(1.0f), jitter_translation);
view.jittered_projection = jitter_matrix * view.projection;

#

As nvidia tells me to do

frank sail Nov 2, 2023, 11:00 AM

#

rip idk

#

I thought your bug could be that you only jitter gl_Position but not the other attributes

wicked notch Nov 2, 2023, 11:35 AM

#

I solved it

#

I forgor I was using a different view struct for the visbuffer resolve

#

goddamnit

wicked notch Nov 2, 2023, 12:13 PM

#

Finally

#

we've done it boys

frank sail Nov 2, 2023, 3:06 PM

#

How

wicked notch Nov 2, 2023, 3:26 PM

#

frank sail How

with the power of friendship and copy pasting code from the sample app

wispy spear Nov 2, 2023, 5:19 PM

#

the dethfrog had a hand in this probably 🙂

wicked notch Nov 2, 2023, 10:38 PM

#

I kinda have a defcon0 situation on my hands

#

debugPrintfEXT gives me device loss 💀

wispy spear Nov 2, 2023, 10:51 PM

#

6 3 8 2 5 9 2 1

#

here the launch codes

wicked notch Nov 2, 2023, 11:27 PM

#

this bug is megaweird ngl

#

also DLSS is enabling the deprecated VK_EXT_buffer_device_address

#

so that's fucking up my validation layers too

#

I fear integrating DLSS should be the last possible step of any engine

#

because it makes debugging impossible bleakekw

cold sky Nov 2, 2023, 11:36 PM

#

wicked notch also DLSS is enabling the deprecated VK_EXT_buffer_device_address

Old code old code

#

This is why FOSS is best

#

You can go in and fix that shit

wicked notch Nov 2, 2023, 11:37 PM

#

ye this absolutely sucks

wicked notch Nov 3, 2023, 12:43 AM

#

sigh

#

looks like I've been debugging nothing for two hours

#

!remindme 12h open debug printf issue

vivid boughBOT Nov 3, 2023, 12:43 AM

#

Alright lvstri, I'll remind you about open debug printf issue in 12 hours. ID: 62513782

wicked notch Nov 3, 2023, 12:44 AM

#

thanks bot

#

imma go cry myself to sleep

#

at least debugPrintfEXT works now

wicked notch Nov 4, 2023, 2:45 PM

#

and just now I notice that my page table isn't actually wrapping around bleakekw

#

epic texture viewer achieved

#

worst texture viewer in the world btw

wispy spear Nov 4, 2023, 2:48 PM

#

wicked notch epic texture viewer achieved

did you add asteroids 😄

wicked notch Nov 4, 2023, 2:52 PM

#

asteroids? wym

wispy spear Nov 4, 2023, 2:53 PM

#

the texture viewer thingy

#

looks cool how it changes as you move

delicate rain Nov 4, 2023, 3:22 PM

#

that is because mister has no caching still

#

daily reminder to add caching mister LVSTRI

wispy spear Nov 4, 2023, 3:24 PM

#

heh

wicked notch Nov 4, 2023, 11:53 PM

#

#define sampler_partially_bound decorate_with_string("update_after_bind|partially_bound")

layout (local_size_x = 16, local_size_y = 16) in;

layout (set = 0, binding = IRIS_TEXTURE_TYPE_2D_SFLOAT) sampler_partially_bound uniform sampler2D u_texture_2d_sfloat;
layout (set = 0, binding = IRIS_TEXTURE_TYPE_2D_SINT) sampler_partially_bound uniform isampler2D u_texture_2d_sint;
layout (set = 0, binding = IRIS_TEXTURE_TYPE_2D_UINT) sampler_partially_bound uniform usampler2D u_texture_2d_uint;
layout (set = 0, binding = IRIS_TEXTURE_TYPE_2D_ARRAY_SFLOAT) sampler_partially_bound uniform sampler2DArray u_texture_2d_array_sfloat;
layout (set = 0, binding = IRIS_TEXTURE_TYPE_2D_ARRAY_SINT) sampler_partially_bound uniform isampler2DArray u_texture_2d_array_sint;
layout (set = 0, binding = IRIS_TEXTURE_TYPE_2D_ARRAY_UINT) sampler_partially_bound uniform usampler2DArray u_texture_2d_array_uint;``` ahhh yes

#

modern GLSL code

#

static auto make_descriptor_binding_flag_from_decoration(const std::string& decoration) -> descriptor_binding_flag_t {
    const auto split = split_decoration_string(decoration);
    auto result = descriptor_binding_flag_t();
    for (const auto& each : split) {
        if (each == "update_after_bind") {
            result |= ir::descriptor_binding_flag_t::e_update_after_bind;
        } else if (each == "update_unused_while_pending") {
            result |= ir::descriptor_binding_flag_t::e_update_unused_while_pending;
        } else if (each == "partially_bound") {
            result |= ir::descriptor_binding_flag_t::e_partially_bound;
        } else if (each == "variable_descriptor_count") {
            result |= ir::descriptor_binding_flag_t::e_variable_descriptor_count;
        }
    }
    return result;
}``` mmmm

#

love it

frank sail Nov 4, 2023, 11:55 PM

#

Reflection my beloved

#

Using shading langs makes you wish for a nuclear winter

glass sphinx Nov 4, 2023, 11:58 PM

#

wicked notch ```glsl #define sampler_partially_bound decorate_with_string("update_after_bind|...

why dont you alias them to the same binding

#

https://tenor.com/view/the-rock-gif-23501265

Tenor

wispy spear Nov 5, 2023, 12:00 AM

#

hmm the e_ is ugly too, its quite obvious that its an enum already, otherwise iris* is quite sexy code wise

wicked notch Nov 5, 2023, 12:01 AM

#

glass sphinx why dont you alias them to the same binding

because I'd have to refactor my entire reflection system

glass sphinx Nov 5, 2023, 12:02 AM

#

why do you refelct at all

#

ah its not fully bindless?

wicked notch Nov 5, 2023, 12:02 AM

#

it's not 😦

#

I still have to steal your gpu table of resources

#

one day I'll be 100% bindless

glass sphinx Nov 5, 2023, 12:03 AM

#

https://www.youtube.com/shorts/XPcfbKYmxpw

YouTube

Short Clips

SpongeBob Foghorn Sound Effect

▶ Play video

wispy spear Nov 5, 2023, 12:03 AM

#

hmm we should make use of discord's soundboard 😄

frank sail Nov 5, 2023, 12:03 AM

#

Doesn't that require you to be in voice

wispy spear Nov 5, 2023, 12:04 AM

#

no idea tbh

#

but makes sense yeah

glass sphinx Nov 5, 2023, 12:04 AM

#

now i think daxas descriptor code shrunk down to like 300loc for all descriptor management and i added a lot of validation

wicked notch Nov 5, 2023, 12:04 AM

#

server wkde soundboard

glass sphinx Nov 5, 2023, 12:04 AM

#

wispy spear hmm we should make use of discord's soundboard 😄

hahaha yeeeesss

wicked notch Nov 5, 2023, 12:04 AM

#

glass sphinx now i think daxas descriptor code shrunk down to like 300loc for all descriptor ...

my pipeline.cpp is like 6000 lines

glass sphinx Nov 5, 2023, 12:05 AM

#

👹

#

youuung maan

wicked notch Nov 5, 2023, 12:05 AM

#

I know

#

bleakekw

glass sphinx Nov 5, 2023, 12:05 AM

#

🟪 its time to wear purple

delicate rain Nov 5, 2023, 12:05 AM

#

we send you merch

#

you write rt api for daxa

#

https://tenor.com/view/monkey-rizz-lightskin-stare-lightskin-monkey-lick-lips-monkey-stare-gif-27631236

Tenor

runic surge Nov 5, 2023, 12:06 AM

#

now i agree with that message

wispy spear Nov 5, 2023, 12:06 AM

#

that guy reminds me of my racoon, who comes visit here every once in a while 😄

glass sphinx Nov 5, 2023, 12:06 AM

#

omg i love saky so much

wicked notch Nov 5, 2023, 12:06 AM

#

delicate rain https://tenor.com/view/monkey-rizz-lightskin-stare-lightskin-monkey-lick-lips-mo...

live footage of potrick asking me to use daxa

glass sphinx Nov 5, 2023, 12:06 AM

#

i actually look like that irl

wispy spear Nov 5, 2023, 12:07 AM

#

https://tenor.com/view/schwarzer-picard-kaffee-junge-star-trek-gif-11067878

Tenor

Schwarzer Kaffee, Junge.

▶ Play video

#

its more like this, potrick == picard, crusher == lvstri, they even sit on daxa coloured chairs 😄

runic surge Nov 5, 2023, 12:07 AM

#

wicked notch live footage of potrick asking me to use daxa

lvstri making sure the daxa-fwog monopoly can never happen

glass sphinx Nov 5, 2023, 12:08 AM

#

wispy spear its more like this, potrick == picard, crusher == lvstri, they even sit on daxa ...

this is perfect

wispy spear Nov 5, 2023, 12:08 AM

#

https://tenor.com/view/monopoly-middle-finger-monopoly-boy-monopoly-rage-gif-14508393

Tenor

#

you mean like this?

wicked notch Nov 5, 2023, 2:28 PM

#

static float3 RandomVectorInCone(in float3 direction, in float angle) {
    const uint3 pixelCoord = DispatchRaysIndex();
    const uint3 dispatchDimension = DispatchRaysDimensions();
    const uint pixelIndex = pixelCoord.y * dispatchDimension.x + pixelCoord.x;
    const uint sampleIndex = RayTraceCB.CurrSampleIdx;
    uint state = pixelIndex * sampleIndex;

    const float phi = RandomPCG(state) * 2 * 3.141592653589793284626433;
    const float z = RandomPCG(state) * (1 - cos(angle)) + cos(angle);
    const float x = sqrt(1 - z * z) * cos(phi);
    const float y = sqrt(1 - z * z) * sin(phi);
    const float3 tangent = normalize(cross(float3(0, 1, 0), direction));
    const float3 bitangent = cross(direction, tangent);
    const float3x3 rotation = float3x3(tangent, bitangent, direction);
    return normalize(mul(float3(x, y, z), rotation));
}```

#

for posterity

wicked notch Nov 5, 2023, 3:30 PM

#

0.00872665

runic surge Nov 5, 2023, 5:19 PM

#

Don’t mind if i yoink that

wicked notch Nov 5, 2023, 5:43 PM

#

runic surge Don’t mind if i yoink that

make sure to send screenshots of your results with the code

glass sphinx Nov 5, 2023, 5:44 PM

#

lvstri qhat gpu do you have

wicked notch Nov 5, 2023, 5:44 PM

#

3070 doc

glass sphinx Nov 5, 2023, 5:44 PM

#

nice

wicked notch Nov 5, 2023, 5:45 PM

#

I may or may not be procrastinating on caching for my shadows with RT

#

ngl the RT API in Vulkan is super convoluted wtf

glass sphinx Nov 5, 2023, 5:45 PM

#

it also has lots of options that are just not usefu

#

like cpu side build

#

early days

wicked notch Nov 6, 2023, 8:34 AM

#

btw @frank sail

#

I figured out a very much more shrimpler way of doing your unhinged glm::inverse(bababooey) * baba_is_you * stable_view

frank sail Nov 6, 2023, 8:34 AM

#

please god yes

wicked notch Nov 6, 2023, 8:35 AM

#

const auto clip_world_position = view.stable_proj_view * glm::vec4(_camera.position(), 1.0f);
const auto uv_world_position = (glm::vec2(clip_world_position) / clip_world_position.w) * 0.5f;
const auto page_offset = glm::ivec2(uv_world_position * glm::vec2(IRIS_VSM_VIRTUAL_PAGE_ROW_SIZE));
const auto ndc_shift = 2.0f * (glm::vec2(page_offset) / glm::vec2(IRIS_VSM_VIRTUAL_PAGE_ROW_SIZE));
const auto world_page_offset = view.inv_stable_proj_view * glm::vec4(ndc_shift, 0.0f, 1.0f);
const auto world_page_offset_shift = glm::vec3(-world_page_offset);
const auto shifted_view = glm::translate(view.stable_view, world_page_offset_shift);
view.view = shifted_view;```

frank sail Nov 6, 2023, 8:36 AM

#

will analyze in a bit

wicked notch Nov 6, 2023, 9:39 AM

#

Does this suffer from "if player moves too far away then Z range is fucked" problem I wonder

frank sail Nov 6, 2023, 9:39 AM

#

probably

wicked notch Nov 6, 2023, 9:40 AM

#

because I'm supposedly translating the view matrix to where the player is, to the nearest page

frank sail Nov 6, 2023, 9:40 AM

#

I mean if you're just shifting xy then yes

#

it will suffer

wicked notch Nov 6, 2023, 9:40 AM

#

rip

frank sail Nov 6, 2023, 9:40 AM

#

the solution for z will be more complicated

#

you will need a per-page z offset or something

wicked notch Nov 6, 2023, 9:41 AM

#

or do what saky is doing

#

which is probably a per-page z offset bleakekw

frank sail Nov 6, 2023, 9:41 AM

#

when we discussed, I understood that it did not solve that problem

wicked notch Nov 6, 2023, 9:41 AM

#

o

#

so saky's thing is massively more clamplicated but it doesn't solve the problem? 💀

frank sail Nov 6, 2023, 9:42 AM

#

that's how I understood it, idk

wicked notch Nov 6, 2023, 9:47 AM

#

welp

#

I guess we'll make full use of our god given fp32 precision

cold sky Nov 6, 2023, 10:35 AM

#

for orthographic projection, fp32 makes no sense

#

use/emulate unorm32

#

with fp32 you waste 2 bits

#

and you have a logartihmic distribution of the remaining 30

wicked notch Nov 6, 2023, 10:37 AM

#

how do you emulate unorm32?

#

rn I just do floatBitsToUint(gl_FragCoord.z)

frank sail Nov 6, 2023, 10:37 AM

#

I mean, you don't have a real depth buffer

#

so you can use any format you want

wicked notch Nov 6, 2023, 10:38 AM

#

ye but I can't do atomicMin on a unorm32 image can I

frank sail Nov 6, 2023, 10:38 AM

#

you can do it on a uint32 image tho

#

unorm is just that, but with an implicit division by U32_MAX

#

I suppose you will have to do your math in fixed point to see any real benefit though

delicate rain Nov 6, 2023, 10:39 AM

#

frank sail that's how I understood it, idk

It does solve it

wicked notch Nov 6, 2023, 10:40 AM

#

pog

delicate rain Nov 6, 2023, 10:40 AM

#

I have to have per page z offset

frank sail Nov 6, 2023, 10:40 AM

#

ah

delicate rain Nov 6, 2023, 10:40 AM

#

And my thingy

#

And a bit more logic and it still has some quirks

#

So id suggest just go for sliding along the plane bleakekw

wicked notch Nov 6, 2023, 10:42 AM

#

can't we fix by translating the origin of the world somehow

#

perhaps by recreating the stable view matrix to point at another center

delicate rain Nov 6, 2023, 10:43 AM

#

You need to correct the depths then

#

That's what I do

#

Uh maybe I misunderstood actually

frank sail Nov 6, 2023, 10:44 AM

#

you can invalidate everything if the player goes too far from the origin (on the light-space z axis), if you want to use minimal effort

#

then you can shift the light camera

wicked notch Nov 6, 2023, 10:44 AM

#

what's an invalidation every time you move 2000km

frank sail Nov 6, 2023, 10:45 AM

#

good luck getting sufficient z precision

wicked notch Nov 6, 2023, 10:45 AM

#

rip

frank sail Nov 6, 2023, 10:46 AM

#

you could make the frustum length like 1000 units and then shift every 500 (with a buffer zone to prevent the player from constantly triggering full refreshes by moving past a threshold)

delicate rain Nov 6, 2023, 10:47 AM

#

Btw how do you deal with player going into negative coordinates from the origin? Won't it shift the sun camera underneath the terrain?

wicked notch Nov 6, 2023, 10:48 AM

#

depth clamping smart

frank sail Nov 6, 2023, 10:48 AM

#

yeah not much you can do there except make the frustum longer

#

most game content probably won't span such a huge area

#

vertically, that is

delicate rain Nov 6, 2023, 10:49 AM

#

I want start citizen planets 🥸

frank sail Nov 6, 2023, 10:49 AM

#

make a bigger frustum

wicked notch Nov 6, 2023, 10:49 AM

#

full scale planets yes

delicate rain Nov 6, 2023, 10:49 AM

#

My frustum size increases with each clipmap

wicked notch Nov 6, 2023, 10:50 AM

#

actually

#

make the galaxy cast a shadow

frank sail Nov 6, 2023, 10:50 AM

#

you don't need insane precision when you are 50,000,000 km from the surface

delicate rain Nov 6, 2023, 10:50 AM

#

So it actually is fine

wicked notch Nov 6, 2023, 10:53 AM

#

everything would be so much easier if we had infinite memory and infinite precision smh

wicked notch Nov 11, 2023, 1:51 AM

#

raytracing is bad for my health

#

it's 3am and I am staring at path traced power plant

#

I have been staring at it for 10 minutes

#

this is a cry for help

frank sail Nov 11, 2023, 1:53 AM

#

go to sleep and dream about path traced frogs

wicked notch Nov 11, 2023, 1:53 AM

#

thank god tomorrow is saturday

glass sphinx Nov 11, 2023, 2:04 AM

#

wicked notch it's 3am and I am staring at path traced power plant

i am sitting in a similar boat

#

i should sleep

runic surge Nov 11, 2023, 3:48 AM

#

wicked notch this is a cry for help

Now he gets it

wicked notch Nov 11, 2023, 10:41 PM

#

oh yeah

#

frame time variance

#

no variance at all

#

#

This is even worse KEKW

#

amazing

runic surge Nov 11, 2023, 11:08 PM

#

are those frame times supposed to be normal?

#

i have no clue but usually frame times don't become sinosodual

wicked notch Nov 11, 2023, 11:20 PM

#

depends on your definition of normal 💀

runic surge Nov 11, 2023, 11:20 PM

#

babe wake up new frametimes just dropped

#

technically speaking

#

average frametime is 🔥

#

just ignore the 1% lows

wicked notch Nov 12, 2023, 12:02 AM

#

does this mean my blocker search has not enough shrimples?

frank sail Nov 12, 2023, 12:07 AM

#

Looks fine to me

wicked notch Nov 12, 2023, 12:07 AM

#

#

me when shown literally any kind of contact hardening:

#

https://tenor.com/view/cavestory-cave-story-gif-21735440

Tenor

#

but perhaps the light size is too big KEKW

frank sail Nov 12, 2023, 12:08 AM

#

Add sliders for sample count, width, etc

wicked notch Nov 12, 2023, 12:08 AM

#

ye

twin bough Nov 12, 2023, 1:20 AM

#

i have pcss too ill ask if i can share it

#

heh

#

looks mega stupid tho

frank sail Nov 12, 2023, 1:20 AM

#

I'll allow it

twin bough Nov 12, 2023, 1:32 AM

#

glass sphinx Nov 12, 2023, 1:33 AM

#

nice

twin bough Nov 12, 2023, 1:33 AM

#

also works for local lights

#

frank sail Nov 12, 2023, 1:37 AM

#

I could recognize that rust texture anywhere

twin bough Nov 12, 2023, 1:40 AM

#

sauce

frank sail Nov 13, 2023, 12:24 PM

#

so it seems like UE5 do be making an HZB for the VSM

#

tbh I think hzb would work if you have a two-pass approach

wicked notch Nov 13, 2023, 12:42 PM

#

remember that unreal's meshlets are 99% of the times smaller that a page

#

so they can do HZB per page

#

we gotta think more heavily about it bleakekw

frank sail Nov 13, 2023, 12:43 PM

#

ye I'm doing that rn

#

ALSO

#

I determined that HZB is only helpful for dynamic geometry

#

if a page wasn't previously visible (when the camera moves or the light rotates), then it never had meaningful depth to cull against

#

so all geometry that touches that page must be rendered

#

anyways, here's the idea:

the usual: mark & allocate visible pages, clear dirty physical pages, etc.
build HPB and cull visible objects against it (visible objects are determined from step 4 of last frame)
render remaining visible objects
build HZB and cull objects against it
render objects whose visibility changed from 0 to 1 (this is essential to avoid getting fucked by the cached nature of pages)

#

again, HZB only helps when moving geometry can cause an already-visible page to become invalidated

#

idk if geometry can move in any of our engines bleakekw

wicked notch Nov 13, 2023, 12:49 PM

#

💀

frank sail Nov 13, 2023, 12:52 PM

#

HPB however is useful for everything

#

and I don't think they can be cleverly combined like I originally thought

wicked notch Nov 13, 2023, 12:55 PM

#

ye unfortunately

#

I wanted to go with the "idk separate them" anyways

frank sail Nov 13, 2023, 12:55 PM

#

actually I think they can be merged if you put HPB in step 4

#

wait uh

#

with merged HPB+HZB in step 4, if you see a new page, objects may not be rendered to it in the first render, but they shouldn't be culled in step 4 as the HPB+HZB will be empty, which means they should be rendered in step 5

#

the idea requires storing object visibility until the next frame, which is numViews * numObjects bits of storage

#

where an object is presumably a meshlet

wicked notch Nov 13, 2023, 1:03 PM

#

thank god for uint64_t KEKW

frank sail Nov 13, 2023, 1:03 PM

#

well even if you have a million meshlets and 16 views, 16 million bits is only 2 MB

#

it's not as bad as trying to store the maximum number of indices for every view bleakekw

#

it's like 3 orders of magnitude less storage

wicked notch Nov 13, 2023, 1:06 PM

#

tru

wicked notch Nov 13, 2023, 7:56 PM

#

alright it's time to switch things up a bit

#

I shall put VSM in the backburner for a while

glass sphinx Nov 13, 2023, 8:02 PM

#

https://tenor.com/view/mts-meg-squinting-meg-camera-megan-thee-stallion-megan-near-camera-gif-24541289

Tenor

#

@delicate rain man overboard!

wicked notch Nov 13, 2023, 8:05 PM

#

I will be doing RT yes

#

hopefully I can help out with daxa's RT efforts

glass sphinx Nov 13, 2023, 8:05 PM

#

https://tenor.com/view/omg-oh-my-god-crypto-eth-ethereum-gif-27338613

Tenor

#

froge_love

wicked notch Nov 13, 2023, 8:21 PM

#

potrick while you're here

#

do you mind explaining a bit how daxa's resource table work on the C++ side

#

how are BufferIds and SamplerIds created, bound to descriptors and destroyed specifically

delicate rain Nov 13, 2023, 8:32 PM

#

glass sphinx <@226726721133477888> man overboard!

I am still a believer

#

in this endeavor

#

I thin HZB will help when you have animated thingy which just sways for example and you are redrawing the tile each frame

wicked notch Nov 13, 2023, 8:33 PM

#

I have not abandoned you guys lol

#

the VSM train is still going strong

#

'Tis but one of my usual detours

delicate rain Nov 13, 2023, 8:34 PM

#

Good goood I'm also on VSM holiday

#

but I'll soon return stronger than ever

glass sphinx Nov 13, 2023, 8:45 PM

#

wicked notch do you mind explaining a bit how daxa's resource table work on the C++ side

hello

#

Example for Buffer:

creating it gives you an id (index + version)
index of id indexes into cpu side array of ImplBuffers (the metadata for the buffer)
index indexes into a descriptor set binding array
when creating the buffer its imediately written to the mega descriptor set
daxa only has one descriptor set that has update after bind and some other flags set to make it convenient
when calling destroy on the buffer it becomes a zombie
zombies life until all already submitted commands running at the point in time when you call destroy are done
daxa checks when they are done and actually performs destructions in Device::collect_garbage
it uses timeline semaphores tracking submits on a cpu and gpu timeline
actualyl destroying the buffer writes a dummy in the place of the dead buffer to avoid dangling descriptors

wicked notch Nov 13, 2023, 8:50 PM

#

epic

#

then to access the buffer do you use push const or something?

#

to index in the buffer table that is

glass sphinx Nov 13, 2023, 8:51 PM

#

you either put it in a push constant or bind it as a uniform buffer (yeeaaa i know, i am not sure if i wanna keep them uniform buffers but daxa has them atm)

wicked notch Nov 13, 2023, 8:52 PM

#

alright the refactor is coming soon™️

#

but RT first

glass sphinx Nov 13, 2023, 9:20 PM

#

wispy spear Nov 13, 2023, 9:23 PM

#

lustri, traitor

wicked notch Nov 14, 2023, 1:30 PM

#

hmm

#

making a bottom level AS per meshlet

#

what could go wrong?

frank sail Nov 14, 2023, 1:31 PM

#

bottom level algebra subprogram

#

btw, why

wicked notch Nov 14, 2023, 1:32 PM

#

idk

frank sail Nov 14, 2023, 1:32 PM

#

why not coarser granularity

wicked notch Nov 14, 2023, 1:32 PM

#

I was writing code for bvh building for hardware RT

wicked notch Nov 14, 2023, 1:32 PM

#

frank sail why not coarser granularity

it's probably best innit

#

maybe meshlets

#

except meshlet triangle upper bound is 65536 triangles KEKW

runic surge Nov 14, 2023, 1:58 PM

#

Wait lvstri, you use daxa now?

#

I thought you used your own custom stuff

wicked notch Nov 14, 2023, 1:59 PM

#

I don't use daxa

#

I might go for it in the future

runic surge Nov 14, 2023, 2:08 PM

#

perfectly balanced
Lvstri uses daxa
I stop using phobos

wicked notch Nov 14, 2023, 2:11 PM

#

I think I'll start using other's people stuff after I make a render graph for my own stuff

glass sphinx Nov 14, 2023, 2:40 PM

#

a mostly complete rendergraph is massive pain and work

#

thats when i ~~enslaved saky~~ saky joined. Without our combined brains it would havebeen impossible

#

@delicate rain tell your tg pain

delicate rain Nov 14, 2023, 3:52 PM

#

It is pain

#

Took us like a month to just think of all the shit

wispy spear Nov 14, 2023, 6:46 PM

#

frank sail why not coarser granularity

something something sparse something? : )

wicked notch Nov 16, 2023, 2:06 PM

#

Currently thinking about full bindless, but I'm wondering if I have enough descriptor set bindings

#

I need:

binding for everything that is VK_DESCRIPTOR_TYPE_SAMPLER
binding for everything that is VK_DESCRIPTOR_TYPE_SAMPLED_IMAGE
binding for everything that is VK_DESCRIPTOR_TYPE_STORAGE_IMAGE

#

Possibly even one for VK_DESCRIPTOR_TYPE_UNIFORM_BUFFER but let's not KEKW

frank sail Nov 16, 2023, 2:08 PM

#

you could use mutable descriptors and make everything in the same binding

#

but I think support for that is extremely low nervous

#

inshallah we will have d3d12isms

wicked notch Nov 16, 2023, 2:09 PM

#

do I actually need that

frank sail Nov 16, 2023, 2:10 PM

#

ok support is actually decent on newer desktop gpus

frank sail Nov 16, 2023, 2:10 PM

#

wicked notch do I actually need that

no, but it allows you to jam everything into one binding

#

mainly it's used for porting d3d12 apps to vulkan KEKW

wicked notch Nov 16, 2023, 2:11 PM

#

ye I could just do this

layout (set = 0, binding = 0) uniform sampler sampler_table[];
layout (set = 0, binding = 0) uniform samplerShadow sampler_table[];

layout (set = 1, binding = 0) uniform image1D image_table[];
layout (set = 1, binding = 0) uniform image2D image_table[];
layout (set = 1, binding = 0) uniform image3D image_table[];
layout (set = 1, binding = 0) uniform image1DArray image_table[];
layout (set = 1, binding = 0) uniform image2DArray image_table[];
...```

frank sail Nov 16, 2023, 2:11 PM

#

it also wastes memory because it essentially makes your descriptors a union

wicked notch Nov 16, 2023, 2:12 PM

#

it's ok I'll just allocate enough memory that nobody's ever going to use

#

128MiB of descriptor mem sounds reasonable enough

frank sail Nov 16, 2023, 2:12 PM

#

wicked notch ye I could just do this ```glsl layout (set = 0, binding = 0) uniform sampler sa...

oh cool, can image descriptors alias without any extension?

wicked notch Nov 16, 2023, 2:12 PM

#

I believe that's what daxa does

frank sail Nov 16, 2023, 2:12 PM

#

seems legit

wicked notch Nov 16, 2023, 2:13 PM

#

frank sail oh cool, can image descriptors alias without any extension?

you need to do formatless ext etc but ye

frank sail Nov 16, 2023, 2:13 PM

#

ye but everyone has it so eh

wicked notch Nov 16, 2023, 2:16 PM

#

I'm having a cursed idea

#

so you know how textures can be streamed in and out right

#

I'm thinking of shader_resource_table.remove_resource() and shader_resource_table.insert_resource();

#

Instead of having dummy descriptors, I just use a flat hash map that keeps track of all my textures and textures ids

#

so I can remap indices on the fly

frank sail Nov 16, 2023, 2:19 PM

#

I'm not sure I follow either approach lol

#

But I'm just a duck

wicked notch Nov 16, 2023, 2:19 PM

#

I'm not sure either approach is doable either

#

KEKW

#

I'm searching for the laziest, easiest way I can do this

#

leveraging the power of flat hash maps

frank sail Nov 16, 2023, 2:20 PM

#

you basically just need to allocate indices for descriptors, right

#

or are you trying to solve something else

wicked notch Nov 16, 2023, 2:21 PM

#

ye I just need to allocate indices for descriptors

#

and be able to remove them without shifting shit around

#

like a page allocator where 1 descriptor = 1 page

frank sail Nov 16, 2023, 2:21 PM

#

no need to overthink it bleakekw

#

yeah just keep a list of free indices or something

#

a "free list" of sorts

wicked notch Nov 16, 2023, 2:22 PM

#

nah slow

#

__builtin_clz baby

#

even shrimpler

#

and I can reuse the old crusty VSM page allocator I did on the CPU side

#

when I still believed in hw sparse 😭

frank sail Nov 16, 2023, 2:23 PM

#

lol

#

even I'm using the dumb bit array for my VSM so I can't complain

wicked notch Nov 16, 2023, 2:23 PM

#

it's a great approach

frank sail Nov 16, 2023, 2:25 PM

#

but allocations are O(n), rip perf

wicked notch Nov 16, 2023, 2:25 PM

#

worst case smart

frank sail Nov 16, 2023, 2:25 PM

#

when you have billions of descriptors that is

wicked notch Nov 16, 2023, 2:25 PM

#

yeah

#

if I have 16k descriptors then worst case it's O(256) KEKW

frank sail Nov 16, 2023, 2:26 PM

#

a performance disaster in vblancospeak

wicked notch Nov 16, 2023, 2:26 PM

#

so more like O(sqrt(n))

frank sail Nov 16, 2023, 2:28 PM

#

Wait what

#

Isn't it just n/32

wicked notch Nov 16, 2023, 2:30 PM

#

ye but O(n/64) is asymptotically equivalent to O(n)

frank sail Nov 16, 2023, 2:30 PM

#

Ye

wicked notch Nov 16, 2023, 2:30 PM

#

so I paint a better picture by saying O(sqrt(n))

#

even though it's a lie

frank sail Nov 16, 2023, 2:30 PM

#

The secret ingredient is lying

wicked notch Nov 16, 2023, 2:32 PM

#

I gotta cache ids btw

#

and give them a TTL

frank sail Nov 16, 2023, 2:33 PM

#

Ttl?

wicked notch Nov 16, 2023, 2:34 PM

#

time to live

#

so a descriptor slot remains allocated as long as it's TTL > 0

#

and it's freed when TTL = 0

frank sail Nov 16, 2023, 2:34 PM

#

Or be a man and overwrite it while in use

#

Btw I did not implement ttl for my gpu allocator in my voxel engine and chunks would sometimes artifact for a frame or two when you modified stuff bleakekw

#

The real issue was that I had occlusion culling data that was used next frame

#

But the end result was the same

wicked notch Nov 16, 2023, 2:38 PM

#

technically

#

I can update while in use

#

I'm just not sure how leinent drivers are

#

but UPDATE_AFTER_BIND actually allows it KEKW

frank sail Nov 16, 2023, 2:39 PM

#

I don't think it means modifying the descriptor that is in use though bleakekw

wicked notch Nov 16, 2023, 2:40 PM

#

ye that's API misuse

#

but you can just turn off the validation layers

frank sail Nov 16, 2023, 2:40 PM

#

One weird trick to not have any validation errors

wicked notch Nov 16, 2023, 2:44 PM

#

here's another issue, double buffered resources

#

hmm

#

actually

#

not an issue

#

I can just allocate 2 ids

#

void init() {
    view_buffer = shader_resource_table->allocate_shared_buffer_resource<frames_in_flight>(); // std::vector<buffer_id_t>(2);
}

void render() {
    thing = shader_resource_table->get_buffer_slice({ .id = view_buffer[current_frame] }); // returns buffer_slice_t, refreshes the cache
    thing.insert(...);
}```

#

oh god this is gonna take ages

#

I can basically just delete pipeline.cpp and remake it from scratch

#

amazing

glass sphinx Nov 16, 2023, 3:00 PM

#

😈

wicked notch Nov 16, 2023, 4:04 PM

#

lovely

delicate rain Nov 16, 2023, 4:31 PM

#

https://tenor.com/view/sacred-text-star-wars-luke-yoda-gif-16551138

Tenor

wispy spear Nov 16, 2023, 4:49 PM

#

wicked notch lovely

hmm hmm my stomach bubblin'

glass sphinx Nov 16, 2023, 5:25 PM

#

sexy

wicked notch Nov 16, 2023, 5:33 PM

#

with good ol macros it's a little less

#

now comes the hard part

#

the shader resource table™️

wispy spear Nov 16, 2023, 5:44 PM

#

hehe its pretty crazy how do you use all of them, must be some big ass if else block in main, no?

#

perhaps next evolution is generating the shaders to whatever you need it generate to

wicked notch Nov 16, 2023, 5:53 PM

#

I make a macro that generates more macros that access these

#

the usage I'm going for is this

#

void main() {
    vec3 payload = IRIS_STORAGE_IMAGE_2D_LOAD(uint32, image_id).xyz;
}```

wispy spear Nov 16, 2023, 6:02 PM

#

ah that do be looking daxaesque

#

or sweashopesque

wicked notch Nov 16, 2023, 6:04 PM

#

I make sure to give publicity to daxa KEKW

wispy spear Nov 16, 2023, 6:24 PM

#

ah 😄

wicked notch Nov 16, 2023, 8:15 PM

#

beautiful

wispy spear Nov 16, 2023, 8:16 PM

#

its readable, i like it

wicked notch Nov 16, 2023, 8:17 PM

#

I wouldn't say this part is especially readable but the rest is manageable at least KEKW

#define _IRIS_ACQUIRE_COMBINED_SAMPLER(dimension, type, image_id, sampler_id) sampler##dimension(_IRIS_ACQUIRE_SAMPLED_IMAGE(dimension, type, image_id), u_sampler_table[sampler_id])
#define _IRIS_ACQUIRE_COMBINED_SAMPLER_SHADOW(dimension, type, image_id, sampler_id) sampler##dimension##Shadow(_IRIS_ACQUIRE_SAMPLED_IMAGE(dimension, type, image_id), u_sampler_shadow_table[sampler_id])```

wispy spear Nov 16, 2023, 8:17 PM

#

hehe

#

sooner or later ill have to go through something like that as well

wicked notch Nov 16, 2023, 10:14 PM

#

can you return opaque types from functions in glsl

#

like

image2D id_to_descriptor(uint id) {
    return table[id];
}```

wicked notch Nov 16, 2023, 10:31 PM

#

I can't

#

but I have achieved epic syntax regardless

#

#define output_image iris_image_accessor(restrict_write, u_output_image_id)
#define texture_2d_sfloat iris_combined_sampler_2d(float32, u_texture_id, u_sampler_id)
#define texture_2d_sint iris_combined_sampler_2d(int32, u_texture_id, u_sampler_id)
#define texture_2d_uint iris_combined_sampler_2d(uint32, u_texture_id, u_sampler_id)
#define texture_2d_array_sfloat iris_combined_sampler_2d_array(float32, u_texture_id, u_sampler_id)
#define texture_2d_array_sint iris_combined_sampler_2d_array(int32, u_texture_id, u_sampler_id)
#define texture_2d_array_uint iris_combined_sampler_2d_array(uint32, u_texture_id, u_sampler_id)```

#

totally not inspired by daxa KEKW

#

#version 460
#include "bindings.glsl"

iris_declare_storage_image_descriptor_qualified(restrict_read, restrict readonly, image2D);
iris_declare_storage_image_descriptor_qualified(restrict_write, restrict writeonly, image2D);
#define input_image iris_image_accessor(restrict_read, u_input_image_id)
#define output_image iris_image_accessor(restrict_write, u_output_image_id)

layout (scalar, push_constant) restrict readonly uniform u_push_constant {
    uint u_input_image_id;
    uint u_output_image_id;
};

layout (local_size_x = 16, local_size_y = 16) in;
void main() {
    const ivec2 size = imageSize(input_image);
    if (any(greaterThanEqual(gl_GlobalInvocationID.xy, size))) {
        return;
    }
    const ivec2 position = ivec2(gl_GlobalInvocationID.xy);
    const vec4 payload = imageLoad(input_image, position);
    imageStore(output_image, position, vec4(linear_as_srgb(tonemap(payload.xyz)), 1.0));
}```

wispy spear Nov 16, 2023, 10:45 PM

#

wouldnt surprise me if daxa was inspired by sweatshop.pl 😉

wicked notch Nov 16, 2023, 10:54 PM

#

god fucking damnit this is so good

#

I can literally stop thinking about anything

#

just handle = srt->allocate(); and buffer = srt->acquire(handle);

#

it's crazy

glass sphinx Nov 16, 2023, 11:35 PM

#

it really is amazing

glass sphinx Nov 16, 2023, 11:37 PM

#

wispy spear wouldnt surprise me if daxa was inspired by sweatshop.pl 😉

i tried to see if devsh had any bindles util like that but he doesnt seem to have any

#

designing and deciding on the makros were pure pain in my head

#

endless changes

#

now im happy

wicked notch Nov 16, 2023, 11:38 PM

#

btw

#

what do you use for dummy descriptors

#

do you just create a 1x1 image or something

glass sphinx Nov 16, 2023, 11:38 PM

#

yes

wicked notch Nov 16, 2023, 11:38 PM

#

epic

glass sphinx Nov 16, 2023, 11:38 PM

#

but im considering using the robustness vulkan feature stuff

wicked notch Nov 16, 2023, 11:38 PM

#

it's quite sad we can't use null

glass sphinx Nov 16, 2023, 11:38 PM

#

you actually can

#

but you need some feature

#

it can apparently tank perf

#

so im scared of it

#

but dx12 has it default enabled for everything afaik

#

so cant be too bad

#

robustness also makes it legal to read and write out of bounds

#

it ignores writes and on reads you get 0

wicked notch Nov 16, 2023, 11:39 PM

#

device loss be gone

glass sphinx Nov 16, 2023, 11:39 PM

#

REAL

#

maybe i make it optional or something idk

#

but i think its very nice that dx12 saves you and forced gpu makers to implement hw acceleration for these checks

glass sphinx Nov 16, 2023, 11:41 PM

#

glass sphinx it can apparently tank perf

i also vaguely remember it was for mobile maybe

#

so desktop might not care

wicked notch Nov 16, 2023, 11:41 PM

#

mobile is not real so we're good

glass sphinx Nov 16, 2023, 11:41 PM

#

https://tenor.com/view/gif-gif-19491841

Tenor

wicked notch Nov 16, 2023, 11:41 PM

#

I mean if it works for D3D12 why not for Vk too

#

the hw is the same

glass sphinx Nov 16, 2023, 11:42 PM

#

yea

wicked notch Nov 16, 2023, 11:42 PM

#

I doubt the Vk/Dx drivers are much different either

glass sphinx Nov 16, 2023, 11:46 PM

#

random insane fact: ada lovlace has 128 bit atomic cas

wispy spear Nov 16, 2023, 11:52 PM

#

ah, i see

#

i really need to advance deeper into gpu drivenisms in order to understand all this

wicked notch Nov 22, 2023, 11:07 AM

#

Hmm

#

me wonder

#

couple the ShaRT with frames in flight or not

#

shit I have a devilish idea

#

one ShaRT per frame in flight

#

bleakekw

wispy spear Nov 22, 2023, 11:11 AM

#

a devshish idea?

wicked notch Nov 22, 2023, 11:11 AM

#

yes

#

KEKW

#

if one ponders the orb, one shall realize that two frames in flight might have completely different ShaRTs

wispy spear Nov 22, 2023, 11:15 AM

#

make a third frame

#

where you interpolate between them

wicked notch Nov 22, 2023, 11:15 AM

#

DLSS3 knockoff

wispy spear Nov 22, 2023, 11:15 AM

#

smart

#

oh

#

did i just reinvent it

#

i feel like i dropped into a barrel full of toxic waste and grew superpowers lol

wicked notch Nov 22, 2023, 11:16 AM

#

did you figure out descriptors in Fuk btw

#

I haven't heard much lately

wispy spear Nov 22, 2023, 11:17 AM

#

i will pick fuk up again over the wekekend

#

all this gfx nonsense has drained me : (

wicked notch Nov 22, 2023, 11:18 AM

#

take your time frogking

wicked notch Nov 22, 2023, 12:12 PM

#

hmm yes

#

a lovely 512KiB

frank sail Nov 22, 2023, 12:20 PM

#

thic

wicked notch Nov 22, 2023, 1:46 PM

#

it's 1MiB now KEKW

wicked notch Nov 22, 2023, 2:07 PM

#

hmmmmm

#

Since updating a descriptor set only comes with allocation/deallocation

#

should I make that RAII-style

wicked notch Nov 22, 2023, 11:00 PM

#

I'm lighting the daxa beam

glass sphinx Nov 22, 2023, 11:00 PM

#

https://tenor.com/view/laser-gif-20698522

Tenor

wicked notch Nov 22, 2023, 11:00 PM

#

mr potrick, how do you handle buffers/images in the gpu resource table that change within a frame in flight

#

so for example, say you have a camera buffer that you update every frame

glass sphinx Nov 22, 2023, 11:01 PM

#

well

#

either i just alloc from a per frame staging buffer (device local or host local depending on what its used for)

#

or i make an array inside the buffer of the cam info

#

and pass the index to it

#

write the appropriate part

#

the table doesnt know anything but creation and deletion

delicate rain Nov 22, 2023, 11:02 PM

#

I don't think there is any specific resource table handling to them

glass sphinx Nov 22, 2023, 11:02 PM

#

the indices are 100% tied to the resources

#

so when i change a resource between frames i pass different ids to the shaders

wicked notch Nov 22, 2023, 11:03 PM

#

what about differences in bindings between frames?

glass sphinx Nov 22, 2023, 11:03 PM

#

expand

wicked notch Nov 22, 2023, 11:04 PM

#

e.g:
frame 0: I want to allocate these two images, please update this frame's descriptor set with the two images
frame 1: I want to allocate three more images, please update this frame's descriptor set with the three images

#

afaik calling vkUpdateDescriptorSets is illegal while the set is in use

glass sphinx Nov 22, 2023, 11:04 PM

#

frog_thinkk

#

i dont understand

#

there is one descriptor set

#

you can update slots that arent used

#

its fine if the set is in use

#

the only restriction is that the specific slots within the descriptor array arent actually used

wicked notch Nov 22, 2023, 11:05 PM

#

it's fine as long as you don't touch slots that are in use?

glass sphinx Nov 22, 2023, 11:05 PM

#

yes

wicked notch Nov 22, 2023, 11:05 PM

#

pog

glass sphinx Nov 22, 2023, 11:05 PM

#

so stale slots are free to be written

#

aty any time

wicked notch Nov 22, 2023, 11:05 PM

#

so for the camera buffer, staging and then vkCmdCopy?

glass sphinx Nov 22, 2023, 11:06 PM

#

no just staging

#

well

delicate rain Nov 22, 2023, 11:06 PM

#

You can do both

#

Depends on what you want

glass sphinx Nov 22, 2023, 11:06 PM

#

well its actually bacially orthogonal to daxa @wicked notch

delicate rain Nov 22, 2023, 11:06 PM

#

No?

glass sphinx Nov 22, 2023, 11:07 PM

#

what I do is either:

have a single buffer that i copy to from staging once a frame
simply use device local host visible scratch memory that i get an offset into (linear alloc) then write and pass the bda to the shader, so no staging or copy. Just instant cpu write then use on gpu, ultra fast.

glass sphinx Nov 22, 2023, 11:08 PM

#

wicked notch so for the camera buffer, staging and then vkCmdCopy?

you can do that

#

i usually dont anymore

#

bar memory is sexier

wicked notch Nov 22, 2023, 11:08 PM

#

alright epic

#

many thanks to the daxa team

delicate rain Nov 22, 2023, 11:08 PM

#

glass sphinx what *I do* is either: 1. have a single buffer that i copy to from staging once ...

The nice thing about 2) is that daxa (task graph) handles the lifetime of the memory for you

distant lodge Nov 22, 2023, 11:09 PM

#

why only one set though? why not have 1 set per frame in flight

#

so you can update one while reading the others

delicate rain Nov 22, 2023, 11:10 PM

#

Why would you need that

#

You always allocate new buffers into unused slots

distant lodge Nov 22, 2023, 11:11 PM

#

what if next frame, I free 1 image, add its index to the freelist, and then try to write into that

wicked notch Nov 22, 2023, 11:12 PM

#

you defer the "add index to freelist" part

#

but hmm

delicate rain Nov 22, 2023, 11:12 PM

#

Wat

#

I don't follow

distant lodge Nov 22, 2023, 11:13 PM

#

imagine I have image A bound to a slot, next frame I remove this image but also register a new one, if I add A's old index to the freelist immediately then it will be picked up as the index for the new image

delicate rain Nov 22, 2023, 11:13 PM

#

If you free a buffer or an image it is only freed after GPU frame cnt catches up to the point on CPU frame cnt where it was freed

distant lodge Nov 22, 2023, 11:13 PM

#

and if I have a frame in flight, something might be reading that slot

glass sphinx Nov 22, 2023, 11:13 PM

#

distant lodge why only one set though? why not have 1 set per frame in flight

it uses way more memory to have multiple sets

#

there is also no benefit afaik

#

daxa also doesnt know what fif is

distant lodge Nov 22, 2023, 11:13 PM

#

that's interesting

glass sphinx Nov 22, 2023, 11:13 PM

#

it doesnt need to

#

fif are trivial with daxa

delicate rain Nov 22, 2023, 11:14 PM

#

delicate rain If you free a buffer or an image it is only freed after GPU frame cnt catches up...

So there is no way for you to use a slot that is currently in use on the GPU still

wicked notch Nov 22, 2023, 11:14 PM

#

Ye I'm trying to design this such that FIF are trivial too

delicate rain Nov 22, 2023, 11:14 PM

#

It is deferred

distant lodge Nov 22, 2023, 11:14 PM

#

so you don't have any core systems that rely on FIF like deletion queues

glass sphinx Nov 22, 2023, 11:14 PM

#

distant lodge what if next frame, I free 1 image, add its index to the freelist, and then try ...

that wont ever be a problem

#

its checked and deferred

glass sphinx Nov 22, 2023, 11:14 PM

#

distant lodge so you don't have any core systems that rely on FIF like deletion queues

i do

#

it doesnt rely on fif

delicate rain Nov 22, 2023, 11:14 PM

#

distant lodge so you don't have any core systems that rely on FIF like deletion queues

I don't think relying on fif is good anywhere

glass sphinx Nov 22, 2023, 11:14 PM

#

it checks a timeline semaphore

delicate rain Nov 22, 2023, 11:14 PM

#

It's too arbitrary

glass sphinx Nov 22, 2023, 11:15 PM

#

there is a cpu and gpu submit timeline

delicate rain Nov 22, 2023, 11:15 PM

#

I might want to use daxa for compute sim where there is no bound on fif

glass sphinx Nov 22, 2023, 11:15 PM

#

destructions are deferred until the gpu catches up to the cpu at the timepoint of destruction call

glass sphinx Nov 22, 2023, 11:15 PM

#

delicate rain I don't think relying on fif is good anywhere

its very easy to implement\

#

daxa still has a cleanup function that should be called once a frame to do housekeeping

#

but its not tied to fif or anything like that

delicate rain Nov 22, 2023, 11:16 PM

#

glass sphinx its very easy to implement\

It is arbitrary what daxa has is much better

distant lodge Nov 22, 2023, 11:16 PM

#

FIF is pretty convenient for anywhere you need to do n-buffered CPU-GPU sync

#

do you do timeline semaphores for all of that

wicked notch Nov 22, 2023, 11:16 PM

#

CPU-GPU timeline makes more sense though

glass sphinx Nov 22, 2023, 11:16 PM

#

wicked notch Ye I'm trying to design this such that FIF are trivial too

just be aware to not do the cherno bullshit of abstractions that have n copies inherently per resource per fif

delicate rain Nov 22, 2023, 11:17 PM

#

Yeah but that is users responsibility

#

Handling fif for readback and everything I mean

glass sphinx Nov 22, 2023, 11:18 PM

#

btw im working on a tg facelift atm that will make it easier to use, more powerful AND less loc

distant lodge Nov 22, 2023, 11:18 PM

#

yeah that's interesting though, I think if I had timeline based deletion logic I could pull FIF code out of my core vulkan context

glass sphinx Nov 22, 2023, 11:18 PM

#

https://tenor.com/view/vorpi-god-brain-when-brain-machine-done-genius-gif-17644903

Tenor

delicate rain Nov 22, 2023, 11:19 PM

#

distant lodge yeah that's interesting though, I think if I had timeline based deletion logic I...

Thinking about embedding fif into abstractions makes my brain hurt

#

Too complex for me

glass sphinx Nov 22, 2023, 11:19 PM

#

distant lodge yeah that's interesting though, I think if I had timeline based deletion logic I...

i got that from dolkar, they do that too. Very awsome. Its also verrry simple to implement. Just keep in mind that it wont defer non-submitted commands

distant lodge Nov 22, 2023, 11:19 PM

#

my context just owns the counter

#

and I mainly use it to scale buffer capacity for stuff like uploading instances

glass sphinx Nov 22, 2023, 11:19 PM

#

delicate rain Too complex for me

easy, daxa user doesnt even know what it is really cause it just works tm 😈

#

gabe forgot most things about descriptors

delicate rain Nov 22, 2023, 11:20 PM

#

I did too

glass sphinx Nov 22, 2023, 11:20 PM

#

good

#Iris - A Journey through OpenGL and beyond to learn Graphics

Example for Buffer: