Fwog and co. | Graphics Programming | Page 16

oak garden May 23, 2023, 6:43 PM

#

Hmm yeah I see

#

So the compiled backend should have them just fine I guess, even when calling from rust

long robin May 23, 2023, 6:49 PM

#

yeah

#

the shader tool bakes the shader permutations and reflection data into a bunch of headers

#

oh btw my epic glUniform calls don't work on AMD bleakekw

#

glGetUniformLocation returns -1 for those

oak garden May 23, 2023, 6:58 PM

#

dread_cat

long robin May 23, 2023, 6:58 PM

#

time to make vendor-specific paths bleakekw

oak garden May 23, 2023, 7:02 PM

#

wouldnt the workaround also work on nvidia

long robin May 23, 2023, 7:02 PM

#

the workaround I was thinking of would be to simply use the high bindings on AMD, which doesn't work on NV

oak garden May 23, 2023, 7:03 PM

#

ahh right because of the binding limit

#

vendor specific boogaloo it is

long robin May 23, 2023, 7:03 PM

#

orrrrrrr I can figure something else out

dapper gorge May 23, 2023, 7:03 PM

#

I love how Intel is completely out of the equation bleakekw

#

I have a laptop with a 12th gen intel chip because the ryzen models were all sold out 😭

#

I guess I'll just be running my app on my main machine

oak garden May 23, 2023, 7:04 PM

#

dapper gorge I love how Intel is completely out of the equation bleakekw

not a real vendor bleakekw

long robin May 23, 2023, 7:04 PM

#

GL_MAX_IMAGE_UNITS on arc is 8, which means I have to do whatever I do for NV and pray that it works

#

https://tenor.com/view/oh-blow-fish-choked-vomit-gif-16437355


#

I'd rather not have to make a hack though

#

I really just want glslang to automap bindings but not be dumb by starting at a uselessly high index

long robin May 23, 2023, 7:26 PM

#

I finally figured out how to manipulate glslang into doing what I want

#

I just needed these flags
--amb --stb comp 8 --ssb comp 8 --suavb comp 0 --sib comp 0

oak garden May 23, 2023, 7:27 PM

#

nervous

#

was

long robin May 23, 2023, 7:28 PM

#

the idea is to force textures and samplers to start at index 8, then auto-mapped bindings for images will start at index 0

oak garden May 23, 2023, 7:28 PM

#

right i see

dapper gorge May 23, 2023, 7:28 PM

#

Very nice

long robin May 23, 2023, 7:28 PM

#

guess it means I can remove le hacks now

dapper gorge May 23, 2023, 7:34 PM

#

Let us know if it works fine

#

if it does execute le push

#

So that I can execute le pull

long robin May 23, 2023, 7:34 PM

#

the current thing works on nvidia atm

#

so you can le pull whenever

#

you have an nv gpu, right?

dapper gorge May 23, 2023, 7:35 PM

#

Yes

long robin May 23, 2023, 7:35 PM

#

then it should Just Work™️ as the uploaded version has the glUniform hack

dapper gorge May 23, 2023, 7:35 PM

#

Inshallah pull I must

#

"The current GL state uses a sampler (6) that has depth comparisons disabled, with a texture object (56) with a depth format, by a shader that samples it with a shadow sampler. This will result in undefined behavior." nervous

long robin May 23, 2023, 7:49 PM

#

ignore it

oak garden May 23, 2023, 7:49 PM

#

but jaker

#

this WILL result in undefined behavior!

long robin May 23, 2023, 7:50 PM

#

it's annoying because these things are reported before the actual draw

dapper gorge May 23, 2023, 7:50 PM

#

Sir

#

My shadows are gone

#

frog_gone

long robin May 23, 2023, 7:50 PM

#

nervous

dapper gorge May 23, 2023, 7:50 PM

#

Well not gone, just very faint

#

(ignore the motion vectors)

long robin May 23, 2023, 7:51 PM

#

uh

#

these are the samplers that the gl backend creates

#

the only shadow sampler that I made is in the gltf viewer sample in fwog, which you are apparently not building

dapper gorge May 23, 2023, 7:53 PM

#

Something's amiss

#

How do I unbind a sampler

long robin May 23, 2023, 7:57 PM

#

glBindSampler(theUnitYouWantToUnbind, 0);

dapper gorge May 23, 2023, 7:57 PM

#

Which units do you bind

long robin May 23, 2023, 7:57 PM

#

several

#

dapper gorge May 23, 2023, 7:58 PM

#

Interesting

long robin May 23, 2023, 7:58 PM

#

all my homies love using sampler objects

dapper gorge May 23, 2023, 7:59 PM

#

I will now do something that for sure does NOT involve a for (i = 0 to GL_MAX_COMBINED_TEXTURE_IMAGE_UNITS)

oak garden May 23, 2023, 8:00 PM

#

bleakekw

dapper gorge May 23, 2023, 8:00 PM

#

Would you look at that, it worked NOT doing that thing I very explicitly said I wasn't going to do!
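For reference, the loop being joked about here could look roughly like the sketch below. It is written against a generic callable so that it compiles without GL headers; in a real context the callable would be `glBindSampler` and the unit count would come from `glGetIntegerv(GL_MAX_COMBINED_TEXTURE_IMAGE_UNITS, ...)`. The helper name `unbindAllSamplers` is made up for illustration.

```cpp
#include <cstdint>
#include <functional>

// Hypothetical helper: resets every texture unit to "no sampler object".
// `bindSampler` stands in for glBindSampler(unit, sampler). Binding sampler 0
// means the unit falls back to the sampling state stored in the texture itself.
inline void unbindAllSamplers(
    std::uint32_t unitCount,
    const std::function<void(std::uint32_t, std::uint32_t)>& bindSampler)
{
    for (std::uint32_t unit = 0; unit < unitCount; ++unit)
    {
        bindSampler(unit, 0);
    }
}
```

This is a blunt reset; binding your own sampler objects to the units you actually sample from avoids depending on whatever state another library left behind.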

oak garden May 23, 2023, 8:00 PM

#

incredible

long robin May 23, 2023, 8:01 PM

#

now would be a good time to start using shrimpler objects yourself

dapper gorge May 23, 2023, 8:01 PM

#

I suppose so bleakekw

oak garden May 23, 2023, 8:01 PM

#

so afaik fsr2 is an upscaling thing but would you also do aa with it by downscaling again or something like that?

dapper gorge May 23, 2023, 8:01 PM

#

I think render resolution and present resolution being equal should be TAA

oak garden May 23, 2023, 8:01 PM

#

aha

#

so it does more than just upscale

dapper gorge May 23, 2023, 8:01 PM

#

Or well, the best TAA AMD could come up with

long robin May 23, 2023, 8:01 PM

#

it's also TAA even when it's upscaling

oak garden May 23, 2023, 8:02 PM

#

very cool

#

yeh ill definitely replace my crappy MSAA with it once i get the bindings sorted out

long robin May 23, 2023, 8:02 PM

#

but you could get even more AA by combining FSR 2 with supersampling, I suppose

#

probably not huge gains I'd imagine

long robin May 23, 2023, 8:03 PM

#

oak garden yeh ill definitely replace my crappy MSAA with it once i get the bindings sorted...

ye MSAA and FSR 2 don't mix anyways

oak garden May 23, 2023, 8:04 PM

#

frogapprove

dapper gorge May 23, 2023, 8:11 PM

#

Holy shit

#

Now this is some good TAA

#

Compared to that garbage I did this is incredible

#

Big blurry mess when moving though (my fault) KEKW

#

How does FSR2 want its motion vectors again?

long robin May 23, 2023, 8:15 PM

#

https://github.com/GPUOpen-Effects/FidelityFX-FSR2#providing-motion-vectors

#

I rendered mine as rg16f, then passed {renderWidth, renderHeight} to the motion vector scale param

#

||it still artifacts, but not too badly I guess bleakekw ||

oak garden May 23, 2023, 8:18 PM

#

bleakekw

long robin May 23, 2023, 8:24 PM

#

no motion vs me shaking the camera violently

oak garden May 23, 2023, 8:24 PM

#

honestly not that bad

long robin May 23, 2023, 8:25 PM

#

yeah it's better than I expected

#

maybe I did integrate it properly

#

well, I'm still missing proper motion vectors on the skybox

#

but idk, it still seems more aliased under motion compared to the vulkan sample

oak garden May 23, 2023, 8:26 PM

#

opengl issue

long robin May 23, 2023, 8:27 PM

#

aliasing

#

not sure if that is a case that fsr2 can handle

dapper gorge May 23, 2023, 8:30 PM

#

It's a bit too sharp maybe?

long robin May 23, 2023, 8:30 PM

#

looks properly AA'd

#

do you have the RCAS flag enabled?

dapper gorge May 23, 2023, 8:31 PM

#

Ah, yeah I did 😛

#

It do be kinda iffy when shaking your mouse violently

#

But under normal human being conditions it's great

oak garden May 23, 2023, 8:33 PM

#

i mean this is zoomed in

dapper gorge May 23, 2023, 8:33 PM

#

Yeah we're pixel picking, it's great, thank you Jaker

#

I will make sure to pay you in pretty pictures

#

Like this frog, for example

oak garden May 23, 2023, 8:34 PM

#

froge

long robin May 23, 2023, 8:38 PM

#

Might be worth taking a look at the fsr2 cauldron samples btw

oak garden May 23, 2023, 8:38 PM

#

did you delete the media submodule in your 🍴

long robin May 23, 2023, 8:38 PM

#

It seemed like the image was cleaner than my own

shell inlet May 23, 2023, 8:38 PM

#

I don't remember fsr2 being this bad when shaking camera

shell inlet May 23, 2023, 8:38 PM

#

long robin aliasing

this

long robin May 23, 2023, 8:38 PM

#

oak garden did you delete the media submodule in your 🍴

I didn't explicitly delete it

oak garden May 23, 2023, 8:39 PM

#

i fought git submodules for a while to get rid of it because downloading a couple gb of assets isnt exactly great

long robin May 23, 2023, 8:39 PM

#

I was able to run the vk sample in my fork

#

I manually invoked cmake though

oak garden May 23, 2023, 8:39 PM

#

hmm yeah

#

maybe its unnecessary

long robin May 23, 2023, 8:40 PM

#

shell inlet I don't remember fsr2 being this bad when shaking camera

I'm guessing my motion vectors are still slightly wack

shell inlet May 23, 2023, 8:41 PM

#

how calculate?

dapper gorge May 23, 2023, 8:42 PM

#

ndc_prev - ndc_curr

long robin May 23, 2023, 8:42 PM

#

Bottom of these files
https://github.com/JuanDiegoMontoya/Fwog/blob/fsr2-test/example/shaders/SceneDeferredPbr.vert.glsl
https://github.com/JuanDiegoMontoya/Fwog/blob/fsr2-test/example/shaders/SceneDeferredPbr.frag.glsl

#

Variables are poorly named

#

I tried calculating motion vectors by doing the world->ndc transform in the fs, then subtracting, but somehow that made my motion vectors huge

shell inlet May 23, 2023, 8:46 PM

#

format for mv image?

long robin May 23, 2023, 8:46 PM

#

rg16f

shell inlet May 23, 2023, 8:47 PM

#

both seem fine

shell inlet May 23, 2023, 8:48 PM

#

long robin I tried calculating motion vectors by doing the world->ndc transform in the fs, ...

?

#

passed fragment world to fs?

long robin May 23, 2023, 8:49 PM

#

yeah

shell inlet May 23, 2023, 8:49 PM

#

weird

#

should be same

long robin May 23, 2023, 8:49 PM

#

I feel like I made a mistake, but it was like 2 lines of code so idk lmao

shell inlet May 23, 2023, 8:58 PM

#

did you set proper motion vector scale?

#

https://github.com/GPUOpen-Effects/FidelityFX-FSR2/blob/149cf26e1229eaf5fecfb4428e71666cf4aee374/src/ffx-fsr2-api/ffx_fsr2.h#L129

#

If your application computes motion vectors in another space - for example normalized device coordinate space - then you may use the motionVectorScale field of the FfxFsr2DispatchDescription structure to instruct FSR2 to adjust them to match the expected range for FSR2. The code examples below illustrate how motion vectors may be scaled to screen space. The example HLSL and C++ code below illustrates how NDC-space motion vectors can be scaled using the FSR2 host API.

dispatchParameters.motionVectorScale.x = (float)renderWidth;
dispatchParameters.motionVectorScale.y = (float)renderHeight;

#

https://github.com/GPUOpen-Effects/FidelityFX-FSR2#providing-motion-vectors:~:text=infinite%20far%20plane.-,Providing%20motion%20vectors,-Space

long robin May 23, 2023, 9:00 PM

#

yeah
https://github.com/JuanDiegoMontoya/Fwog/blob/fsr2-test/example/03_gltf_viewer.cpp#L667

#

also lol that formatting

#

I also tried different permutations of negating the jitter Y, but the current config seems to be the most stable

dapper gorge May 23, 2023, 9:07 PM

#

Damn, FSR2 is taking 2ms nervous

long robin May 23, 2023, 9:08 PM

#

what gpu

dapper gorge May 23, 2023, 9:09 PM

#

3070 currently

long robin May 23, 2023, 9:10 PM

#

what resolution?

dapper gorge May 23, 2023, 9:10 PM

#

A little under 1080p

long robin May 23, 2023, 9:10 PM

#

oh and what upscaling factor

dapper gorge May 23, 2023, 9:10 PM

#

None, render and present resolutions are equal

long robin May 23, 2023, 9:11 PM

#

ah

#

fsr2 isn't optimized for pure TAA, but your perf shouldn't be that bad still

#

perf should be closer to 1ms according to this chart (looking at the 6800 column)
https://github.com/JuanDiegoMontoya/FidelityFX-FSR2#performance

dapper gorge May 23, 2023, 9:13 PM

#

Here's the breakdown btw

long robin May 23, 2023, 9:13 PM

#

I'm doing a lot of suboptimal things in the backend

excess memory barriers
excess gl calls (glGetUniformLocation and glUniform1i)
no fp16
no subgroups

#

also, the persistently mapped buffers may not be offering the best performance

#

I think no subgroups hurts perf a lot in the SPD pass

#

not sure about no fp16 though

dapper gorge May 23, 2023, 9:16 PM

#

Completely unrelated, but do you know why my second and third cascade take so much longer to render? nervous

long robin May 23, 2023, 9:17 PM

#

yeah, they're rendering more stuff

dapper gorge May 23, 2023, 9:17 PM

#

The second, third and fourth cascade have the same amounts of (indirect) drawcalls

long robin May 23, 2023, 9:17 PM

#

hmm

#

are you doing any culling for them

dapper gorge May 23, 2023, 9:18 PM

#

Only frustum

long robin May 23, 2023, 9:18 PM

#

the later cascades should have larger frustums, no?

dapper gorge May 23, 2023, 9:19 PM

#

Yeah but I'm using a very low lambda so 2nd 3rd and 4th enclose the whole bistro

shell inlet May 23, 2023, 9:20 PM

#

@long robin check this out https://github.com/GPUOpen-Effects/FidelityFX-FSR2/issues/22

long robin May 23, 2023, 9:20 PM

#

dapper gorge Yeah but I'm using a very low lambda so 2nd 3rd and 4th enclose the whole bistro

it's not a surprise to me that it would take longer than the first cascade then

dapper gorge May 23, 2023, 9:20 PM

#

Of course, but the problem is the second cascade

#

it takes 0.66ms while the third takes 1.5ms

long robin May 23, 2023, 9:21 PM

#

hmm

#

so you're saying second and third are the same size, but third takes way longer?

dapper gorge May 23, 2023, 9:21 PM

#

Yes

#

You can see compute warps skyrocket in the third cascade too

#

But there's no dispatch in here...

long robin May 23, 2023, 9:22 PM

#

and unit throughput goes wack on the third cascade

dapper gorge May 23, 2023, 9:22 PM

#

Ah well I'll figure it out, I'll stop leaking my thread into this lol

shell inlet May 23, 2023, 9:27 PM

#

trying to find fsr2 code that does sampling with motion vectors

#

why is it so hard

#

it had to be the most convoluted code ever

#

so they say

For example, a motion vector for a pixel in the upper-left corner of the screen with a value of <width, height> would represent a motion that traversed the full width and height of the input surfaces, originating from the bottom-right corner.

#

doesn't that mean that the scale is actually width/2 and height/2? If ndc is [-1; 1], such that from say (-1, -1), you travel to (1, 1) the distance is 2 on each axis, then you want to scale such that
(1 - (-1))x = width
2x = width
x = width/2
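A quick way to sanity-check the arithmetic above: NDC spans [-1, 1], so a full-screen move is an NDC delta of 2, and the scale that maps it onto `width` pixels is `width / 2`. The sketch below assumes the motion vectors are stored as raw NDC differences; the function names are hypothetical and not part of the FSR2 API.

```cpp
struct Vec2 { float x; float y; };

// Scale that converts an NDC-space delta (each axis in [-2, 2]) into pixels.
inline Vec2 ndcMotionVectorScale(float renderWidth, float renderHeight)
{
    return {renderWidth * 0.5f, renderHeight * 0.5f};
}

// Applies the scale component-wise, as a motion vector scale would be applied.
inline Vec2 toPixels(Vec2 ndcDelta, Vec2 scale)
{
    return {ndcDelta.x * scale.x, ndcDelta.y * scale.y};
}
```

With a 1920x1080 render target, an NDC delta of (2, 2) then comes out as (1920, 1080) pixels, which is one full screen of motion.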

#

@long robin plz respond

#

don't tell me ur already asleep

golden schooner May 23, 2023, 9:56 PM

#

https://tenor.com/view/wake-up-dog-doggy-hooman-get-up-gif-20608008


shell inlet May 23, 2023, 9:59 PM

#

https://tenor.com/view/i-am-waiting-gifs-reactions-trending-dog-gif-19523809


long robin May 23, 2023, 10:05 PM

#

I was afk

long robin May 23, 2023, 10:05 PM

#

shell inlet doesn't that mean that the scale is actually width/2 and height/2? If ndc is [-1...

Yeah that's what it would seem like

#

I'll test when I'm no longer on the john ||producing my best code||

oak garden May 23, 2023, 10:12 PM

#

shell inlet it had to be the most convoluted code ever

its extremely well documented heee

long robin May 23, 2023, 10:17 PM

#

the empty line after all the enum names annoys me 😩

oak garden May 23, 2023, 10:19 PM

#

me too lol

long robin May 23, 2023, 10:20 PM

#

however, it is consistent 😃

oak garden May 23, 2023, 10:21 PM

#

If only all of it were consistent

#

using size_t for one count and u32 for the next

long robin May 23, 2023, 10:21 PM

#

well, you see, um

oak garden May 23, 2023, 10:22 PM

#

bleakekw

golden schooner May 23, 2023, 10:22 PM

#

at least the * is left at the type

oak garden May 23, 2023, 10:22 PM

#

As it should

golden schooner May 23, 2023, 10:22 PM

#

i have a huge respect for you jaker

#

for dinglefarting all that together within a few days 🙂

oak garden May 23, 2023, 10:23 PM

#

Yeah thats honestly quite impressive

golden schooner May 23, 2023, 10:23 PM

#

working as intended or not

long robin May 23, 2023, 10:24 PM

#

shell inlet May 23, 2023, 10:26 PM

#

long robin I'll test when I'm no longer on the john ||producing my best code||

john amd?

long robin May 23, 2023, 10:33 PM

#

the other john

dapper gorge May 23, 2023, 10:34 PM

#

Competitor John

long robin May 23, 2023, 10:49 PM

#

hmm it seems like simply dividing the motion vector scale by 2 fixed things

oak garden May 23, 2023, 10:49 PM

#

that seems so random

long robin May 23, 2023, 10:49 PM

#

oak garden May 23, 2023, 10:50 PM

#

that looks quite nice

long robin May 23, 2023, 10:50 PM

#

there is still a little flickering under no motion, but I think that's caused by the high frequency detail in the scene

#

normally there would be indirect lighting to reduce the contrast

oak garden May 23, 2023, 10:51 PM

#

do you happen to know if ffxFsr2ContextDispatch is thread safe?

long robin May 23, 2023, 10:51 PM

#

wdym

oak garden May 23, 2023, 10:51 PM

#

or just access to the context in general

#

tbh not that i would ever call it from different threads

#

just curious if i should wrap it for safety

long robin May 23, 2023, 10:52 PM

#

you can call it from multiple threads if you are using an explicit API

#

er

oak garden May 23, 2023, 10:52 PM

#

right for opengl it wouldnt be very safe ofc

long robin May 23, 2023, 10:52 PM

#

you cannot call it from multiple threads simultaneously

oak garden May 23, 2023, 10:52 PM

#

ah

#

okay so no internal sync going on

long robin May 23, 2023, 10:52 PM

#

not sure why you'd want to do that anyways

oak garden May 23, 2023, 10:52 PM

#

yeah true

#

ill just put a little disclaimer

#

the bindings are done i think so i can start integrating

long robin May 23, 2023, 10:53 PM

#

you can call it from a different thread than the one that initialized it

oak garden May 23, 2023, 10:53 PM

#

thats good enough

long robin May 23, 2023, 10:53 PM

#

but not multiple threads at once

oak garden May 23, 2023, 10:53 PM

#

frogapprove

long robin May 23, 2023, 10:53 PM

#

the context and backend each store a bunch of state that gets modified when you dispatch

#

and the state is modified in a non-atomic way, so overlapping dispatches would fook everything up
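If wrapping the context "for safety" is still wanted, one option is to funnel every call through a mutex. The class below is a generic sketch with a made-up name (`SerializedDispatcher`); it knows nothing about FSR2 and simply runs whatever callable it is given while holding a lock. The assumption is that all calls touching one context go through the same wrapper instance.

```cpp
#include <mutex>
#include <utility>

// Hypothetical wrapper: serializes calls that touch one shared context.
class SerializedDispatcher
{
public:
    // Runs `fn` while holding the lock and forwards its return value.
    template <typename Fn>
    auto dispatch(Fn&& fn) -> decltype(std::forward<Fn>(fn)())
    {
        std::lock_guard<std::mutex> lock(mutex_);
        return std::forward<Fn>(fn)();
    }

private:
    std::mutex mutex_;
};
```

The lambda passed to `dispatch` would contain the actual `ffxFsr2ContextDispatch` call. With OpenGL the lock alone is not enough, since the GL context also has to be current on the calling thread.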

oak garden May 23, 2023, 10:53 PM

#

yeah makes sense

#

i havent really looked at the impl

#

just mindlessly copied headers to rust kekw

long robin May 23, 2023, 10:54 PM

#

take a look at these files to see what kind of state there is
https://github.com/JuanDiegoMontoya/FidelityFX-FSR2/blob/master/src/ffx-fsr2-api/gl/ffx_fsr2_gl.cpp
https://github.com/JuanDiegoMontoya/FidelityFX-FSR2/blob/master/src/ffx-fsr2-api/ffx_fsr2.cpp

#

here's a vid of me adding indirect lighting to remove the flickering and own the libs

#

lol the flickering is still there

oak garden May 23, 2023, 10:56 PM

#

kekw

#

it is

#

why is that

#

it shouldnt be right pepe_think

long robin May 23, 2023, 10:58 PM

#

prolly not

dapper gorge May 23, 2023, 11:01 PM

#

I don't observe flickering btw

long robin May 23, 2023, 11:02 PM

#

what do you do for motion vectors and scale

dapper gorge May 23, 2023, 11:04 PM

#

```glsl
vec2 calculate_velocity() {
    vec4 clip_pos = i_clip_pos;
    vec4 prev_clip_pos = i_prev_clip_pos;

    clip_pos /= clip_pos.w;
    prev_clip_pos /= prev_clip_pos.w;

    return prev_clip_pos.xy - clip_pos.xy;
}
```

#

In FS

#

scale is just render resolution
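The same computation can be mirrored on the CPU, which is handy for checking the sign and magnitude of the vectors in a unit test. This is only a sketch with hypothetical type and function names; it assumes both clip-space positions were produced with unjittered matrices.

```cpp
struct Vec2 { float x; float y; };
struct Vec4 { float x; float y; float z; float w; };

// Perspective divide: clip space -> NDC (xy only).
inline Vec2 clipToNdc(Vec4 clip)
{
    return {clip.x / clip.w, clip.y / clip.w};
}

// NDC-space velocity pointing from the current position back to the previous one.
inline Vec2 ndcVelocity(Vec4 clipPos, Vec4 prevClipPos)
{
    const Vec2 curr = clipToNdc(clipPos);
    const Vec2 prev = clipToNdc(prevClipPos);
    return {prev.x - curr.x, prev.y - curr.y};
}
```

A point that moved from NDC (0, 1) to NDC (1, 0) yields a velocity of (-1, 1), i.e. the vector leads back to where the point was in the previous frame.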

long robin May 23, 2023, 11:04 PM

#

huh

#

what about jitter

#

jitter is probably where mine is messed up tbh

dapper gorge May 23, 2023, 11:05 PM

#

I just use ffxFsr2GetJitterPhaseCount and ffxFsr2GetJitterOffset

#

In VS I construct a shrimple mat4 with the jitter offsets

long robin May 23, 2023, 11:06 PM

#

I do this

  float jitterX{};
  float jitterY{};
  ffxFsr2GetJitterOffset(&jitterX, &jitterY, frameIndex, ffxFsr2GetJitterPhaseCount(renderWidth, windowWidth));
  const float jitterOffsetX = 2.0f * jitterX / (float)renderWidth;
  const float jitterOffsetY = 2.0f * jitterY / (float)renderHeight;
  const auto jitterMatrix = glm::translate(glm::mat4(1), glm::vec3(jitterOffsetX, jitterOffsetY, 0));
  const auto projUnjittered = glm::perspectiveZO(cameraFovY, renderWidth / (float)renderHeight, cameraNear, cameraFar);
  const auto projJittered = jitterMatrix * projUnjittered;
...
  dispatchDesc.jitterOffset = {jitterX, jitterY},

dapper gorge May 23, 2023, 11:11 PM

#

Ah I also do a mul by clip_pos.w

#

I actually forgor to do it the FSR2 way bleakekw

long robin May 23, 2023, 11:12 PM

#

I think my motion vectors are okay actually

dapper gorge May 23, 2023, 11:12 PM

#

```glsl
gl_Position = clip_pos + vec4(u_jitter * clip_pos.w, 0.0, 0.0);
```

#

Here's what I actually do

long robin May 23, 2023, 11:13 PM

#

spooky

dapper gorge May 23, 2023, 11:13 PM

#

Indeed

long robin May 23, 2023, 11:13 PM

#

I just pass this frame's and last frame's unjittered viewproj

dapper gorge May 23, 2023, 11:13 PM

#

Yeah, same

long robin May 23, 2023, 11:13 PM

#

what's the deal with u_jitter in your math then

dapper gorge May 23, 2023, 11:14 PM

#

Right now I don't jitter the projection at all

#

u_jitter is a vec2 of the offsets fsr2 gives me

long robin May 23, 2023, 11:14 PM

#

oh, this is for gl_Position

#

what do you pass for the jitter offset in the dispatch description?

#

the output of ffxFsr2GetJitterOffset?

dapper gorge May 23, 2023, 11:15 PM

#

Output of that yes

long robin May 23, 2023, 11:15 PM

#

what about for u_jitter?

dapper gorge May 23, 2023, 11:16 PM

#

jitter_offset * 2.0f / glm::vec2(window.width, window.height)

#

Where jitter_offset is the output of ffxFsr2GetJitterOffset

long robin May 23, 2023, 11:17 PM

#

oh you're using window size and I'm using render size

dapper gorge May 23, 2023, 11:17 PM

#

Ah well remember my render size and present size are one and the same

long robin May 23, 2023, 11:17 PM

#

ah

dapper gorge May 23, 2023, 11:17 PM

#

I think it should be render size though?

long robin May 23, 2023, 11:17 PM

#

yeah the guide says so

const float jitterX = 2.0f * jitterX / (float)renderWidth;
const float jitterY = -2.0f * jitterY / (float)renderHeight;

#

except we don't negate the Y because opengl
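Written out as a standalone helper, the pixel-to-NDC conversion being discussed might look like the sketch below. The function name and the `flipY` parameter are assumptions for illustration: `flipY` would be true where the guide negates Y and false for the OpenGL convention used in this thread.

```cpp
struct Vec2 { float x; float y; };

// Converts a sub-pixel jitter (in pixels of the render target) into an
// NDC-space translation. One pixel covers 2 / size units of NDC.
inline Vec2 jitterToNdc(Vec2 jitterPixels, float renderWidth, float renderHeight, bool flipY)
{
    const float ySign = flipY ? -1.0f : 1.0f;
    return {2.0f * jitterPixels.x / renderWidth,
            ySign * 2.0f * jitterPixels.y / renderHeight};
}
```

The division uses the render size, not the window size, because the jitter is expressed in render-target pixels.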

dapper gorge May 23, 2023, 11:18 PM

#

Best API

shell inlet May 23, 2023, 11:18 PM

#

oak garden that seems so random

no it's because I did the maths

#

https://tenor.com/view/battle-droid-compute-think-star-wars-gif-17180363


dapper gorge May 23, 2023, 11:18 PM

#

Turns out math works (huge discovery)

long robin May 23, 2023, 11:19 PM

#

I'm gonna try rendering with 1x upscaling

shell inlet May 23, 2023, 11:19 PM

#

also someone should make a PR or poke someone at AMD to update the motion vector section to include correct info

long robin May 23, 2023, 11:20 PM

#

yeah, someone

shell inlet May 23, 2023, 11:20 PM

#

like I dunno

#

someone who's working at amd

#

maybe his name starts with J

long robin May 23, 2023, 11:20 PM

#

joker

shell inlet May 23, 2023, 11:21 PM

#

joker works at amd? Well that's why hes so mentally unstable

final cove May 23, 2023, 11:21 PM

#

idk I wouldn't ask the first guy, maybe the 2nd

dapper gorge May 23, 2023, 11:21 PM

#

That's true, what happened to Jaker 1?

shell inlet May 23, 2023, 11:22 PM

#

by the way flickering is normal

#

you can see it in every game with fsr2 on any fence

long robin May 23, 2023, 11:23 PM

#

yeah, this is 1x upscale (TAA basically)

dapper gorge May 23, 2023, 11:24 PM

#

I can only see 4 blocks

shell inlet May 23, 2023, 11:24 PM

#

https://tenor.com/view/ken-jeong-community-too-small-to-read-read-reading-gif-5494204


dapper gorge May 23, 2023, 11:24 PM

#

H264 still the king btw

final cove May 23, 2023, 11:24 PM

#

seems pretty poopy if its supposed to flicker ngl

final cove May 23, 2023, 11:24 PM

#

shell inlet you can see it in every game with fsr2 on any fence

production games use fsr?

shell inlet May 23, 2023, 11:25 PM

#

yes some spoodermoon

long robin May 23, 2023, 11:25 PM

#

tons of games use fsr2 listenyoupieceofshit

shell inlet May 23, 2023, 11:25 PM

#

ok I was about to list some but ok a lot of games use fsr2

long robin May 23, 2023, 11:26 PM

#

I was indirectly responding to DR

#

not u

final cove May 23, 2023, 11:26 PM

#

I basically don't play any modern games so I legit wouldn't know

dapper gorge May 23, 2023, 11:26 PM

#

Tbh

#

Modern games are so modern you can't play them on modern hardware KEKW

final cove May 23, 2023, 11:27 PM

#

the crysis crisis

long robin May 23, 2023, 11:27 PM

#

bababooey

dapper gorge May 23, 2023, 11:27 PM

#

Spooky

long robin May 23, 2023, 11:27 PM

#

this is the official shrimple

dapper gorge May 23, 2023, 11:27 PM

#

How the hell do I not see any flickering?

long robin May 23, 2023, 11:27 PM

#

you aren't looking hard enough KEKW

final cove May 23, 2023, 11:27 PM

#

what's going on with the yellow balls inside

long robin May 23, 2023, 11:28 PM

#

also it's harder for you to observe because you aren't doing any upscaling

dapper gorge May 23, 2023, 11:28 PM

#

oak garden May 23, 2023, 11:28 PM

#

lvstri built different

long robin May 23, 2023, 11:28 PM

#

you need a more difficult area to test

#

this is 2x upscaling

long robin May 23, 2023, 11:29 PM

#

final cove what's going on with the yellow balls inside

final cove May 23, 2023, 11:29 PM

#

oh

#

is that part of the scene?

long robin May 23, 2023, 11:30 PM

#

yeah, it's to show how it interacts with particlezzzz

oak garden May 23, 2023, 11:30 PM

#

do you need to recreate the fsr2 context if the display size changes?

long robin May 23, 2023, 11:30 PM

#

yeah

dapper gorge May 23, 2023, 11:30 PM

#

Yes

oak garden May 23, 2023, 11:30 PM

#

hm alr

#

should be doable but somewhat annoying

#

ill ignore it for now

final cove May 23, 2023, 11:30 PM

#

another thing to crash and burn when your swapchain resizes

#

excellent

long robin May 23, 2023, 11:30 PM

#

you should already be recreating all window-size-related resources anyways

oak garden May 23, 2023, 11:31 PM

#

Yeah I do that somewhat implicitly

dapper gorge May 23, 2023, 11:31 PM

#

destroy_the_world()

oak garden May 23, 2023, 11:31 PM

#

eh ill think about it later

long robin May 23, 2023, 11:31 PM

#

it's very easy in opengl

oak garden May 23, 2023, 11:31 PM

#

opengl

#

ofc it is

dapper gorge May 23, 2023, 11:31 PM

#

I literally just do a reassignment KEKW

oak garden May 23, 2023, 11:31 PM

#

kekw

dapper gorge May 23, 2023, 11:31 PM

#

Checkmate Vulkaners

long robin May 23, 2023, 11:31 PM

#

https://github.com/JuanDiegoMontoya/Fwog/blob/fsr2-test/example/03_gltf_viewer.cpp#L371

oak garden May 23, 2023, 11:32 PM

#

FfxFsr2Message fpMessage can this be null btw

long robin May 23, 2023, 11:32 PM

#

yeah

#

if it's null, then don't add FFX_FSR2_ENABLE_DEBUG_CHECKING

dapper gorge May 23, 2023, 11:33 PM

#

Ah btw did you remove the asserts for commandList and device

#

`.commandList = (void*)0x1,` Might look like I was on crack while coding

long robin May 23, 2023, 11:34 PM

#

I have not touched those yet

oak garden May 23, 2023, 11:34 PM

#

seems useful to have so ill implement it anyway

long robin May 23, 2023, 11:41 PM

#

I forgor about the giant magnifier that comes with the samples

dapper gorge May 23, 2023, 11:41 PM

#

https://tenor.com/view/toybox-witch-smile-vaudeville-magnifying-glass-gif-11946669


long robin May 23, 2023, 11:48 PM

#

don't turn the sharpening > 1

shell inlet May 23, 2023, 11:49 PM

#

this is not sharpening, this is coarse sanding at this point

long robin May 24, 2023, 12:21 AM

#

how important is it to actually use the samplers from a gltf

#

because atm mine don't have a lod bias since I load the scene before creating the fsr2 context, so all the textures are blurry
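For context, the FSR 2 documentation recommends a negative texture LOD bias of `log2(renderResolution / displayResolution) - 1.0` when rendering below display resolution. A minimal sketch of that formula, with a hypothetical function name, assuming the ratio is taken along one axis:

```cpp
#include <cmath>

// Texture LOD bias following the formula from the FSR 2 documentation:
// log2(renderWidth / displayWidth) - 1.0
inline float fsr2MipBias(float renderWidth, float displayWidth)
{
    return std::log2(renderWidth / displayWidth) - 1.0f;
}
```

In OpenGL the result would typically be applied per sampler object via `glSamplerParameterf(sampler, GL_TEXTURE_LOD_BIAS, bias)`, which is why samplers created before the render resolution is known end up without it.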

daring surge May 24, 2023, 12:30 AM

#

hat

#

what

#

that's a weird question

long robin May 24, 2023, 12:31 AM

#

I'm wondering if there are gltf files that actually use a "non standard" sampler, if that makes sense

#

because I would like to just use one hardcoded sampler of my choice

daring surge May 24, 2023, 12:31 AM

#

in prop engines, it's dictated by the integration with external editors and how much the artists care (which usually comes down to style)

#

general purpose engines obviously have to care

#

and bespoke engines i would imagine almost always hardcode samplers or expose it in-editor

#

if you want to be able to arbitrarily take a gltf file off sketchfab

#

there is going to be stuff that doesn't work due to samplers, i'd imagine mostly due to the address difference though, not the sampling

long robin May 24, 2023, 12:33 AM

#

yeah

#

so things that rely on repeating uvs or something

daring surge May 24, 2023, 12:33 AM

#

is there an LOD setting with samplers?

long robin May 24, 2023, 12:33 AM

#

I imagine some pixel-style models want point filtering too

long robin May 24, 2023, 12:33 AM

#

daring surge is there an LOD setting with samplers?

yeah, you can set a lod bias

daring surge May 24, 2023, 12:33 AM

#

i see

long robin May 24, 2023, 12:34 AM

#

seems not exposed in gltf though

#

same with anisotropy

daring surge May 24, 2023, 12:34 AM

#

having 100% coverage of gltf is a relatively dumb goal imo though

#

unless your name is godot, UE, or unity

long robin May 24, 2023, 12:34 AM

#

just normie stuff like address and filter mode seem exposed by gltf

long robin May 24, 2023, 12:35 AM

#

daring surge having 100% coverage of gltf is a relatively dumb goal imo though

ye, I just want to support a "reasonable" number of models for testing

daring surge May 24, 2023, 12:35 AM

#

artists are really good at finding weird ways to use things that exist, and working around things that don't exist

long robin May 24, 2023, 12:35 AM

#

I have an idea w.r.t. samplers anyways

#

I can just construct them on the fly

#

rather than baking them into the loaded model

daring surge May 24, 2023, 12:37 AM

#

hehe this is definitely what UVs were intended for right

shell inlet May 24, 2023, 12:38 AM

#

daring surge artists are really good at finding weird ways to use things that exist, and work...

yeah like generate normal map from albedo, fiddle with the generated normal map such that it doesn't contain unit vectors anymore

#

change output color space in the engine to some obscure one because it "looks better"

daring surge May 24, 2023, 12:41 AM

#

void

#

level designers are more unhinged

#

most of them start out modding games, a lot of the time without documentation

shell inlet May 24, 2023, 12:42 AM

#

completely ignore requests to make models to scale cause "you can scale in-engine by eyeballing anyways"

daring surge May 24, 2023, 12:43 AM

#

2m tall chair

#

but yeah, if your goal is just to import random gltf models, you're probably gonna find edge cases that don't work because people do really weird things

#

samplers are probably pretty low on the graph of cost vs reward

long robin May 24, 2023, 12:45 AM

#

I fixed the issue already; my question was dumm

shell inlet May 24, 2023, 12:45 AM

#

soon anticipating artists to use ai to generate illustrations and then it turns out ai autocompleted a chunk of it to be identical to some reference they trained it on

daring surge May 24, 2023, 12:46 AM

#

what a crazy emote

#

deccer up to some business

shell inlet May 24, 2023, 12:46 AM

#

goner froger

long robin May 24, 2023, 12:47 AM

#

an absolutely wild lad

#

anyways, textures don't turn to blob when I lower the render res now

daring surge May 24, 2023, 1:14 AM

#

crazy how much better quality looks than balanced imo

shell inlet May 24, 2023, 2:14 AM

#

I think quality is the fullres TAA via upscaling and downsampling

daring surge May 24, 2023, 2:25 AM

#

yeah

#

its not surprising given what it is

#

but often quality/balanced shows little actual quality change

#

vs balanced to performance

#

e.g. video encoding

#

or graphics settings in general tbh

long robin May 24, 2023, 2:58 AM

#

shell inlet I think quality is the fullres TAA via upscaling and downsampling

Quality is 1.5x upscale

#

i.e., they're all upscaling, just to different degrees

#

balanced is 1.7x upscaling per dimension and performance is 2.0x

#

and ultra perf is 3x
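
A minimal sketch of how the render resolution could be derived from the display resolution with the ratios quoted above. The factors are treated as per-dimension scale factors, which matches the quality-mode table in the FSR 2 README as far as I know. `QualityMode`, `Extent`, `UpscaleRatio`, and `RenderExtent` are hypothetical names for illustration, and truncating (rather than rounding) the result is an assumption.

```cpp
#include <cstdint>

// Hypothetical enum mirroring the four modes mentioned in the chat.
enum class QualityMode { Quality, Balanced, Performance, UltraPerformance };

struct Extent
{
  uint32_t width;
  uint32_t height;
};

// Per-dimension upscale factor for each mode (values quoted in the chat).
constexpr float UpscaleRatio(QualityMode mode)
{
  switch (mode)
  {
    case QualityMode::Quality:          return 1.5f;
    case QualityMode::Balanced:         return 1.7f;
    case QualityMode::Performance:      return 2.0f;
    case QualityMode::UltraPerformance: return 3.0f;
  }
  return 1.0f; // Fallback: native rendering
}

// Divides each display dimension by the ratio and truncates to whole pixels.
inline Extent RenderExtent(Extent display, QualityMode mode)
{
  const float ratio = UpscaleRatio(mode);
  return {static_cast<uint32_t>(static_cast<float>(display.width) / ratio),
          static_cast<uint32_t>(static_cast<float>(display.height) / ratio)};
}
```

With a 3840x2160 display, quality mode renders at 2560x1440 and performance mode at 1920x1080, so every mode is upscaling, only by a different amount.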

shell inlet May 24, 2023, 3:12 AM

#

long robin Quality is 1.5x upscale

damn that's unexpected

daring surge May 24, 2023, 3:23 AM

#

i would not expect 1.5x and 1.7x to look perceivably different

#

would i be cruel for asking for a no FSR comparison?

long robin May 24, 2023, 3:26 AM

#

yes

#

I'll add a checkbox for native rendering (no fsr) when I'm back

long robin May 24, 2023, 7:08 AM

#

@oak garden you could make multiple fsr2 contexts if you wanted to multithread

#

then you could provide different command lists and get parallel fsr2s

#

not sure why you'd want that though lmao

oak garden May 24, 2023, 7:56 AM

#

That’s a bit weird yeah lmao

long robin May 24, 2023, 9:29 AM

#

fsr looks barely different when I use the wrong depth convention

#

I wonder if it even matters

#

thank you resharper

oak garden May 24, 2023, 9:43 AM

#

nervous

long robin May 24, 2023, 10:00 AM

#

@dapper gorge latest fsr2 stuff has goodies like not relying on extreme UB and FFX_FSR2_ALLOW_NULL_DEVICE_AND_COMMAND_LIST

#

also it should use fp16 now, if supported

oak garden May 24, 2023, 10:02 AM

#

does fsr2 even build on linux

long robin May 24, 2023, 10:02 AM

#

nop

oak garden May 24, 2023, 10:02 AM

#

lol

#

i figured from the cmake

#

oh well

long robin May 24, 2023, 10:02 AM

#

however
https://github.com/GPUOpen-Effects/FidelityFX-FSR2/pull/60

oak garden May 24, 2023, 10:03 AM

#

very cool

long robin May 24, 2023, 10:03 AM

#

it probably wouldn't be too hard to take that person's efforts and make it work with the gl backend

#

eh but you're a vulkan guzzler, so the pr already has what you want

oak garden May 24, 2023, 10:05 AM

#

true

#

i could probably integrate it in my fork eventually

#

i think i messed up the build and i cant get it to generate cmake manually bleakekw

long robin May 24, 2023, 10:06 AM

#

btw don't use my fork if you're targeting non-gl

#

I probably screwed up the cmake somehow

oak garden May 24, 2023, 10:06 AM

#

ah i can generate it from the cmd line outside vs

long robin May 24, 2023, 10:07 AM

#

yeah

oak garden May 24, 2023, 10:10 AM

#

building it makes my entire pc run at 3fps dread_cat

#

i dont know how

long robin May 24, 2023, 10:10 AM

#

same

#

I think it's the shader tool generating hundreds of permutations with every thread on your pc

oak garden May 24, 2023, 10:11 AM

#

ah probably

#

alright finally

#

yeah i did touch those

#

woops

long robin May 24, 2023, 10:19 AM

#

renderdoc doesn't seem to like something I added recently

#

ah, it's this

  if (!subgroupSupported)
  {
    return FFX_ERROR_BACKEND_API_ERROR; // GL_KHR_shader_subgroup is required
  }

#

renderdoc does not report support for the subgroup ext (or most other exts for that matter)

long robin May 24, 2023, 10:48 AM

#

yet the ext works

digital lion May 24, 2023, 10:48 AM

#

how do you set subgroupSupported?

long robin May 24, 2023, 10:48 AM

#

er

long robin May 24, 2023, 10:48 AM

#

digital lion how do you set `subgroupSupported`?

  bool subgroupSupported = false;

  GLint numExtensions{};
  glGetIntegerv(GL_NUM_EXTENSIONS, &numExtensions);
  for (GLint i = 0; i < numExtensions; i++)
  {
    const auto* extensionString = reinterpret_cast<const char*>(backendContext->glFunctionTable.glGetStringi(GL_EXTENSIONS, i));
    if (strcmp(extensionString, "GL_KHR_shader_subgroup") == 0)
    {
      GLint supportedStages{};
      backendContext->glFunctionTable.glGetIntegerv(GL_SUBGROUP_SUPPORTED_STAGES_KHR, &supportedStages);
      if (supportedStages & GL_COMPUTE_SHADER_BIT)
      {
        subgroupSupported = true;
      }
    }
    if (strcmp(extensionString, "GL_NV_gpu_shader5") == 0 || strcmp(extensionString, "GL_AMD_gpu_shader_half_float") == 0)
    {
      deviceCapabilities->fp16Supported = true;
    }
  }

  if (!subgroupSupported)
  {
    return FFX_ERROR_BACKEND_API_ERROR; // GL_KHR_shader_subgroup is required
  }

digital lion May 24, 2023, 10:49 AM

#

ok. I know that on the latest and greatest AMD OpenGL drivers, querying it like that doesn't actually return GL_KHR_shader_subgroup even though you can use it in GLSL

long robin May 24, 2023, 10:58 AM

#

man, the new nsight (gpu traces in particular) is way better

#

no more horrendous lag when a capture is open, interface looks a lot more like RGP

#

there is even an analysis view that shows estimated gain by fixing various things

#

too bad the handy little tips don't appear on compute passes, which is where I spend about 90% of my frame 😄

dapper gorge May 24, 2023, 11:11 AM

#

long robin @dapper gorge latest fsr2 stuff has goodies like not relying on extreme ...

Superb

viral haven May 24, 2023, 4:31 PM

#

long robin man, the new nsight (gpu traces in particular) is way better

I read somewhere that gpu trace supports async compute (pretty nice), I think the normal frame capture doesnt. I guess it means this (https://developer.nvidia.com/blog/the-peak-performance-analysis-method-for-optimizing-any-gpu-workload/) is outdated now 😦

long robin May 24, 2023, 4:36 PM

#

yeah :'(, fortunately there is a blog post telling us how to migrate

viral haven May 24, 2023, 4:39 PM

#

long robin yeah :'(, fortunately there is a blog post telling us how to migrate

do you have any links for it 👀

long robin May 24, 2023, 4:40 PM

#

Yeah, one sec

viral haven May 24, 2023, 4:52 PM

#

Maybe it's this? https://developer.nvidia.com/blog/optimizing-vk-vkr-and-dx12-dxr-applications-using-nsight-graphics-gpu-trace-advanced-mode-metrics/

long robin May 24, 2023, 4:57 PM

#

it's this @viral haven
https://developer.nvidia.com/blog/migrating-from-range-profiler-to-gpu-trace-in-nsight-graphics/

NVIDIA Technical Blog

Avinash Baliga

Migrating from Range Profiler to GPU Trace in Nsight Graphics | NVI...

Starting in Nsight Graphics 2023.1, the GPU Trace Profiler is the best way to profile your graphics application at the frame level. The Frame Profiler activity, and the Range Profiler tool window…

viral haven May 24, 2023, 5:06 PM

#

Thanks froge

golden schooner May 24, 2023, 5:37 PM

#

if its not in #graphics-resources perhaps its a good item for there

final cove May 24, 2023, 5:52 PM

#

while you're posting articles I've been looking at this particularly good set of resources on morton curves
http://johnsietsma.com/2019/12/05/morton-order-introduction/

long robin May 24, 2023, 6:18 PM

#

@digital lion have you already reported the bug with the AMD driver not reporting support for the subgroup ext

#

I am experiencing it right now bleakekw

#

it's not even reporting support for GL_AMD_gpu_shader_half_float

golden schooner May 24, 2023, 6:27 PM

#

they really want gl to die 😦

long robin May 24, 2023, 6:28 PM

#

wait nvm it is reporting the half float ext

golden schooner May 24, 2023, 6:28 PM

#

they really only try to kill gl

long robin May 24, 2023, 6:29 PM

#

well there was the gl/dx11 driver overhaul somewhat recently

#

that was pretty nice, but a lot of the changes were undocumented

oak garden May 24, 2023, 6:29 PM

#

golden schooner they really only try to kill gl

good ElmoFire

dapper gorge May 24, 2023, 6:32 PM

#

oak garden good ElmoFire

I am still wrapping my head around Vulkan about one week after starting studying it, I was up and running with GL within the first few days 😦

oak garden May 24, 2023, 6:32 PM

#

it does take some time

dapper gorge May 24, 2023, 6:32 PM

#

GL must remain

long robin May 24, 2023, 6:32 PM

#

byeah

  const auto* vendor = reinterpret_cast<const char*>(backendContext->glFunctionTable.glGetString(GL_VENDOR));
  if (strstr(vendor, "ATI") || strstr(vendor, "AMD"))
  {
    subgroupSupported = true;
  }

oak garden May 24, 2023, 6:33 PM

#

bleakekw

#

nice lmao

dapper gorge May 24, 2023, 6:33 PM

#

Incredible

long robin May 24, 2023, 6:36 PM

#

the last thing I want to add to the backend (frontend technically) is support for gl crusty z convention, but my question in #mathematics hasn't gotten any attention yet (I'm praying that criver sees it)

oak garden May 24, 2023, 6:38 PM

#

inshallah criver will bless you

#

habibi_pray

long robin May 24, 2023, 6:39 PM

#

also praying for driver engineers to resolve my ticket for the subgroup thing

oak garden May 24, 2023, 6:40 PM

#

could go do some angry yelling in their office

dapper gorge May 24, 2023, 6:40 PM

#

Just update the driver yourself and release a sneaky update not even the engineers know about

golden schooner May 24, 2023, 6:41 PM

#

subgroupSupported = strstr(vendor, "ATI") || strstr(vendor, "AMD");

oak garden May 24, 2023, 6:41 PM

#

that would set it to false if it was reported as supported on nv

long robin May 24, 2023, 6:42 PM

#

nv declares support for the actual ext

#

on amd it's shrimply implicit

golden schooner May 24, 2023, 6:42 PM

#

sheesh

brazen glacier May 24, 2023, 6:48 PM

#

then simply

subgroupSupported |= strstr(vendor, "ATI") || strstr(vendor, "AMD");
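
A sketch of the combined check as a pure function, so the logic can be exercised without a GL context. It assumes the extension strings have already been collected from `glGetStringi(GL_EXTENSIONS, i)` and the vendor string from `glGetString(GL_VENDOR)`. `IsSubgroupSupported` is a hypothetical helper, and this version leaves out the supported-stages query from the snippet posted earlier.

```cpp
#include <cstring>
#include <string>
#include <vector>

// Returns true if subgroup operations can be assumed to be usable.
// `extensions`: strings enumerated from the driver (assumed gathered elsewhere).
// `vendor`: the GL_VENDOR string, may be null if the query failed.
inline bool IsSubgroupSupported(const std::vector<std::string>& extensions, const char* vendor)
{
  bool supported = false;

  // Normal path: the driver lists the extension.
  for (const auto& extension : extensions)
  {
    if (extension == "GL_KHR_shader_subgroup")
    {
      supported = true;
      break;
    }
  }

  // Workaround path: the AMD driver discussed above accepts the extension in
  // GLSL without listing it, so the vendor string is used as a fallback.
  if (vendor != nullptr && (std::strstr(vendor, "ATI") || std::strstr(vendor, "AMD")))
  {
    supported = true;
  }

  return supported;
}
```

The vendor test only ever turns the flag on, so a driver that reports the extension normally is unaffected by it.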

golden schooner May 24, 2023, 6:58 PM

#

isSubgroupSupported 😉

oak garden May 24, 2023, 6:58 PM

#

imo that doesnt make it much cleaner

golden schooner May 24, 2023, 6:59 PM

#

perhaps ATI doesnt make so much sense, since you probably wont be able to run anything involving Fwog on an actual ATI

long robin May 24, 2023, 7:14 PM

#

yeetus
https://opengl.gpuinfo.org/compare.php?reports=9347,7003

long robin May 24, 2023, 7:14 PM

#

golden schooner perhaps ATI doesnt make so much sense, since you probably wont be able to run an...

and yet

#

so really the test against the "AMD" string is unnecessary afaik, I just have it to hedge my bets

dapper gorge May 24, 2023, 7:16 PM

#

ATI's ghost will forever haunt us

long robin May 24, 2023, 7:20 PM

#

on the bright side, the new driver reports GL_EXT_nonuniform_qualifier froge_love

golden schooner May 24, 2023, 7:21 PM

#

oh

#

https://tenor.com/view/pam-office-the-office-walk-away-awkward-gif-21262546

long robin May 24, 2023, 7:22 PM

#

long robin on the bright side, the new driver reports `GL_EXT_nonuniform_qualifier` froge_love

@digital lion you may enjoy this information

golden schooner May 24, 2023, 7:23 PM

#

he knows 😉

long robin May 24, 2023, 7:23 PM

#

how

golden schooner May 24, 2023, 7:23 PM

#

i believe we talked about that like a month ago or so already

#

when AMD released their new gl driver

long robin May 24, 2023, 7:24 PM

#

I know he knows that the driver supports the ext, but it finally reports that when you enumerate exts

#

instead of being secretly supported

golden schooner May 24, 2023, 7:24 PM

#

oh, ah, uh

#

ok im really gone now 😛

long robin May 24, 2023, 7:24 PM

#

np 😄

dapper gorge May 24, 2023, 7:24 PM

#

golden schooner oh, ah, uh

monke_sounds.ogg

#

Speaking of monke, I almost forgot to watch Oliver eating an apple today

long robin May 24, 2023, 7:25 PM

#

hwat

dapper gorge May 24, 2023, 7:26 PM

#

https://www.youtube.com/watch?v=GlOQnsVOa2o

digital lion May 24, 2023, 8:31 PM

#

is the ticket you've created public, I'd like to follow it

long robin May 24, 2023, 8:32 PM

#

It's not public

heavy cipher May 24, 2023, 10:20 PM

#

you can just give us your credentials to look at the ticket even if it is not public

daring surge May 24, 2023, 11:32 PM

#

you could also install a remote access tool like parsec or horizon for us to take a peek

digital lion May 24, 2023, 11:40 PM

#

you could also send us hourly screenshots of the tickets progress

daring surge May 24, 2023, 11:42 PM

#

it's the least you could do, really

long robin May 25, 2023, 5:48 AM

#

shell inlet May 25, 2023, 7:52 AM

#

long robin

does fsr2 blur your dithered transparency and shadows?

long robin May 25, 2023, 7:54 AM

#

I don't have transparency atm, but it sorta blurs shadows

#

however, the shadows don't have per-frame rng, so it's noisy no matter what

shell inlet May 25, 2023, 8:00 AM

#

so it just reconstructs the dither pattern?

long robin May 25, 2023, 8:00 AM

#

I guess yeah

shell inlet May 25, 2023, 8:00 AM

#

lol

long robin May 25, 2023, 8:01 AM

#

the worst part is that it reconstructs the low-res dither pattern

#

so you get extra big chunky bits

#

debug build btw

long robin May 25, 2023, 8:11 AM

#

long robin the worst part is that it reconstructs the low-res dither pattern

#

rsm settings no longer reset whenever you change the resolution 🙏

#

fsr2 actually resolves some amount of noise pretty well

#

it breaks when the noise is super high frequency (1spp)

#

hmm I guess this is #wip-worthy

oak garden May 25, 2023, 9:24 AM

#

looks very nice

#

did you end up fiddling more with motion vector calculation?

long robin May 25, 2023, 9:25 AM

#

no, I think it's good how it is

oak garden May 25, 2023, 9:26 AM

#

you did clip space and then pass (width,height) / 2 to fsr2 right

long robin May 25, 2023, 9:27 AM

#

I did ndc space, then resolution/2 to fsr2

oak garden May 25, 2023, 9:27 AM

#

right

long robin May 25, 2023, 9:45 AM

#

ah, I think I know why my renderer is so slow in renderdoc

#

I'm doing this for roughly 5k draws

#

I need to

- sort mah draws
- put all the uniforms in a buffer so I don't call glBufferData a billion times
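
A CPU-side sketch of those two steps, under stated assumptions: draws are sorted by a state key, then every draw's uniforms are packed into one contiguous array at aligned offsets. The idea is that the array is uploaded with a single buffer update and each draw binds its slice with `glBindBufferRange`, with the alignment taken from `GL_UNIFORM_BUFFER_OFFSET_ALIGNMENT`. `Draw`, `DrawUniforms`, `SortDraws`, and `PackUniforms` are hypothetical names, not the renderer's actual types.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical per-draw data for illustration.
struct DrawUniforms
{
  float model[16];
  uint32_t materialIndex;
};

struct Draw
{
  uint32_t pipelineKey; // e.g. shader/pipeline id
  uint32_t materialKey; // e.g. texture set id
  DrawUniforms uniforms;
};

// Rounds `value` up to the next multiple of `alignment`.
constexpr size_t AlignUp(size_t value, size_t alignment)
{
  return (value + alignment - 1) / alignment * alignment;
}

// Groups draws that share state so fewer state changes are needed between them.
inline void SortDraws(std::vector<Draw>& draws)
{
  std::stable_sort(draws.begin(), draws.end(), [](const Draw& a, const Draw& b) {
    if (a.pipelineKey != b.pipelineKey)
      return a.pipelineKey < b.pipelineKey;
    return a.materialKey < b.materialKey;
  });
}

struct PackedUniforms
{
  std::vector<unsigned char> bytes; // Upload this once per frame
  std::vector<size_t> offsets;      // Byte offset of each draw's slice
};

// Copies every draw's uniforms into one array, one aligned slot per draw.
inline PackedUniforms PackUniforms(const std::vector<Draw>& draws, size_t offsetAlignment)
{
  PackedUniforms packed;
  const size_t stride = AlignUp(sizeof(DrawUniforms), offsetAlignment);
  packed.bytes.resize(stride * draws.size());
  packed.offsets.reserve(draws.size());
  for (size_t i = 0; i < draws.size(); ++i)
  {
    const size_t offset = i * stride;
    std::memcpy(packed.bytes.data() + offset, &draws[i].uniforms, sizeof(DrawUniforms));
    packed.offsets.push_back(offset);
  }
  return packed;
}
```

For roughly 5k draws this replaces thousands of small buffer updates with one upload, at the cost of some padding per draw.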

long robin May 25, 2023, 11:26 AM

#

apparently this math for reprojection is wrong (I'm plugging motion vectors into the denoiser instead of using matrix math)
vec2 reprojectedUV = uv + textureLod(s_gMotion, uv, 0.0).xy;

shell inlet May 25, 2023, 11:30 AM

#

what?

long robin May 25, 2023, 11:31 AM

#

something else is wrong methinks

shell inlet May 25, 2023, 11:31 AM

#

reprojected uv is kinda odd name

long robin May 25, 2023, 11:31 AM

#

my motion vectors should be in uv space now
o_motion = ((v_oldPos.xy / v_oldPos.w) - (v_curPos.xy / v_curPos.w)) * 0.5;

shell inlet May 25, 2023, 11:31 AM

#

you have uv into previous frame, to put onto current uv

#

that's reprojection

long robin May 25, 2023, 11:32 AM

#

I'm going the other way

#

reverse reprojection, if you will

shell inlet May 25, 2023, 11:32 AM

#

?

#

you can have motion vectors into the future?

long robin May 25, 2023, 11:33 AM

#

I thought you were talking about projecting from last frame onto the current frame, but I guess not

#

I'm going from current frame to last

shell inlet May 25, 2023, 11:34 AM

#

but why what do you mean

#

motion vectors do tell the displacement needed to arrive at the position of the current fragment on the previous frame

#

so you treat the current one as the origin for motion vectors

#

and get the difference (target - origin)

long robin May 25, 2023, 11:37 AM

#

yeah I get how they work

#

there was just a bit of a terminology mishap

#

I think my motion vectors are fine actually

shell inlet May 25, 2023, 11:42 AM

#

long robin my motion vectors should be in uv space now `o_motion = ((v_oldPos.xy / v_oldPos...

I don't think this is in uv

#

o_motion = ((v_oldPos.xy / v_oldPos.w) - (v_curPos.xy / v_curPos.w)) * 0.5;

NDC to UV
[-1; 1] -> [0; 1] : (x+1)/2

start from NDC
(prev+1)/2 - (curr+1)/2

simplify
(prev+1 - curr+1)/2
(prev - curr + 2)/2

something's odd, you have
(prev - curr)/2

#

did I do the maths wrong

long robin May 25, 2023, 11:44 AM

#

no, I think this is another terminology moment tho

#

maybe

#

"uv" means they are suitable for math in uv space

#

"ndc" motion vectors are suitable for math in ndc space [-1, 1], but "uv" motion vectors are halved because the space is half as large

shell inlet May 25, 2023, 11:48 AM

#

if you want to pass width and height to motion vector scale in FSR2 I think you should use what I derived

full screen step from corner to corner
prev = (-1, -1)
cur = (1, 1)

get the UV space motion vector
(prev - curr + 2)/2
((-1, -1) - (1, 1) + 2)/2
((1, 1) - (3, 3))/2
(-2, -2)/2
(-1, -1)
full step in UV [0; 1] achieved

#

I am basing it on their claim that motion vectors are done such that each tells amount of pixels displaced together with the scale factor

#

"For example, a motion vector for a pixel in the upper-left corner of the screen with a value of <width, height> would represent a motion that traversed the full width and height of the input surfaces, originating from the bottom-right corner."
I can only interpret this as (0, 0) + (width, height) travels entire screen, meaning that you want [0; 1] UV and [width, height] remapping scale factor

long robin May 25, 2023, 11:56 AM

#

here is my maf

motion_ndc = ndc_old - ndc_cur
motion_uv = uv_old - uv_cur

motion_uv = (ndc_old*.5+.5) - (ndc_cur*.5+.5)
motion_uv = 0.5 * (ndc_old - ndc_cur)
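
The same math as a small CPU-side sketch, so it can be checked with numbers. `MotionUv` uses the same expression as the `o_motion` line posted earlier; `Vec2`, `Vec4`, and the helper names are hypothetical. `MotionPixels` shows what a motion scale equal to the resolution would do to a "uv" motion vector.

```cpp
struct Vec2
{
  float x, y;
};

struct Vec4
{
  float x, y, z, w;
};

// Perspective divide: clip space -> NDC in [-1, 1].
inline Vec2 NdcFromClip(Vec4 clip)
{
  return {clip.x / clip.w, clip.y / clip.w};
}

// NDC [-1, 1] -> UV [0, 1].
inline Vec2 UvFromNdc(Vec2 ndc)
{
  return {ndc.x * 0.5f + 0.5f, ndc.y * 0.5f + 0.5f};
}

// Displacement from the current position to the previous one, in UV units.
// Halving the NDC difference is enough because the +0.5 offsets cancel.
inline Vec2 MotionUv(Vec4 oldClip, Vec4 curClip)
{
  const Vec2 oldNdc = NdcFromClip(oldClip);
  const Vec2 curNdc = NdcFromClip(curClip);
  return {(oldNdc.x - curNdc.x) * 0.5f, (oldNdc.y - curNdc.y) * 0.5f};
}

// Scales a UV-space motion vector to pixels for a given resolution.
inline Vec2 MotionPixels(Vec2 motionUv, float width, float height)
{
  return {motionUv.x * width, motionUv.y * height};
}
```

A point that moved from one corner of the screen to the opposite corner gives a UV motion of magnitude 1 on each axis, which becomes a full (width, height) step once scaled by the resolution.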

#

anyways, my current motion vectors work fine for fsr2

#

the docs are also wrong about the motion vectors, they expect "uv" motion vectors rather than "ndc" ones (assuming the motion scale passed to the API is equal to the screen resolution)

#

there are two issues about this
https://github.com/GPUOpen-Effects/FidelityFX-FSR2/issues/22
https://github.com/GPUOpen-Effects/FidelityFX-FSR2/issues/78

shell inlet May 25, 2023, 11:58 AM

#

is uv you're talking about zero to one?

long robin May 25, 2023, 11:59 AM

#

it's the difference between coordinates in uv space

shell inlet May 25, 2023, 11:59 AM

#

actually doesn't matter anyways because motion vectors only denote displacement, doesn't matter which space it is because of that and that you can scale them

#

what's clear is that in the end your space must be [0, width] x [0, height]

long robin May 25, 2023, 12:00 PM

#

yeah

shell inlet May 25, 2023, 6:18 PM

#

late reply but displacement renders offset term obsolete because

b - a = c
make a relationship such that c is constant displacement between a and b, by expressing b as dependent on a and displacement c
b = a + c
substitute
(a + c) - a  = c
now add arbitrary offset x
((a + x) + c) - (a + x) = c
(a + x) + c - (a + x) = c
cancel out terms
c = c
still holds

#

so (prev - curr + 2)/2 and (prev - curr)/2 are equivalent
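
As a check on the algebra, writing the NDC-to-UV remap with the parentheses kept around the second term shows the constant offset cancelling, so the UV-space displacement is half the NDC-space displacement. Here $p$ and $c$ stand for the previous and current NDC positions.

```latex
\frac{p + 1}{2} - \frac{c + 1}{2}
  = \frac{(p + 1) - (c + 1)}{2}
  = \frac{p - c}{2}
```

For the corner-to-corner case $p = (-1, -1)$, $c = (1, 1)$ this gives $(-1, -1)$, a full step in $[0, 1]$ UV space.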

dapper gorge May 25, 2023, 9:12 PM

#

Jaker did you fix FSR2 perf

long robin May 25, 2023, 9:19 PM

#

No

#

My investigation yielded that I'm severely VRAM bottlenecked in gl for some reason

#

There was a comment in the Vulkan backend indicating that fp16 should be disabled for a certain pass on Nvidia due to high VRAM use

#

I already did that though

dapper gorge May 25, 2023, 9:22 PM

#

Time to boop your colleagues

long robin May 25, 2023, 9:22 PM

#

I wonder if maybe it's the resources I pass to fsr2 being too thicc

dapper gorge May 25, 2023, 9:24 PM

#

Yeah it is severely VRAM limited

#

L1 hit rates are also abysmal

long robin May 25, 2023, 9:24 PM

#

dapper gorge Time to boop your colleagues

Lol I'm not gonna have them waste time looking at this

#

Hmm cache hit rates are fine for me in the slow passes

dapper gorge May 25, 2023, 9:25 PM

#

This is all FSR btw

#

It's averaging 49.1% L1 hit rates

long robin May 25, 2023, 9:25 PM

#

hmm that's not the worst

#

but also not the best lol

#

do you have the latest version of the backend?

dapper gorge May 25, 2023, 9:26 PM

#

https://tenor.com/view/maybe-possibly-perhaps-gif-14588906

#

Now I do

#

It's even worse KEKW

long robin May 25, 2023, 9:28 PM

#

lol

#

what are the purple things

dapper gorge May 25, 2023, 9:29 PM

#

Subchannel switches

long robin May 25, 2023, 9:29 PM

#

ah, subchannel switches?

#

ok

#

why are these in the middle of fsr2 listenyoupieceofshit

dapper gorge May 25, 2023, 9:29 PM

#

I donut know

long robin May 25, 2023, 9:30 PM

#

(they imply the workload is switching between graphics/compute/copy)

dapper gorge May 25, 2023, 9:30 PM

#

I think there's some kind of bug within my app tbh

#

I just don't know what it is

#

There are more subchannel switches in completely random positions

long robin May 25, 2023, 9:31 PM

#

Are you using persistently mapped buffers

snow sun May 25, 2023, 9:31 PM

#

long robin why are these in the middle of fsr2 listenyoupieceofshit

i would also like to know

long robin May 25, 2023, 9:32 PM

#

I have them in the fsr backend and I wonder if they're causing issues

dapper gorge May 25, 2023, 9:32 PM

#

long robin Are you using persistently mapped buffers

None

snow sun May 25, 2023, 9:32 PM

#

i believe nv just randomly shits in some commands into your command stream

#

i saw these randomly in many captures

long robin May 25, 2023, 9:32 PM

#

Interesting

oak garden May 25, 2023, 9:33 PM

#

why does it do that

long robin May 25, 2023, 9:33 PM

#

Keeps ya on yer toes

dapper gorge May 25, 2023, 9:34 PM

#

With the advanced trace dwm.exe appears for some reason

#

Right in the middle of fsr lol

long robin May 25, 2023, 9:34 PM

#

I also need to check the shaders again to see if the extensions are actually being used

#

Especially subgroup

dapper gorge May 25, 2023, 9:35 PM

#

(Ignore the many subchannel switches in shadow map rendering, those are my fault KEKW )

golden schooner May 25, 2023, 9:35 PM

#

re dwm, schwapping vram resources perchance?

oak garden May 25, 2023, 9:36 PM

#

you never know with gl

dapper gorge May 25, 2023, 9:36 PM

#

I dunno, I have plenty of free VRAM

#

Also ReBAR is enabled

golden schooner May 25, 2023, 9:36 PM

#

you hung the bar too high

long robin May 25, 2023, 9:36 PM

#

These also might simply be regular wfis due to memory barriers

#

I think it's more likely that nsight is just being dumb though

golden schooner May 25, 2023, 9:38 PM

#

https://tenor.com/view/waving-hand-alphakep-xset-like-a-wave-moving-my-hand-gif-20802082

long robin May 25, 2023, 9:40 PM

#

Can I get a caption on this one

golden schooner May 25, 2023, 9:40 PM

#

"wave front invocation"

#

i apologise

#

no good jiff.gifs for those terms : (

heavy cipher May 25, 2023, 9:42 PM

#

golden schooner https://tenor.com/view/waving-hand-alphakep-xset-like-a-wave-moving-my-hand-gif-...

me giving virtual headpat to Jaker for his opengl heroisms

golden schooner May 25, 2023, 9:43 PM

#

that much better 🙂

dapper gorge May 25, 2023, 9:43 PM

#

Same, your hard work is appreciated Jaker 🫡

golden schooner May 25, 2023, 9:44 PM

#

jaker the wallpaper man, getting fsr2 to work in greenland

digital lion May 25, 2023, 9:46 PM

#

yes it is🫡

long robin May 25, 2023, 9:49 PM

#

nice

long robin May 25, 2023, 10:16 PM

#

btw fsr2 runs much faster on my 6800

#

1.7ms at 4k, quality (1.5x) scaling

golden schooner May 25, 2023, 10:27 PM

#

compared to how much without fsr2?

long robin May 25, 2023, 10:28 PM

#

that's how long fsr2 takes, sorry I wasn't clear

golden schooner May 25, 2023, 10:28 PM

#

no i am just an idiot

long robin May 25, 2023, 10:28 PM

#

here is what I'm comparing it against
https://github.com/GPUOpen-Effects/FidelityFX-FSR2#performance

golden schooner May 25, 2023, 10:28 PM

#

what you said earlier makes absolute sense

#

ah

#

can i just run it after checking out the fsr2-test branch?

long robin May 25, 2023, 10:29 PM

#

0.4ms at 1080p frogapprove

#

the branch is quite outdated 😦

golden schooner May 25, 2023, 10:29 PM

#

oki

long robin May 25, 2023, 10:29 PM

#

I can push to it real quick if you wanna test

#

but it will have some bugs 😄

golden schooner May 25, 2023, 10:30 PM

#

i could give it a try here if you want

long robin May 25, 2023, 10:30 PM

#

lemme write the fat commit msg

golden schooner May 25, 2023, 10:30 PM

#

lunix + gtx1060

long robin May 25, 2023, 10:31 PM

#

it won't run on loonix

golden schooner May 25, 2023, 10:31 PM

#

oh, right, you also mentioned that yestergestern

long robin May 25, 2023, 10:31 PM

#

I'm also using windows.hisms to detect renderdoc listenyoupieceofshit

final cove May 25, 2023, 10:31 PM

#

what makes it windows exclusive

golden schooner May 25, 2023, 10:31 PM

#

#include <windows.h>

long robin May 25, 2023, 10:32 PM

#

final cove what makes it windows exclusive

- shader tool being an exe
- some code relying on windows.h and msvc C extensions
- cmake being hardcoded to reject non-windows builds

#

all of which are fixable, and in fact someone has made a PR on the fsr2 github to make it build on linux

#

I'm still open to someone porting the stuff to make it work on linux to my branch

golden schooner May 25, 2023, 10:33 PM

#

but that one needs massaging

shell inlet May 25, 2023, 10:33 PM

#

bro why would they do all of the above

golden schooner May 25, 2023, 10:34 PM

#

i suppose because people more likely live on windows

oak garden May 25, 2023, 10:34 PM

#

🤷

long robin May 25, 2023, 10:34 PM

#

shell inlet bro why would they do all of the above

idk, especially the shader tool one is baffling

golden schooner May 25, 2023, 10:34 PM

#

ja 😦

oak garden May 25, 2023, 10:34 PM

#

yeah the shader tool i dont understand

shell inlet May 25, 2023, 10:34 PM

#

someone needs to get the stick out

oak garden May 25, 2023, 10:34 PM

#

the entire shader compiler folder is like 30 Mb

long robin May 25, 2023, 10:34 PM

#

the equivalent python script is like 300 loc

oak garden May 25, 2023, 10:34 PM

#

I had to delete the DX12 compiler because it wouldnt fit on crates.io's 10 Mb limit kekw

final cove May 25, 2023, 10:35 PM

#

they have binaries in the fsr repo?

#

bruh

oak garden May 25, 2023, 10:35 PM

#

their shader compiler, glslangvalidator, dxcompiler.dll and dxir.dll

long robin May 25, 2023, 10:35 PM

#

well you kinda have to ship dxc/fxc binaries

oak garden May 25, 2023, 10:35 PM

#

i mean

#

they error if the vk sdk is not installed

golden schooner May 25, 2023, 10:36 PM

#

kinda feels like hastily hacked together somehow

long robin May 25, 2023, 10:37 PM

#

probably some choices made in the beginning that ended up in the final thing since they didn't affect the target

golden schooner May 25, 2023, 10:37 PM

#

yeah

#

which is fine

#

its more like a POC anyway i guess, and AAA studios have the wo:manpower/$$$ to implement it properly

oak garden May 25, 2023, 10:39 PM

#

i do have to say i was pleasantly surprised by how shrimple it is to integrate

long robin May 25, 2023, 10:44 PM

#

@golden schooner try pulling the latest from the fsr2-test branch and running 03_gltf_viewer

#

02_deferred is broken because I haven't added motion vectors to it yet, and rsm now requires them

golden schooner May 25, 2023, 10:51 PM

#

cmake is somehow fucked

#

it builds libfwog

#

but wont detect anything else

long robin May 25, 2023, 10:52 PM

#

examples are not built by default btw

golden schooner May 25, 2023, 10:52 PM

#

oh

long robin May 25, 2023, 10:52 PM

#

you need to enable a thingy

golden schooner May 25, 2023, 10:52 PM

#

is that new?

long robin May 25, 2023, 10:52 PM

#

it's relatively new (I changed it probably a month or two ago)

#

around the time I was adding docs I think

golden schooner May 25, 2023, 10:52 PM

#

oi

#

hmm perhaps i havent checked since

long robin May 25, 2023, 10:53 PM

#

don't worry, I've already confused myself with that change KEKW

golden schooner May 25, 2023, 10:53 PM

#

oki what do i enable where? 🙂

long robin May 25, 2023, 10:53 PM

#

erm

#

I just check this box in vs

golden schooner May 25, 2023, 10:53 PM

#

ah

#

found it

#

option(FWOG_BUILD_EXAMPLES "Build the example projects for Fwog." FALSE)

#

ye 🙂 i dont use cmake-gui

long robin May 25, 2023, 10:54 PM

#

I don't even use cmake-gui anymore except to build other projects

#

just VS's integration

golden schooner May 25, 2023, 10:54 PM

#

i just let cmake addon in vscode do all the things

long robin May 25, 2023, 10:55 PM

#

there is probably a nice button for you to check too

golden schooner May 25, 2023, 10:55 PM

#

and cashually switch a FALSE to TRUE in one's cmakelist

#

nah, i have to edit the .txts

long robin May 25, 2023, 10:55 PM

#

https://stackoverflow.com/questions/18435516/how-to-set-a-cmake-option-at-command-line

golden schooner May 25, 2023, 10:55 PM

#

but bulding/selecting targets works

#

its all good

#

[cmake] CMake Error at build/_deps/fsr2-src/CMakeLists.txt:69 (message):
[cmake]   Cannot find MSVC toolset version 142 or greater.  Please make sure Visual
[cmake]   Studio 2019 or newer installed

#

I dont understand why americans put 2 spaces after . irritates the shit out of me

long robin May 25, 2023, 10:56 PM

#

fsr2 isn't very friendly to devs who don't conform

golden schooner May 25, 2023, 10:56 PM

#

hehe

long robin May 25, 2023, 10:56 PM

#

golden schooner I dont understand why americans put 2 spaces after . irritates the shit out of m...

only old people do that here

golden schooner May 25, 2023, 10:56 PM

#

dragonslayer does that too

long robin May 25, 2023, 10:57 PM

#

he's old

#

imo it isn't as bad as when people put a space before punctuation (ahem gob)

golden schooner May 25, 2023, 10:57 PM

#

hes en baguette beljeeque, they do be plenking, but ye

long robin May 25, 2023, 10:59 PM

#

when I'm done with this branch, I'll probably turn this into its own example that is quarantined behind a flag

#

or maybe its own repo

golden schooner May 25, 2023, 11:00 PM

#

are you going to turn this into an article for your blog?

#

the porting endeavour

long robin May 25, 2023, 11:01 PM

#

I was thinkin about it

golden schooner May 25, 2023, 11:01 PM

#

https://tenor.com/view/make-it-so-jean-luc-picard-star-trek-the-next-generation-make-it-happen-gif-23455354

#

iknewyoudsaythatjiff.gif 😛

oak garden May 25, 2023, 11:01 PM

#

golden schooner hes en baguette beljeeque, they do be plenking, but ye

i think thats a french thing

#

i dont do that

golden schooner May 25, 2023, 11:01 PM

#

it is

#

to plenk (the act of leaving a space before the sentence terminator)

long robin May 25, 2023, 11:02 PM

#

as if I need more evidence that frenchies are crazed

golden schooner May 25, 2023, 11:02 PM

#

well you didnt ask for it, but zer you go

#

pengu you are not from the bagette part of beljeek, you are from the cheese corner of beljeekistan

oak garden May 25, 2023, 11:03 PM

#

i do not associate with the cheeseheads

#

but yes

golden schooner May 25, 2023, 11:03 PM

#

hehe

oak garden May 25, 2023, 11:03 PM

#

i have one living in my hallway

#

i dislike him very much

golden schooner May 25, 2023, 11:04 PM

#

how come?

#

he likes the stinky kinds of tscheeses?

oak garden May 25, 2023, 11:04 PM

#

how do i say that

#

hmm

#

he's kinda used to the comfy "mom provides everything for me"

long robin May 25, 2023, 11:05 PM

#

l'estench d'echeese

golden schooner May 25, 2023, 11:05 PM

#

ah

oak garden May 25, 2023, 11:05 PM

#

bit stuck up

#

kind of a pain in the backdoor

#

but i move out in a month so ill tolerate it

golden schooner May 25, 2023, 11:05 PM

#

eggcellent

oak garden May 25, 2023, 11:05 PM

#

for a bit longer

#

my new place is much nicer

#

private kitchen and bathroom is a huge plus

golden schooner May 25, 2023, 11:06 PM

#

+1

long robin May 25, 2023, 11:20 PM

#

shell inlet actually doesn't matter anyways because motion vectors only denote displacement,...

btw my issue was that I wasn't applying the jitter to my uv before reprojecting

#

it was working before because I was accidentally passing a jittered invViewProj without realizing KEKW

shell inlet May 25, 2023, 11:22 PM

#

wait but jitter is subpixel

#

and motion vectors are per pixel

#

why it matters

long robin May 25, 2023, 11:23 PM

#

because it affects the location of the reprojected uv

#

the subpixel location matters since I filter

#

the only thing that confuses me is that the frame I'm reprojecting onto was jittered in a different way, so I feel like that should matter too

shell inlet May 25, 2023, 11:28 PM

#

wait I get it, the jitter makes up for the displacement in the target resolution

#

say you have 1/4 render resolution, then full resolution would have 4x4 tile + 4x4 jitter offset as the pixel

#

or more than just the 4x4 grid if you account for TAA

long robin May 25, 2023, 11:31 PM

#

this rsm denoising stuff is all happening at the render resolution too though rgbemoji

#

hmm, I think I see what you mean

#

but I'm also pretty sure it's not exclusive to upscaling, but rather any TAA with per-frame jittering

shell inlet May 25, 2023, 11:33 PM

#

TAA at least makes sense in motion, with upscaling I still have no idea what happens when you just move around and low resolution geometry changes shape

long robin May 25, 2023, 11:35 PM

#

I think a big part of it is that internal upscaled depth and color are maintained between frames (in the TAAU impl)

#

so you can compare the low-res input with the reconstructed image from last frame and do some magic heuristics to reconstruct the current frame

shell inlet May 25, 2023, 11:37 PM

#

I don't think upscaling depth helps anything in motion

#

motion invalidates any reconstruction

#

just rotating camera is not motion, what I'm talking about is real moving and deforming geometry, especially the edges

#

and even worse stuff like moving blades of grass

#

or hair

long robin May 25, 2023, 11:40 PM

#

it's still possible to get high quality motion vectors from things like that

#

I think getting good motion vectors from literally everything is one of the painful parts of TAA(U) integration in a real engine

shell inlet May 25, 2023, 11:42 PM

#

this isn't about motion vectors anymore though I just segued into something different at this point

long robin May 25, 2023, 11:42 PM

#

at least FSR 2 lets you provide masks for things like transparent stuff and scrolling textures

long robin May 25, 2023, 11:42 PM

#

shell inlet this isn't about motion vectors anymore though I just segued into something diff...

where're we at now

golden schooner May 25, 2023, 11:43 PM

#

ltt's sponsor segment

shell inlet May 25, 2023, 11:43 PM

#

my point is that if all you have is 1 pixel per tile when geometry moves you gotta drop it to pure blurry low res quality

#

upscaling only makes sense for somewhat still geometry to me

long robin May 25, 2023, 11:44 PM

#

have you read this
https://github.com/GPUOpen-Effects/FidelityFX-FSR2#the-technique

shell inlet May 25, 2023, 11:45 PM

#

yeah it doesn't have any insights

long robin May 25, 2023, 11:45 PM

#

just outsights

shell inlet May 25, 2023, 11:46 PM

#

I mean it's not very hard to make a TAAU

#

hard part is to solve TAAU in motion

long robin May 25, 2023, 11:46 PM

#

shell inlet my point is that if all you have is 1 pixel per tile when geometry moves you got...

what does "when geometry moves" mean exactly?

#

are you referring to e.g., objects having different transforms between frames

shell inlet May 25, 2023, 11:47 PM

#

imagine a bending leaf which is also over some distant background

#

there is a hard edge between those which you need to reconstruct with just 1 pixel per some tile

#

this leaf moves such that you only get 1 pixel per tile each frame, all different tiles

#

what do you reconstruct?

#

does that sound like it's possible to get meaningful info?

#

at least with just TAA you have a proper 1:1 geometry outline every frame and extra info is subpixel

long robin May 25, 2023, 11:51 PM

#

shell inlet there is a hard edge between those which you need to reconstruct with just 1 pix...

what is a tile

#

pixel in input resolution?

shell inlet May 25, 2023, 11:51 PM

#

render resolution is 1 pixel in a tile of the target resolution

#

each frame you render 1 pixel in a tile as a consequence

#

from that you'd reconstruct the full image

#

given that it's all static of course, it'd converge to ground truth

#

but in motion naah

long robin May 25, 2023, 11:54 PM

#

how does it work when there are 1.5 pixels per tile

shell inlet May 25, 2023, 11:54 PM

#

simple, you don't actually treat pixels as discrete with reconstruction

long robin May 25, 2023, 11:55 PM

#

shrimple as that

shell inlet May 25, 2023, 11:55 PM

#

it's just easier to think of it as tile

#

in reality it's continuous

#

you can splat samples onto the target image and eventually it converges to the full resolution

#

that's what wavelet upscalers do iirc

#

the problem is still a problem, when stuff moves it's hard to make sense of it

#

when the scene is static and you only rotate the view you still get pretty small error because motion vectors compensate for the change

#

and only rotating the camera is not deforming anything
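The "1 pixel per tile, converges when static" picture can be sketched in 1D. This is only an illustration under stated assumptions: the jitter pattern simply cycles through every position in a tile, the scene is fully static, and `accumulate_static` is a made-up name rather than anything from FSR 2 or a real upscaler.

```python
def accumulate_static(truth, scale, frames):
    """Splat one jittered sample per tile per frame into a full-resolution buffer."""
    recon = [None] * len(truth)
    for frame in range(frames):
        # Hypothetical jitter sequence: visit each position inside the tile in turn.
        offset = frame % scale
        for tile_start in range(0, len(truth), scale):
            target = tile_start + offset
            if target < len(truth):
                # Static scene: a sample taken on an earlier frame is still valid later.
                recon[target] = truth[target]
    return recon


truth = [3, 1, 4, 1, 5, 9, 2, 6]
after_one_frame = accumulate_static(truth, 4, 1)
after_four_frames = accumulate_static(truth, 4, 4)
```

After one frame only one target pixel per tile is known; after `scale` frames the static image is fully recovered. Once the content changes between frames, the earlier samples are no longer valid, which is the situation shell inlet is describing.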

long robin May 26, 2023, 12:01 AM

#

what is deforming

shell inlet May 26, 2023, 12:02 AM

#

my enthusiasm to explain it all over again

#

imagine a cubemap

#

you put it on the skybox

#

rotating a camera does not alter anything

#

but now if clouds move that's deforming geometry

#

does that make it clearer

long robin May 26, 2023, 12:04 AM

#

I see

#

what I don't get is how that's fundamentally different from transforming the camera

shell inlet May 26, 2023, 12:04 AM

#

because maths

long robin May 26, 2023, 12:05 AM

#

you can still get disocclusions and such when the camera moves

shell inlet May 26, 2023, 12:05 AM

#

moves yes rotates no

long robin May 26, 2023, 12:05 AM

#

that was the missing part for me

shell inlet May 26, 2023, 12:06 AM

#

moving camera is really same as moving object relative to it

#

mathematically speaking

long robin May 26, 2023, 12:07 AM

#

I still don't get how info is irrecoverable except when disocclusions occur

#

unless that was the point you were making

shell inlet May 26, 2023, 12:08 AM

#

because we also have changing shading especially if motion is fast

#

specular is nasty adds lots of high frequency details

#

add a bump map it's a nightmare

long robin May 26, 2023, 12:08 AM

#

yeh

#

probs why high frequency detail turns into mush under movement with TAAU

shell inlet May 26, 2023, 12:09 AM

#

so you either have bad ghosting or bad quality from dropping history

long robin May 26, 2023, 12:10 AM

#

I guess you could somewhat "compensate" by not using the negative sampler lod offset lmao

#

so now everything looks awful, but at least it's not blocky under movement

shell inlet May 26, 2023, 12:11 AM

#

ok I need to sleep now

long robin May 26, 2023, 12:11 AM

#

gn

long robin May 26, 2023, 7:40 AM

#

persistently mapped UBOs vs glBufferSubData

heavy cipher May 26, 2023, 7:43 AM

#

mfw when opengl

long robin May 26, 2023, 7:45 AM

#

it's like the driver delays the dma until the perfect moment to troll you

heavy cipher May 26, 2023, 7:46 AM

#

the driver senses profiling so it randomises behavior

long robin May 26, 2023, 7:47 AM

#

yeah btw I can't even reproduce the fucked up graph

#

oh wait

#

now, with glBufferSubData, I get the epic dma at the beginning of the pass

#

guess I won't worry about it then

digital lion May 26, 2023, 7:48 AM

#

long robin persistently mapped UBOs vs glBufferSubData

the one with longer frametime and empty space is persistently mapped?

long robin May 26, 2023, 7:49 AM

#

yeah

#

but it's probably random chance that caused it

#

now I need to figure out why I'm being horribly bottlenecked by vram throughput

#

even the vk one gets screwed by random dmas in the middle of the pass, but at least it's not uber vram bottlenecked (instead, it's L1 + SM throughput)

#

btw @digital lion, the subgroup bug should be fixed in the next driver release

digital lion May 26, 2023, 8:03 AM

#

amazing

long robin May 26, 2023, 8:07 AM

#

long robin even the vk one gets screwed by random dmas in the middle of the pass, but at le...

the difference between the gl shaders and the vk shaders is almost nil, so I guess it's either an epic GL moment, NV moment (since it doesn't happen on AMD), or just the inputs I'm providing being too thicc

#

meh, reduced my color input & output to R11F_G11F_B10F (from RGBA16F) and almost 0 difference in perf

#

I blame nv for making bad hw

fiery sorrel May 26, 2023, 8:28 AM

#

I have been lurkin here for the past few days and have still been confused if this is still fwog thread

long robin May 26, 2023, 8:28 AM

#

this is the "& co." part of it

fiery sorrel May 26, 2023, 8:28 AM

#

Oooh!

long robin May 26, 2023, 8:29 AM

#

there are some fwog example updates interspersed throughout

fiery sorrel May 26, 2023, 8:29 AM

#

frogapprove

#

I love fwog

#

I should write a review of what its like to use it

dapper gorge May 26, 2023, 8:33 AM

#

Fwog now has a FSR2 sample that has random performance on NV KEKW

long robin May 26, 2023, 8:34 AM

#

heh

#

tbf, it's "only" like .5ms of randomness in the fsr2 pass

dapper gorge May 26, 2023, 8:34 AM

#

How does NV hardware take 300 microseconds to switch to DMA mode smh

long robin May 26, 2023, 8:34 AM

#

fsr2 performs pretty consistently poorly on this (3-4x worse than the equivalent AMD card)

long robin May 26, 2023, 8:35 AM

#

dapper gorge How does NV hardware take 300 microseconds to switch to DMA mode smh

it's doing a copy and waiting for it I think

long robin May 26, 2023, 9:45 AM

#

Meme hw btw
https://github.com/GPUOpen-Effects/FidelityFX-FSR2/blob/master/src/ffx-fsr2-api/vk/ffx_fsr2_vk.cpp#L1303

#

(seriously though, my problem is also high vram and occupancy in the accumulate pass on NV)

#

Except when I toggle fp16, there is no epic perf difference 😢

long robin May 27, 2023, 1:24 AM

#

should I

long robin May 27, 2023, 1:43 AM

#

documenting my fsr2 journey, starting somewhere around here: #1019779751600205955 message

long robin May 27, 2023, 8:03 AM

#

it gon be lit

dapper gorge May 27, 2023, 8:09 AM

#

hype

oak garden May 27, 2023, 10:05 AM

#

Epic

shell inlet May 27, 2023, 10:32 AM

#

type

golden schooner May 27, 2023, 10:46 AM

#

long robin it gon be lit

finally it pays off to pester you to write 🙂

long robin May 28, 2023, 12:07 AM

#

can confirm that this works btw

#

you can target multiple envs with glslang like so: --target-env opengl --target-env spirv1.3

golden schooner May 28, 2023, 8:45 AM

#

you need to run them separately then neh?

long robin May 28, 2023, 9:03 AM

#

no, you can combine multiple envs at once

#

I literally invoke the shader compiler with these args:
-compiler=glslang -e main --target-env opengl --target-env spirv1.3 --amb --stb comp 8 --ssb comp 8 --sib comp 0 --suavb comp 0 -Os -S comp -DFFX_GLSL=1

golden schooner May 28, 2023, 9:21 AM

#

ah

#

oh man

#

i cant read

#

i read "you canT target multiple..."

#

-,-

long robin May 28, 2023, 9:21 AM

#

cant arget

long robin May 28, 2023, 11:47 AM

#

long robin it gon be lit

I'm accepting ideas for memes (funny or not) to put in this

#

I'm thinking about putting the stick-in-spokes meme for when I dared to try using glUniform to set image bindings in a spirv shader

oak garden May 28, 2023, 11:53 AM

#

yes

#

kek

long robin May 28, 2023, 12:09 PM

#

I already have 1000 words wtf

#

why can't I pump out actually useful stuff this quickly

golden schooner May 28, 2023, 12:12 PM

#

: )

#

dont be too hard on yourself

long robin May 28, 2023, 12:16 PM

#

I'm just mildly frustrated that I haven't worked on the other article that much lately

golden schooner May 28, 2023, 12:17 PM

#

work on the other article after dodo

#

https://tenor.com/view/obi-wan-obi-was-identification-gif-22485508

golden schooner May 29, 2023, 6:01 PM

#

ill just leave that here https://developer.download.nvidia.com/opengl/tutorials/bindless_graphics.pdf

long robin May 30, 2023, 11:01 AM

#

I recently noticed that my fix for my temporal reprojection (incorporating jitter) adds a small bias which makes it look like the pixels are moving up and right 😩

#

it's weird because the jitter is distributed around 0, so no single direction should be favored over time

shell inlet May 30, 2023, 12:02 PM

#

off-centre sampling causes this

#

when you forgor add 0.5 offset

long robin May 30, 2023, 12:04 PM

#

vec2 uv = (vec2(gid) + 0.5) / uniforms.targetDim;

#

it also only happens when I add the jitter
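For reference, a small sketch of the half-pixel offset under discussion, mirroring the `uv` line above in Python. The function names are hypothetical; the only assumption is that `gid` is an integer pixel coordinate and `target_dim` is the image size in pixels.

```python
def pixel_center_uv(gid, target_dim):
    """UV of the pixel's center, matching (vec2(gid) + 0.5) / targetDim."""
    return ((gid[0] + 0.5) / target_dim[0], (gid[1] + 0.5) / target_dim[1])


def pixel_corner_uv(gid, target_dim):
    """UV of the pixel's corner, i.e. what you get if the 0.5 is forgotten."""
    return (gid[0] / target_dim[0], gid[1] / target_dim[1])


center = pixel_center_uv((0, 0), (4, 4))
corner = pixel_corner_uv((0, 0), (4, 4))
```

The two differ by half a pixel in each axis (`0.5 / target_dim`), which is the kind of constant drift that off-centre sampling produces. Since the shader already adds 0.5, this particular cause looks ruled out here.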

shell inlet May 30, 2023, 12:05 PM

#

can you step through the shadre

#

in a debuggre

long robin May 30, 2023, 12:05 PM

#

I think I'm not accounting for last frame's jitter but that sounds like a problem for tomorrow me

long robin May 30, 2023, 12:06 PM

#

shell inlet in a debuggre

not in OpenGL sadly

#

gotta use my mental interpreter

shell inlet May 30, 2023, 12:07 PM

#

truly mental

#

if I were to write opengl-like softraster app today I would name shader interpreter source file jaker.cpp

long robin May 30, 2023, 12:09 PM

#

to make it accurate you need to leave any bugs you find in it

golden schooner May 30, 2023, 5:52 PM

#

hmm https://skybox.blockadelabs.com/

Skybox AI

Skybox AI: One-click 360° image generator from Blockade Labs

oak garden May 30, 2023, 6:01 PM

#

interesting

#

takes a while to generate though

golden schooner May 30, 2023, 6:03 PM

#

yep, a minute or so, thats fine

oak garden May 30, 2023, 6:03 PM

#

yeah

#

ill try "large canyon with rocks scattered around the place. a dense fog covers the bottom"

golden schooner May 30, 2023, 6:05 PM

#

its not as "accurate" as you would think it is/or have played with on the stable diffusion/openai discord

oak garden May 30, 2023, 6:05 PM

#

golden schooner May 30, 2023, 6:06 PM

#

oi nice

oak garden May 30, 2023, 6:06 PM

#

yeah it doesnt like my request for fog :P

#

not too bad though

golden schooner May 30, 2023, 6:06 PM

#

can you share the image nonetheless?

oak garden May 30, 2023, 6:06 PM

#

sure

golden schooner May 30, 2023, 6:06 PM

#

perhaps DM? then we dont pollute jaker's living room

oak garden May 30, 2023, 6:06 PM

#

https://skybox.blockadelabs.com/dfa5f490226f46ec7bdbcf418ddc9a88 heres the link

long robin May 31, 2023, 5:18 AM

#

golden schooner hmm https://skybox.blockadelabs.com/

are these HDR

daring surge May 31, 2023, 5:19 AM

#

no, they are jpegs

long robin May 31, 2023, 5:21 AM

#

rip

#

I guess you can always do the inverse tonemap hack bleakekw

#

hello banding my beloved
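A rough sketch of why the inverse tonemap hack bands. The chat does not say which operator would be inverted; simple Reinhard is assumed here purely for illustration, and the function names are made up.

```python
def reinhard(x):
    """Simple Reinhard tonemap: maps [0, inf) into [0, 1)."""
    return x / (1.0 + x)


def inverse_reinhard(y):
    """Inverse of the simple Reinhard curve; only valid for y < 1."""
    if y >= 1.0:
        raise ValueError("inverse Reinhard is undefined for y >= 1")
    return y / (1.0 - y)


# Gap in recovered HDR value between two adjacent 8-bit codes.
dark_step = inverse_reinhard(1 / 255) - inverse_reinhard(0 / 255)
bright_step = inverse_reinhard(254 / 255) - inverse_reinhard(253 / 255)
```

Adjacent 8-bit codes near black come back almost the same, while adjacent codes near white land very far apart in HDR space, so the bright end of an 8-bit JPEG has very few distinct values to offer after expansion.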

long robin May 31, 2023, 10:50 AM

#

agx (copied from jasper) vs aces

#

the toe on aces deletes so many details

#

anyways, I guess tony mcmapface looks similar to the left pic
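To make the remark about the toe concrete, here is a sketch using Krzysztof Narkowicz's widely used ACES filmic curve fit. That choice is an assumption: the chat does not say which ACES implementation was in the screenshot, and the AgX side is not reproduced here.

```python
def aces_fit(x):
    """Narkowicz's ACES filmic fit for a single channel, clamped to [0, 1]."""
    a, b, c, d, e = 2.51, 0.03, 2.43, 0.59, 0.14
    y = (x * (a * x + b)) / (x * (c * x + d) + e)
    return min(max(y, 0.0), 1.0)


# Compare a dark input and a mid-grey input against the identity line.
dark_in, mid_in = 0.01, 0.18
dark_out, mid_out = aces_fit(dark_in), aces_fit(mid_in)
```

With this fit, dark inputs come out darker than they went in while mid-grey is lifted, so shadow values get squeezed toward black. That is consistent with detail disappearing in the toe, though the exact curve in the image may differ.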

shell inlet May 31, 2023, 10:55 AM

#

we need to see the path to white

long robin May 31, 2023, 10:56 AM

#

it do look like this

#

https://www.shadertoy.com/view/Dt3XDr

shell inlet May 31, 2023, 10:56 AM

#

I see

#

the blues are more cyan here

#

this is mapface

golden schooner May 31, 2023, 10:57 AM

#

are your monitors calibrated?

shell inlet May 31, 2023, 10:58 AM

#

also slight shift to purple in blues in agx

long robin May 31, 2023, 10:58 AM

#

golden schooner are your monitors calibrated?

mine are

#

I got a colorimeter last year

shell inlet May 31, 2023, 10:59 AM

#

mine is not calibrated because it burns in srgb mode

#

but the difference is in luma calibration only

#

so it's either no srgb or buying a new monitor for me

long robin May 31, 2023, 11:00 AM

#

my brother had his monitors at 250 nits and saturation cranked up 💀

#

it was like staring into those pits of glowing green goo in half life

shell inlet May 31, 2023, 11:13 AM

#

what's with the colorimeter thing?

#

are you colorimetring the colors of your surroundings as a hobby?

long robin May 31, 2023, 11:20 AM

#

I used it to calibrate my monitors lol

#

and everyone else's monitors that I come across

shell inlet May 31, 2023, 11:24 AM

#

but there's like

#

dedicated sRGB mode in each monitor's internal config

#

just choosing it should be enough

#

like built-in hardware

long robin May 31, 2023, 11:31 AM

#

sRGB as opposed to HDR of some sort?

shell inlet May 31, 2023, 11:32 AM

#

I have no clue how hdr is supposed to work

long robin May 31, 2023, 11:32 AM

#

most monitors do not come from the factory well-calibrated anyways

#

plus you need to account for suboptimal viewing conditions

shell inlet May 31, 2023, 11:32 AM

#

I know there are color spaces with wider gamut

#

in that case hardware should come with settings for those as well

#

what is a suboptimal viewing condition

long robin May 31, 2023, 11:34 AM

#

ambient light > 0

shell inlet May 31, 2023, 11:35 AM

#

displays are black and absorb most ambient radiation though
