No, it actually has some basis to it. Obviously there is going to be a HUGE variance in shadow frame times from scene to scene, but from what I’ve seen, shadowdepths+shadowedlights can run you around 25% of the frame time in a dynamically lit scene (at least in some of the forest-heavy scenes I’ve been working on, and with the hardware I’m using right now). Now it’s hard to profile it specifically, but I’d assume that shadow filtering takes up a lot of the shadow frame time, due to the sampling techniques that have to be performed.
Using my example, let’s say you’ve got a frame time of 16ms; that would mean 4ms is eaten up by shadowing. If you increase the filtering math and it bumps up the shadow cost by, let’s say, 2x due to more sampling math and a wider kernel, you’ll see a frame time increase of +4ms, which would equate to ((20-16)/16)*100 = a 25% increase in overall frame time…
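Put generally, with those assumptions the overall frame time hit is just shadow share * (filter cost multiplier - 1), so 0.25 * (2 - 1) = 0.25, i.e. +25%; the 25% share itself is of course scene- and hardware-dependent, as said above.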
No, it is not hard at all. There is a dedicated entry in the built-in profiler for that, which you are unaware of, leading you to assumptions like that one, based on random myths. That is a wrong one, in any case.
The filtering part, as a rule, takes less time than depth rendering.
Which part of the so-called filtering math exactly are you intending to increase that would result in 4ms of shadow filtering time inflation while addressing the issue discussed, and what part would a wider kernel play in it?
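For reference, the shadow cost can be read straight from the engine’s own tools; these console commands should do it on a roughly current 4.x build (exact entry names may differ between versions):

stat unit      // overall Game / Draw / GPU frame times
stat GPU       // per-pass GPU timings, including the shadow depth and shadow projection entries
ProfileGPU     // one-frame GPU breakdown (Ctrl+Shift+Comma), with ShadowDepths and per-light shadow projection listed separately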
With a proper slope-scaled depth bias, filtering could be cheaper than it is now. Filtering could also be optimized by using the hardware bilinear comparison sampler where possible. Currently every sample does its own soft comparison.
Related code.
// The standard comparison is SceneDepth < ShadowmapDepth
// Using a soft transition based on depth difference
// Offsets shadows a bit but reduces self shadowing artifacts considerably
float TransitionScale = Settings.TransitionScale; //SoftTransitionScale.z;
float ShadowFactor = saturate((ShadowmapDepth - Settings.SceneDepth) * TransitionScale + 1);
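For contrast, this is roughly what the hardware comparison path would look like in plain HLSL; just a sketch under assumed names (ShadowDepthTexture, ShadowCmpSampler, ShadowUV, CompareDepth are stand-ins), not the engine’s actual code:

// A comparison sampler with bilinear filtering lets the texture unit do the
// 2x2 depth compare + blend, instead of doing a soft compare per sample in ALU.
Texture2D ShadowDepthTexture;
SamplerComparisonState ShadowCmpSampler; // COMPARISON_MIN_MAG_LINEAR_MIP_POINT, comparison func LESS_EQUAL

float HardwarePCF(float2 ShadowUV, float CompareDepth)
{
    // Returns [0,1]: the four neighbouring shadow map texels are compared against
    // CompareDepth and the pass/fail results are bilinearly weighted by the hardware.
    return ShadowDepthTexture.SampleCmpLevelZero(ShadowCmpSampler, ShadowUV, CompareDepth);
}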
I’d tend to agree here. It is just that the actual performance gain is hardly measurable. This is mostly due to the fact that a large part of the 4-in-1 comparison gain is the actual 4-in-1 texture fetch, which is already used. It boils down to 1 MAD and 1 SUB instruction less, minus the cost of the hardware comparison, per 4 samples, which is… well… not much these days, but still something.
But following the same logic, I’d say that ditching the whole TransitionScale and letting the user tweak the depth bias per cascade (not a replacement for slope-scaled bias, just an alternative approach to the goals set for the existing system) would be a good alternative, don’t you think?
I feel for you. I have spent the past two years intensively learning UE4 to be able to publish my own games and (digital) art projects, not to mention spent a lot of money on the marketplace. And now it feels like all of this was wasted, because suddenly it seems Epic doesn’t give a **** anymore and I can’t be certain that I can rely on UE4 later on.
Honestly, I will finish my current project, hope it gives me some return, and then switch to Lumberyard, since Amazon is pouring a **** ton of resources into it, and they don’t do such a thing without a long-term goal.
Before choosing UE4 I gave Lumberyard a shot and I didn’t like it. More than two years later I am not seeing much progress there (maybe because I’m not there at all), but I know what progress we are making here and how much room there still is to grow, so switching is a no-no for me. The feeling for the known is much better than the feeling for the unknown…
Yep. No doubt, one way or another, SampleCmp is better than Gather plus two vector instructions. It might be even more significant on mobile, but I’m not familiar with mobile shading.
Slope-scaled bias would have eliminated the need for that soft comparison thingy, which does not seem to do the job it was intended for, apart from transitions.
Speaking of the soft comparison, the way it works now it has a linear dependency on shadow distance. In reality, one would almost always want an exponential curve relating shadow cascade number and bias. That is another reason why having Slope Bias and Bias controls adjustable for each cascade individually would be a one-size-fits-all solution.
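Purely as an illustration of what I mean, a sketch in HLSL; CascadeIndex, BaseBias, BiasGrowth, BaseSlopeBias and MaxSlopeBias are made-up names, and the values would have to be exposed per cascade rather than hard-coded:

// Constant bias that grows exponentially with cascade index rather than
// linearly with distance. A BiasGrowth in the 2-3 range would roughly follow how
// the texel footprint grows from cascade to cascade (assumption, needs tuning).
float GetCascadeDepthBias(int CascadeIndex, float BaseBias, float BiasGrowth)
{
    return BaseBias * pow(BiasGrowth, (float)CascadeIndex);
}

// Classic slope-scaled part: bias grows with the slope of the receiver relative
// to the light. NoL = saturate(dot(Normal, DirectionToLight)).
float GetSlopeScaledBias(float NoL, float BaseSlopeBias, float MaxSlopeBias)
{
    float SinAlpha = sqrt(saturate(1.0f - NoL * NoL));
    float TanAlpha = SinAlpha / max(NoL, 0.0001f);
    return min(BaseSlopeBias * TanAlpha, MaxSlopeBias);
}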
An additional thing that might be worth implementing is the ability to adjust filter size per cascade, again with full manual control. This would situationally boost performance as well as eliminate another issue some users were complaining about, namely over-blurred shadows in the distance. It would be cool, but it brings along some complications with cascade snapping, and I’m unaware of reliable methods for handling cascade borders in this case, at least not with a box filter. But IMO, this topic is worth looking into.
Just found another thing that contributes to this problem. When Gather4 isn’t used, the code calls the function FetchRowOfThree(), which always offsets towards positive x coordinates. The calling code also always uses positive offsets. This shifts shadows by 1-2 texels (on both axes), depending on whether PCF 2 or 3 is used.
Forum still broken. Two-factor authentication still not implemented (I have to delete my credit card data each time after making a purchase). Sharp decline in responses from Epic. Marketplace quality control non-existent. Oh, and how about a good dynamic lighting solution? Yeah, no…
And yes, if you keep ignoring a thread where a ton of different users describe how negatively such a problem is impacting their projects, and there is ZERO response from Epic, then I think it is safe to say that they don’t give a **** about this.
I’m not saying they don’t care about the engine or community in general, but it’s clear there is a decline and it makes me sad to see this.
The calling code adds offsets to the FetchRowOfFour call, so it does not use only positive offsets.
For the 4x4 filter, the sample center is offset by (-1,-1). It matches the gather code.
For the 7x7 filter, the sample center is offset by (-3,-3). It is half a texel off from the gather code.
I don’t see more than a half-texel shift anywhere.
Yeah, you are right. I tested the code a bit and noticed that the original and my offset version (-1, 0, 1) were both offset by half a texel. The proper fix was to remove the -0.5f bias and use my offsets.
float2 TexelPos = ShadowPosition * Settings.ShadowBufferSize.xy - 0.5f; // bias to be consistent with texture filtering hardware
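If I read the fix right, it amounts to something like this; my paraphrase of the change, not the exact patch:

// Half-texel bias removed; without it the fetches line up with the gather path
float2 TexelPos = ShadowPosition * Settings.ShadowBufferSize.xy;
// ...and the calling code fetches rows at the symmetric offsets -1, 0, +1
// instead of 0, 1, 2, so the filter footprint stays centered on the shaded texel.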