[ReSTIR+SVGF-Sim]Spatiotemporal Reservoir Resampling With Spatiotemporal Filtering In MiniEngine
Introduction
We implement a GI algorithm combining ReSTIR and spatiotemporal filtering based on Unreal Engine and ReSTIR GI(SiGG 20). At first, we initial the reservoir samples using hardware raytracing at half resolution. Then we perform resampled importance sampling (RIS) to draw samples generated from sample importance resampling (SIR). To reduce time complexity O(M) and space complexity O(M), we use weighted reservoir sampling (WRS) to improve performance. After that, we reuse reservoir samples spatially and generate diffuse/specular lighting based on those samples. Finally, we perform temporal filter and spatial filter (joint bilateral filter) to denoise the indirect lighting results.
[ReSTIR]Spatiotemporal Reservoir Resampling
Theory
There are two basic problems in the Monte Carlo integration of rendering equation to reduce the errors and improve the sampling efficiency:
1.Given the limited sample number, find a distribution similar to the function f(x) in the integrand, making the samples more likely to be taken when the magnitude of the integrand is relatively large. In orther words, take the samples that are more importance.
2.How to draw random samples from the chosen probability distribution found in problem 1.
It’s really hard to take a sample from the PDF that is proportional to L(wi)fr(wi,wo)cos(θi) for traditional sampling method (such as inverse sampling),since it requires us to evaluate the integral at first, which becomes a chicken and egg problem. The sample importance resampling algorithm (SIR) described below solves this problem.
Sample Importance Resampling (SIR)
SIR Workflow
Since drawing samples from the target PDF ptarget(x) proportional to L(wi)fr(wi,wo)cos(θi) is hard, SIR takes samples from a more simple PDF called proposal PDF pproposal(x). Here is the basic workflow of the SIR:
1.Take M samples from proposal PDF pproposal(x), named as X=x1,x2,...,xm. In our implementation, we use uniform sample hemisphere sampling to generate the samples, which means:
pproposal(x)=2π1
2.For each sample taken from X, assign a weight for the sample, which is:
w(x)=pproposal(x)ptarget(x).
And
ptarget(x)=luminance(L(wi))
3.Next, draw a sample from X, also named as target sample in this post, from this proposal sample set of M. The probability of each sample being picked is proportional to its weight, which means:
p(xz∣X)=Σi=0Mw(xi)w(xz)
The PDF of drawing a sample from the whole sample space is called SIR PDF.
Suppose the event Ez is sample z drawn from the sample set X and the event EX is sample z existing in the sample set X.
p(Ez∣EX)=p(EX)p(Ez∩EX)
Where
p(Ez∣EX)=p(xz∣X)=Σi=0Mw(xi)w(xz)
And the SIR PDF is:
psir(x)=p(Ez∩EX)
p(Ez∩EX)=p(Ez∣EX)p(EX)
Draw a sample from the whole sample space, the probability of this sample being equal to z is pproposal(z). The probability that sample z exists in sample set X is:
RIS uses importance sampling to draw samples from SIR. It should be noted that the SIR, similar to the inverse mapping, is a sampling distribution mapping instead of importance sampling !!! The RIS workflow is the same as Importance sampling:
1.Generate M samples by SIR as described in the SIR part. 2.Draw N samples from M samples generated by SIR using importance sampling from the SIR distribution.
The integration formula is shown below, which is the same as importance sampling:
IRISN,M=N1Σi=1Npsir(xi)f(xi)
Replace the SIR PDF with the formula derived before:
IRISN,M=N1Σi=1Npsir(xi)f(xi)
=N1Σi=1N(ptarget(x)f(xi)(M1Σp=0Mw(xip)))
When N = 1, it becomes:
IRIS1,M=ptarget(x)f(xi)(M1Σp=0Mw(xip))
,Which is used in ReSTIR.
Weighted Reservoir Sampling(WRS)
RIS requires us to pick a sample (N = 1) from M proposal samples generated by SIR, which causes two issues:
1.Requires O(M) space for storing proposal samples. 2.The time complexity of picking up a sample is O(M).
Weighted reservoir sampling is an elegant algorithm that draws a sample from a stream of samples without pre-existed knowledge about how many samples are coming. Each sample in the stream would stand a chance that is proportional to its weight to be picked as the winner.
Assume we have n samples xi∈[1,n]. Each sample weighs w(xi) and its probability is p(xi). In ReSTIR, WRS temporally iterates the samples in the sample stream, picking and storing a sample randomly with the probability p(xi).
p(xi)=Σj=1nw(xj)w(xi)
According to the above formula, WRS stores two variables: Current picked sample(w(xi)) Total processed weight(Σj=1nw(xj))
The Workflow of WRS
Assume we have iterated k−1(k∈[1,n]), the iterating result is stored in a structure named reservoir containing the following members: 1.The sample that is picked after (k-1) times of iteration. y=xt,t∈[1,k−1] 2.The sum of the weight.wsum=Σi=1k−1w(xi)
For an incoming sample in iteration k, we perform the following steps: 1.Generate a random number(r) between 0 and 1. 2.If r>wsum+w(xk)w(xk), the incoming sample will be ignored. 3.If r<=wsum+w(xk)w(xk), the new sample will be picked as the winner replacing the previously picked sample, if existed 4.Regardless, the total weight will be updated.wsum=wsum+w(xk)
Prof of Picking Probability
Assuming K samples have been processed, we want to prove that the probability of picking sample k-1 is euqal to the probability formula.
If sample K is ignored, the probability of sample K-1 being picked is p1=Σi=1k−1w(xi)w(xk−1). If sample K is picked, the probability of sample K being picked is p1, which means the probability of K-1 being ignored is p2=Σi=1kw(xi)w(xk).
The probability of sample K-1 being picked when we process sample K is:
Our implementation is based on ReSTIR GI (SIGG 2022) and Unreal Engine, which provides a detailed description.
The first phase of our algorithm generates a new sample point for each visible point. Our implementation takes as input a G-buffer with the visible point’s position and surface normal in each pixel, though it could also easily be used with ray-traced primary visibiity
For each pixel q with corresponding visible point xv, we sample a direction ωi using the source PDF pq(ωi) and trace a ray to obtain the sample point xs. The source PDF may be a uniform distribution, a cosine-weighted distribution, or a distribution based on the BSDF at the visible point.
Spatial-temporal blue noise (STBN) is useful for providing per-pixel random values to make noise patterns in renderings. It could generate a uniform random sequence in any direction, whether spatial or temporal. We use STBN to generate random rays due to its excellent properties.
1 2 3 4 5 6 7
float2 E = GetBlueNoiseVector2(stbn_vec2_tex3d, reservoir_coord, g_current_frame_index); float4 hemi_sphere_sample = UniformSampleHemisphere(E);
we use uniform sample hemisphere sampling to generate the samples, which means the proposal pdf is 2π1
To retrieve mesh data during ray tracing, we use bindless ray tracing to obtain hit point surface information. A mesh instance ID assigned during top-level acceleration structure construction.
Store the offsets of GPU resources, including index buffer offset, vertex attributes, material index and so on, for each mesh in a global byteaddress buffer.
Retieval the surface attributes, including texture UV, vertex normal and so on, from the GPU resource based on the offset information stored on a prepared GPU buffer described above that is indexed by Instance ID.
Here is the structure of the sample in the ReSTIR GI paper.
Our implementation is based on Unreal, which is different from the original paper: 1.There is no need to store the position and the normal of the visible point since they could be obtained from the GBuffer. 2.We store the ray direction and hit distance rather than hit position which could be calculated from the direction and hit distance. 3.We use STBN to store the random value, which means there is no need to store it.
After the fresh initial sample is taken, spatial and temporal resapling is applied. The target function ptarget()=Li(xv,wi)f(wo,wi)cosθ=Lo(xs,−wi)f(wo,wi)cosθ includes the effect of the BSDF and cosine factor at the visible point, though we have also found that the simple target function ptarget()=Li(xv,wi)works well. While it is a suboptimal target function for a single pixel, we have found that it is helpful for spatial resampling in that it preserves samples that may be effective at pixels other than the one that initially generated it.
The original paper ignore the BSDF and cos factor when calculate the tatget PDF. Since it “works well”, we ignore those factors as well and use the luminance of the radiance as the target PDF.
1 2
float target_pdf = Luminance(radiance); float w = target_pdf / initial_sample.pdf;
After that, update the reservoir information. Since we only have one sample obtained from the ray tracing, meaning wnew/wsum=1, current sample will be stored in the reservoir absolutely. Below is the reservoir structure.
It contains 4 members: 1.m_sample: Reservoir sample picked up by WRS 2.weight_sum: the weight sum of the proposal samples being processed. (used for WRS) 3.M: the number of proposal samples. 4.inverse_sir_pdf: the inverse of the SIR PDF, which will be used for spatial sample reuse.
After initial samples are generated, temporal resampling is applied. In this stage, for each pixel, we read the sample from the initial sample buffer, and use it to randomly update temporal reservoir,computing the RIS weight.
At first, reproject the sample into the previous frame.
The theory behind it is described in the WRS part. In fact, picking a sample from the proposal samples (the first part of the below figure) is the same as the WRS update method (the second part of the below figure) in temporal.
Result
Reservoir picked sample radiance(before temporal resuing VS after temporal reuse):
Spatial Resampling
Before talking about the real algorithm used in our implementation, we will describe the unbiased reservoir merge at first, which is not used due to performance costs.
What we will be discussing is the fourth line. The weight used to combine two reservoirs is not the wsum of other reservoirs, since the target PDF may be different from the current PDF.
For example, we use the radiance luminance as the target PDF. If the neighboring pixel’s reservoir is not visible, its target PDF is 0. Assume the current pixel’s reservoir is visible and merge the neighbor reservoir, It introduces bias.
As described in the above algorithm, for a neighbor reservoir prepared to merge, we use its proposal PDF but replace its target PDF with the current pixel’s.
Assume we only consider the sample picked from the neighbor reservoir, which means the number of proposal sample in neighbor reservoir is 1. (M = 1). In SIR part, We have proven that the SIR PDF is euqal to proposal PDF when M = 1.
However, there is still a difference in comparison with the original algorithm, which is M. In this blog, the author explains it as a scaling factor. I don’t really understand it. You can get more details on that blog if you are interested.
Above is the real spatial algorithm we used in our implementation. The major difference is that the target PDF is not replaced with the current pixel’s target PDF, which induces bias.
There are 3 methods to reduce the bias: 1.Geometric similarity test(6th line). if similarity is lower than the given threshold, skip it.
2.Sample the shadow map to calculate the visibility term, since the original paper points out that the bias is mainly in shadow areas due to visibility changes. We haven’t implemented it.
3.For spatial reservoirs, only operate on temporal reservoirs from nearby pixels and not spatial reservoirs, which is described in the original paper.
Another problems:
With spatial reuse, it is necessary to account for differences in the source PDF between pixels that are due to the fact that our sampling scheme is based on the visible point’s position and surface normal.(Such a correction was not necessary in the original ReSTIR algorithm since it sampled lights directly without considering each pixel’s local geometry.) Therefore, when we reuse a sample from a pixel q at a pixel r, we must transform its solid angle PDF to current pixel’s solid angle space by dividing it by the Jacobian determinant of the corresponding transformation
Diffuse Indirect Lighting(Scaled) and Specular Indirect Lighting(Scaled)
Spatiotemporal Filtering
Following ReSTIR, we apply spatiotemporal filtering based on the UnrealEngine, which is similar to SVGF. If you want to dive deeper into SVGF, you can read my Chinese blog related to it.
Temporal Filtering
Temporal filtering is simple. Reproject the sample into the previous frame and lerp between two frames. And we should perform a similarity test which is similar to TAA.
In the spatial filter pass, we apply a joint bilateral filter pass to the result from the temporal filter pass. I would not detail it. If you are interested in it, my blog(GI Baking With Ray Guiding) decribes some denoise method including joint bilateral filter.