Texture Gaussian Blur shader

fra3point · Oct 20, 2016

Hi, shader gurus!

I'm working on a project named "Mesh2Bump", a small utility that converts a flat, highly detailed 3D model into a bump map. For this purpose, I need to apply a small amount of blur to a runtime-generated texture to smooth the result.

Since the SetPixels functions are slow, I'd rather avoid them. I need a Gaussian blur shader that I can use in a Blit() operation to blur the input texture by a given amount.

Could you help me?

Gistix · Oct 24, 2016

I suppose this project may have everything that you would need.

fra3point · Oct 24, 2016

That example was great, but it uses a kind of multi-tap linear sampling.
What I need is the same as Photoshop's (or GIMP's) Gaussian blur.
It lets you specify a radius in pixels, starting from 1 px.

I think I should use a convolution algorithm with a Gaussian kernel, but I have no idea how to use it inside a shader.

bgolus · Oct 24, 2016

There are a ton of examples across the internet of doing Gaussian (or Gaussian-like) blurs of many kinds. The down-sampled, multi-pass, separable linear sampling used by that example is essentially the "best" from an optimization standpoint if you want to get large blurs quickly. With older hardware, doing high-quality dynamically sized blurs was difficult since you couldn't easily do dynamically sized loops, but with DX11 hardware it's pretty easy. So if you don't care about speed that much, you can calculate the blur kernel weights and the number of taps needed and set those on the material dynamically.
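As a rough illustration of "calculate the blur kernel weights and the number of taps", here is a minimal sketch in plain C#. The class name `GaussianKernel`, the method `Build`, and the choice of radius = ceil(3 * sigma) are assumptions made for this example and are not from the thread. It builds weights for one axis only, relying on the separability mentioned above (one horizontal pass, then one vertical pass).

```csharp
using System;

public static class GaussianKernel
{
    // Builds normalized 1D Gaussian weights for pixel offsets -radius..+radius.
    // Assumption: radius = ceil(3 * sigma), which covers roughly 99.7% of the curve.
    public static float[] Build(float sigma)
    {
        if (sigma <= 0f) throw new ArgumentOutOfRangeException("sigma");

        int radius = (int)Math.Ceiling(3.0 * sigma);
        var weights = new float[2 * radius + 1];
        double sum = 0.0;

        for (int i = -radius; i <= radius; i++)
        {
            // Unnormalized Gaussian evaluated at integer offset i
            double w = Math.Exp(-(i * i) / (2.0 * sigma * sigma));
            weights[i + radius] = (float)w;
            sum += w;
        }

        // Normalize so the weights sum to 1 and the blur keeps overall brightness
        for (int i = 0; i < weights.Length; i++)
            weights[i] = (float)(weights[i] / sum);

        return weights;
    }
}
```

The resulting array could then be handed to the shader, for example through `Material.SetFloatArray`, with the tap count passed as a separate property. The shader side of that is not shown here.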

If you absolutely need DX9 support, you pretty much just make one shader that does a fixed number of "taps" whose range you scale up to a known maximum pixel range (usually the number of taps, though some people will go to 2x that), and run it multiple times for larger blurs. Unity's older MobileBlur.shader does just that.

fra3point · Oct 31, 2016

@bgolus I don't need speed since this is an Editor system and it's executed only a few times, so dynamic loops won't be a big problem.

For now I wrote this simple convolution shader. It just has a static-ish Gaussian kernel, with the kernel weights and their relative offsets stored as 5x5 matrices.

Code (csharp):

Shader "Blur" {
    Properties {
        _MainTex("Base (RGB)", 2D) = "white" { }
    }
    SubShader {
        ZTest Always Cull Off ZWrite Off Fog { Mode Off }
        Pass {
            CGPROGRAM
            #pragma vertex vert
            #pragma fragment frag
            #include "UnityCG.cginc"

            sampler2D _MainTex;
            float4 _MainTex_ST;
            float4 _MainTex_TexelSize; // x = 1/width, y = 1/height (set by Unity)

            struct v2f {
                float4 pos : SV_POSITION;
                float2 uv : TEXCOORD0;
            };

            v2f vert(appdata_base v) {
                v2f o;
                o.pos = mul(UNITY_MATRIX_MVP, v.vertex);
                o.uv = TRANSFORM_TEX(v.texcoord, _MainTex);
                return o;
            }

            fixed4 frag(v2f i) : COLOR {
                // Size of one texel in UV space
                float step_w = _MainTex_TexelSize.x;
                float step_h = _MainTex_TexelSize.y;

                // UV offsets of the 5x5 neighborhood, row by row
                float2 offset[25] = {
                    float2(-step_w*2.0, -step_h*2.0), float2(-step_w, -step_h*2.0), float2(0.0, -step_h*2.0), float2(step_w, -step_h*2.0), float2(step_w*2.0, -step_h*2.0),
                    float2(-step_w*2.0, -step_h), float2(-step_w, -step_h), float2(0.0, -step_h), float2(step_w, -step_h), float2(step_w*2.0, -step_h),
                    float2(-step_w*2.0, 0.0), float2(-step_w, 0.0), float2(0.0, 0.0), float2(step_w, 0.0), float2(step_w*2.0, 0.0),
                    float2(-step_w*2.0, step_h), float2(-step_w, step_h), float2(0.0, step_h), float2(step_w, step_h), float2(step_w*2.0, step_h),
                    float2(-step_w*2.0, step_h*2.0), float2(-step_w, step_h*2.0), float2(0.0, step_h*2.0), float2(step_w, step_h*2.0), float2(step_w*2.0, step_h*2.0)
                };

                // Gaussian weights matching the offsets above (they sum to ~1)
                float kernel[25] = {
                    0.003765, 0.015019, 0.023792, 0.015019, 0.003765,
                    0.015019, 0.059912, 0.094907, 0.059912, 0.015019,
                    0.023792, 0.094907, 0.150342, 0.094907, 0.023792,
                    0.015019, 0.059912, 0.094907, 0.059912, 0.015019,
                    0.003765, 0.015019, 0.023792, 0.015019, 0.003765
                };

                // Weighted sum of the 25 neighboring samples
                float4 sum = float4(0.0, 0.0, 0.0, 0.0);
                for (int j = 0; j < 25; j++) {
                    float4 tmp = tex2D(_MainTex, i.uv + offset[j]);
                    sum += tmp * kernel[j];
                }
                return sum;
            }
            ENDCG //Shader End
        }
    }
}

It works well, but it only produces a small blur that can't be tweaked.
I tried to run it in multiple Blit() calls to repeat the effect and get larger blurs, but it didn't work.
It seems that using the same render texture as both source and destination in a Blit doesn't work, so I use a temporary render texture to store the intermediate result of each iteration. Here's the code:

Code (CSharp):

RenderTexture Blur(RenderTexture source, int iterations) {
    RenderTexture result = source; //result will store partial results (blur iterations)
    Material mat = new Material(Shader.Find("Blur")); //create blur material
    RenderTexture blit = RenderTexture.GetTemporary((int)resolution, (int)resolution); //get temp RT
    for (int i = 0; i < iterations; i++) {
        Graphics.SetRenderTarget(blit);
        GL.Clear(true, true, Color.black); //avoid artifacts in temp RT by clearing it
        Graphics.Blit(result, blit, mat); //PERFORM A BLUR ITERATION
        result= blit; //overwrite partial result
    }
    RenderTexture.ReleaseTemporary(blit);
    return result; //return the last partial result
}

This code produces the same result as a single Blit() call, and I don't know why.
While I try to implement dynamically sized kernels, can someone clear this up for me?

Thank you for your help!

bgolus · Oct 31, 2016

result= blit; //overwrite partial result

That line right there is the problem. Doing result = blit is setting the result to be the same texture as blit, not copying the data from blit texture to the result texture. From that point on every iteration is reading and writing to the same texture, which isn't allowed* and will result in the Blit() doing nothing.

There are two main ways to handle this: either use the "ping pong" approach, where you swap the render textures back and forth and copy (with another blit) the result back to the result texture if you have an even number of iterations, or create a new temp buffer every iteration to write to and copy that (again, with another blit) to the output. The ping pong method is more efficient, but can make for confusing code. A new buffer every iteration is fairly straightforward and isn't really that much slower. See the Blur.cs from the Standard Assets Image Effects ("Image Effects/Blur/Blur") for an example of the latter.

* Using a RWTexture2D you can read and write to a texture in the same shader, but it's mainly for use with compute shaders, the built in Blit() doesn't support them, and doing anything but writing to the same single pixel you read from (unlike a blur which reads from many pixels and writes to one) has "undefined" results which basically means you get junk.
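To make the difference between "assigning a reference" and "swapping buffers" concrete, here is a minimal sketch of the ping pong pattern on plain float arrays, so it runs outside Unity. Everything in it is an assumption made for illustration: the `PingPong` class, the 3-tap box filter standing in for the blur shader, and the arrays standing in for render textures. In Unity, `Pass` would correspond to a Graphics.Blit() with the blur material.

```csharp
using System;

public static class PingPong
{
    // One blur pass: reads only from src and writes only to dst
    // (3-tap box filter with clamped edges, a stand-in for the blur shader).
    static void Pass(float[] src, float[] dst)
    {
        int n = src.Length;
        for (int i = 0; i < n; i++)
        {
            float left = src[Math.Max(i - 1, 0)];
            float right = src[Math.Min(i + 1, n - 1)];
            dst[i] = (left + src[i] + right) / 3f;
        }
    }

    // Runs `iterations` passes, swapping the two buffers after each pass.
    public static void Blur(float[] data, int iterations)
    {
        float[] read = data;
        float[] write = new float[data.Length]; // stands in for the temporary render texture

        for (int i = 0; i < iterations; i++)
        {
            Pass(read, write);

            // Swap roles: both references stay distinct, no data is copied
            float[] t = read;
            read = write;
            write = t;
        }

        // If the last pass wrote into the temporary buffer, copy it back.
        // In this two-buffer layout that happens after an odd number of passes.
        if (!ReferenceEquals(read, data))
            Array.Copy(read, data, data.Length);
    }
}
```

The key point is that `read` and `write` never refer to the same buffer during a pass. Whether the final copy is needed depends on how many buffers are involved and which one the last pass wrote to.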

fra3point · Nov 1, 2016

Thanks, @bgolus !!! The ping pong approach works well. I switched back to a 3x3 kernel and here's the final code for multiple Blit() iterations:

Code (CSharp):

RenderTexture Blur(RenderTexture source, int iterations) {
    RenderTexture rt = source;
    Material mat = new Material(Shader.Find("Blur"));
    RenderTexture blit = RenderTexture.GetTemporary((int)resolution, (int)resolution);
    for (int i = 0; i < iterations; i++) {
        Graphics.SetRenderTarget(blit);
        GL.Clear(true, true, Color.black);
        Graphics.Blit(rt, blit, mat);
        Graphics.SetRenderTarget(rt);
        GL.Clear(true, true, Color.black);
        Graphics.Blit(blit, rt, mat);
    }
    RenderTexture.ReleaseTemporary(blit);
    return rt;
}

By the way, I was reading the Wikipedia page on Gaussian blur, and these lines caught my eye:

Applying multiple, successive gaussian blurs to an image has the same effect as applying a single, larger gaussian blur, whose radius is the square root of the sum of the squares of the blur radii that were actually applied. (I assume that radius = 2*std. deviation)

Does it mean that using multiple iterations is the same as using a larger kernel?

bgolus · Nov 1, 2016

fra3point said:

Does it mean that using multiple iterations is the same as using a larger kernel?

Yes. That's why so many real time blurs use multiple iterations. There's a cost to doing each pass, but usually far less than the massive number of samples required to do larger blurs.
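The rule quoted from Wikipedia can be turned into a small calculation. The sketch below is plain C#, and the names `BlurMath`, `CombinedSigma`, and `PassesFor` are made up for this example. It is written in terms of standard deviation, but since the quoted rule is stated for radii, the same arithmetic applies to any measure that scales with sigma. For the small truncated kernels used in this thread, the relation should be read as approximate.

```csharp
using System;

public static class BlurMath
{
    // Standard deviation of the single Gaussian blur that is equivalent
    // to applying the given Gaussian blurs one after another.
    public static double CombinedSigma(params double[] sigmas)
    {
        double sumOfSquares = 0.0;
        foreach (double s in sigmas)
            sumOfSquares += s * s;
        return Math.Sqrt(sumOfSquares);
    }

    // Number of passes with a fixed per-pass sigma needed to reach
    // at least targetSigma: n passes give passSigma * sqrt(n).
    public static int PassesFor(double targetSigma, double passSigma)
    {
        double ratio = targetSigma / passSigma;
        return (int)Math.Ceiling(ratio * ratio);
    }
}
```

For example, `BlurMath.CombinedSigma(3, 4)` gives 5, and `BlurMath.PassesFor(4, 1)` gives 16. The pass count grows with the square of the target blur width, so doubling the blur takes four times as many passes. That is one reason down-sampling is often combined with iteration for very large blurs.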
