saturate和max #198

qgj · 2017-12-22T08:55:21Z

在基础光照那一章中，我看到计算worldNormal与worldLightDir的时候用的是saturate，而后面章节使用的是max（0，…），这两个方法在计算结果上会产生什么差异吗？

candycat1992 · 2017-12-22T09:47:48Z

对于那些值范围在-1到1的变量，两个方法在计算结果上没有区别。不过，saturate在Unity里通常会编译成两个指令，大致是min(max(val, 0.0), 1.0)，如果自己知道值的范围可以直接使用max来节省一条运算指令。

qgj · 2017-12-25T01:49:00Z

好的谢谢，还有一个问题，在求tangentNormal.z的时候dot(tangentNormal.xy,tangentNormal.xy) 这个操作不是很理解

candycat1992 · 2017-12-25T01:53:43Z

向量自己dot自己就是对自己取平方操作，因为tangentNormal是单位向量，所以根据xy分量来求z分量。

qgj · 2017-12-25T01:56:36Z

哦哦好的，十分感谢

alasja · 2018-03-17T08:38:27Z

我在Unity里面，点Shader属性中 “Compile and show code”按钮，看到

u_xlat0.x = dot(u_xlat1.xyz, u_xlat0.xyz);
u_xlat0.x = clamp(u_xlat0.x, 0.0, 1.0);

对应的是 saturate(dot(lightDir, worldNormal)); 。所以应该不是解析成 min(max(val, 0.0), 1.0)吧？

因为今天看到《Real-time Rendering 3rd》5.5节 115页中提到

staturate is faster than the more general max function on most hardware.

所以有点奇怪。

candycat1992 · 2018-03-19T04:41:52Z

@alasja 我之前的回答的确说得很模糊，这里补充下。

在Unity里，同一份ShaderLab代码在不同目标平台上、甚至不同设备上编译结果都是不同的。比如同样一份fragment shader：

fixed4 frag (v2f i) : SV_Target
{
	fixed4 col = i.uv.xyxy;
	col = saturate(col);
	return col;
}

在DX9下，编译出来就是：

-- Fragment shader for "d3d9":
// Stats: 1 math
Shader Disassembly:
//
// Generated by Microsoft (R) HLSL Shader Compiler 10.1
    ps_3_0
    dcl_texcoord_pp v0.xy
    mov_sat_pp oC0, v0.xyxy

// approximately 1 instruction slot used

DX11下就是：

-- Fragment shader for "d3d11":
Shader Disassembly:
//
// Generated by Microsoft (R) D3D Shader Disassembler
//
//
// Input signature:
//
// Name                 Index   Mask Register SysValue  Format   Used
// -------------------- ----- ------ -------- -------- ------- ------
// TEXCOORD                 0   xy          0     NONE   float   xy  
// SV_POSITION              0   xyzw        1      POS   float       
//
//
// Output signature:
//
// Name                 Index   Mask Register SysValue  Format   Used
// -------------------- ----- ------ -------- -------- ------- ------
// SV_Target                0   xyzw        0   TARGET   float   xyzw
//
      ps_4_0
      dcl_input_ps linear v0.xy
      dcl_output o0.xyzw
   0: mov_sat o0.xyzw, v0.xyxy
   1: ret 
// Approximately 0 instruction slots used

OpenGL ES2.0下是：

#ifdef FRAGMENT
varying highp vec2 xlv_TEXCOORD0;
void main ()
{
  lowp vec4 col_1;
  highp vec4 tmpvar_2;
  tmpvar_2 = xlv_TEXCOORD0.xyxy;
  col_1 = tmpvar_2;
  lowp vec4 tmpvar_3;
  tmpvar_3 = clamp (col_1, 0.0, 1.0);
  col_1 = tmpvar_3;
  gl_FragData[0] = tmpvar_3;
}

OpenGLES 3.0下是：

#ifdef FRAGMENT
#version 300 es

precision highp int;
in highp vec2 vs_TEXCOORD0;
layout(location = 0) out mediump vec4 SV_Target0;
void main()
{
    SV_Target0 = vs_TEXCOORD0.xyxy;
#ifdef UNITY_ADRENO_ES3
    SV_Target0 = min(max(SV_Target0, 0.0), 1.0);
#else
    SV_Target0 = clamp(SV_Target0, 0.0, 1.0);
#endif
    return;
}

#endif

Metal下是：

-- Fragment shader for "metal":
Shader Disassembly:
#include <metal_stdlib>
#include <metal_texture>
using namespace metal;
struct Mtl_FragmentIn
{
    float2 TEXCOORD0 [[ user(TEXCOORD0) ]] ;
};

struct Mtl_FragmentOut
{
    float4 SV_Target0 [[ color(0) ]];
};

fragment Mtl_FragmentOut xlatMtlMain(
    Mtl_FragmentIn input [[ stage_in ]])
{
    Mtl_FragmentOut output;
    output.SV_Target0 = input.TEXCOORD0.xyxy;
    output.SV_Target0 = clamp(output.SV_Target0, 0.0f, 1.0f);
    return output;
}

Vulkan太长了我就不粘贴了……

你可以看出来在一些平台下，比如DX9、DX11下，编译出来的确是使用真正的saturate运算操作，而在大多数移动平台上，比如ES 2.0、Metal等，是编译成了clamp，而在ES 3.0下跟设备还有关系。至于为什么要分这么多种情况，Unity应该是根据各个平台每种操作耗时的平衡，选择同等运算结果下最优的等价操作来代替saturate。

candycat1992 closed this as completed Jan 31, 2018

candycat1992 added the enhancement label Mar 19, 2018

candycat1992 reopened this Mar 19, 2018

candycat1992 closed this as completed May 4, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

saturate和max #198

saturate和max #198

qgj commented Dec 22, 2017

candycat1992 commented Dec 22, 2017

Uh oh!

qgj commented Dec 25, 2017

Uh oh!

candycat1992 commented Dec 25, 2017

Uh oh!

qgj commented Dec 25, 2017

Uh oh!

alasja commented Mar 17, 2018 •

edited

Loading

Uh oh!

candycat1992 commented Mar 19, 2018 •

edited

Loading

Uh oh!

saturate和max #198

saturate和max #198

Comments

qgj commented Dec 22, 2017

candycat1992 commented Dec 22, 2017

Uh oh!

qgj commented Dec 25, 2017

Uh oh!

candycat1992 commented Dec 25, 2017

Uh oh!

qgj commented Dec 25, 2017

Uh oh!

alasja commented Mar 17, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

candycat1992 commented Mar 19, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alasja commented Mar 17, 2018 •

edited

Loading

candycat1992 commented Mar 19, 2018 •

edited

Loading