(208,15): error X5608: Compiled shader code uses too many arithmetic instruction slots (77). Max. allowed by the target (ps_2_0) is 64.
可以参考:http://msdn.microsoft.com/en-us/library/bb509626%28v=vs.85%29.aspx
我碰到的错误来自ps_2_0,所以选择了Shader Model 2-》ps_2_0 Instructions
ps_2_0 Instructions
This section contains reference information for the pixel shader version 2_0 instructions.
There are several types of pixel shader instructions, as shown in the table. Columns to the right mean the following:
- Instruction slots - Number of instruction slots used by each instruction.
- Setup - A pixel shader must have a version instruction and it must be the first instruction.
- Arithmetic - These instructions provide the mathematical operations in a shader.
- Texture - These instructions are used to load and sample texture data, and to modify texture coordinates.
- New - These instructions are new to this version.
Instruction Set
Name | Description | Instruction slots | Setup | Arithmetic | Texture | New |
---|---|---|---|---|---|---|
abs - ps | Absolute value | 1 | x | x | ||
add - ps | Add two vectors | 1 | x | |||
cmp - ps | Compare source to 0 | 1 | x | |||
crs - ps | Cross product | 2 | x | x | ||
dcl_samplerType (sm2, sm3 - ps asm) | Declare the texture dimension for a sampler | 0 | x | x | ||
dcl - (sm2, sm3 - ps asm) | Declare the association between vertex shader output registers and pixel shader input registers. | 0 | x | x | ||
def - ps | Define constants | 0 | x | |||
dp2add - ps | 2D dot product and add | 2 | x | x | ||
dp3 - ps | 3D dot product | 1 | x | |||
dp4 - ps | 4D dot product | 1 | x | |||
exp - ps | Full precision 2x | 1 | x | x | ||
frc - ps | Fractional component | 1 | x | x | ||
log - ps | Full precision log2(x) | 1 | x | x | ||
lrp - ps | Linear interpolate | 2 | x | |||
m3x2 - ps | 3x2 multiply | 2 | x | x | ||
m3x3 - ps | 3x3 multiply | 3 | x | x | ||
m3x4 - ps | 3x4 multiply | 4 | x | x | ||
m4x3 - ps | 4x3 multiply | 3 | x | x | ||
m4x4 - ps | 4x4 multiply | 4 | x | x | ||
mad - ps | Multiply and add | 1 | x | |||
max - ps | Maximum | 1 | x | x | ||
min - ps | Minimum | 1 | x | x | ||
mov - ps | Move | 1 | x | |||
mul - ps | Multiply | 1 | x | |||
nop - ps | No operation | 1 | x | |||
nrm - ps | Normalize | 3 | x | x | ||
pow - ps | xy | 3 | x | x | ||
ps | Version | 0 | x | |||
rcp - ps | Reciprocal | 1 | x | x | ||
rsq - ps | Reciprocal square root | 1 | x | x | ||
sincos - ps | Sine and cosine | 8 | x | x | ||
sub - ps | Subtract | 1 | x | |||
texkill - ps | Kill pixel render | 1 | x | |||
texld - ps_2_0 and up | Sample a texture | 1 | x | x | ||
texldb - ps | Texture sampling with level-of-detail bias from w-component | 1 | x | x | ||
texldp - ps | Texture sampling with projective divide by w-component | 1 | x | x |
Send comments about this topic to Microsoft
Build date: 9/6/2011
http://en.wikipedia.org/wiki/High_Level_Shader_Language
Shader model comparison
[edit] Pixel shader comparison
Pixel shader version | 1.0 to 1.3[2] | 1.4[2] | 2.0[2][3] | 2.0a[2][3] | 2.0b[2][3] | 3.0[2][4] | 4.0[5] |
---|---|---|---|---|---|---|---|
Dependent texture limit | 4 | 6 | 8 | Unlimited | 8 | Unlimited | Unlimited |
Texture instruction limit | 4 | 6*2 | 32 | Unlimited | Unlimited | Unlimited | Unlimited |
Position register | No | No | No | No | No | Yes | Yes |
Instruction slots | 8+4 | 8+4 | 32 + 64 | 512 | 512 | ≥ 512 | ≥ 65536 |
Executed instructions | 8+4 | 6*2+8*2 | 32 + 64 | 512 | 512 | 65536 | Unlimited |
Texture indirections | 4 | 4 | 4 | Unlimited | 4 | Unlimited | Unlimited |
Interpolated registers | 2 + 8 | 2 + 8 | 2 + 8 | 2 + 8 | 2 + 8 | 10 | 32 |
Instruction predication | No | No | No | Yes | No | Yes | No |
Index input registers | No | No | No | No | No | Yes | Yes |
Temp registers | 2 | 6 | 12 to 32 | 22 | 32 | 32 | 4096 |
Constant registers | 8 | 8 | 32 | 32 | 32 | 224 | 16x4096 |
Arbitrary swizzling | No | No | No | Yes | No | Yes | Yes |
Gradient instructions | No | No | No | Yes | No | Yes | Yes |
Loop count register | No | No | No | No | No | Yes | Yes |
Face register (2-sided lighting) | No | No | No | No | No | Yes | Yes |
Dynamic flow control | No | No | No | No | No | 24 | Yes |
Bitwise Operators | No | No | No | No | No | No | Yes |
Native Integers | No | No | No | No | No | No | Yes |
- PS 2.0 = DirectX 9.0 original Shader Model 2 specification.
- PS 2.0a = NVIDIA GeForce FX-optimized model.
- PS 2.0b = ATI Radeon X700, X800, X850 shader model, DirectX 9.0b.
- PS 3.0 = Shader Model 3.
- PS 4.0 = Shader Model 4.
"32 + 64" for Executed Instructions means "32 texture instructions and 64 arithmetic instructions."
[edit] Vertex shader comparison
Vertex shader version | VS 1.1[6] | VS 2.0[3][6] | VS 2.0a[3][6] | VS 3.0[4][6] | VS 4.0[5] |
---|---|---|---|---|---|
# of instruction slots | 128 | 256 | 256 | ≥ 512 | 4096 |
Max # of instructions executed | Unknown | 65536 | 65536 | 65536 | 65536 |
Instruction predication | No | No | Yes | Yes | Yes |
Temp registers | 12 | 12 | 13 | 32 | 4096 |
# constant registers | ≥ 96 | ≥ 256 | ≥ 256 | ≥ 256 | 16x4096 |
Static Flow Control | ??? | Yes | Yes | Yes | Yes |
Dynamic Flow Control | No | No | Yes | Yes | Yes |
Dynamic Flow Control Depth | No | No | 24 | 24 | Yes |
Vertex Texture Fetch | No | No | No | Yes | Yes |
# of texture samplers | N/A | N/A | N/A | 4 | 128 |
Geometry instancing support | No | No | No | Yes | Yes |
Bitwise Operators | No | No | No | No | Yes |
Native Integers | No | No | No | No | Yes |
- VS 2.0 = DirectX 9.0 original Shader Model specification.
- VS 2.0a = NVIDIA GeForce FX-optimized model.
- VS 3.0 = Shader Model 3.
- Note that ATI X1000 series cards (e.g. X1900) does not support Vertex Texture Fetch, hence it does not fully comply with the VS 3.0 model. Instead, they offer a feature called "Render to Vertex Buffer (R2VB)" that provides functionality that is a superset of Vertex Texture Fetch.[citation needed]
- VS 4.0 = Shader Model 4.