Commits · useful · John Stultz / mesa

Apr 14, 2011

ARB prog parser: Compile lexer as C++ · d6db9143
Ian Romanick authored 14 years ago

useful

d6db9143

ARB prog parser: Compile parser as C++ · 9c0b233c

Ian Romanick authored 14 years ago

This is in anticipation of generating GLSL IR from the parser.

The C++ rules for enums vs. ints are just plain broken.

9c0b233c

glsl: Make bvec types accessible · 364aa728
Ian Romanick authored 14 years ago

364aa728
glsl: Add glsl_type::get_sampler_instance method · d290ec22
Ian Romanick authored 14 years ago

d290ec22

glsl: Add ir_unop_lit · c0906e06

Ian Romanick authored 14 years ago

Like the noise opcode, this opcode really shouldn't exist.  The only
reason it exists is to facilitate translation of assembly shaders to
IR.  These changes include the addition of ir_unop_lit and the
lowering pass that removes it.

c0906e06

glsl: Lower ir_unop_exp to ir_binop_pow · b6c49104

Ian Romanick authored 14 years ago

Instead of lowering e^x to 2^(x * log2(e)), lower it to e^x using the
POW opcode. On GPUs that have a POW instruction, this saves a
multiply (the log2 is already removed by the constant expression evaluator).

b6c49104

Apr 13, 2011

glsl/opt_cpe: Reenable opt_copy_propagation_elements.cpp pass. · 6a35cbb6
Emma Anholt authored 13 years ago

6a35cbb6

glsl/opt_cpe: Fix a crash when a kill kills for two reasons. · 909bd476

Emma Anholt authored 13 years ago


Fixes glsl-copy-propagation-loop-2 when this optimization pass is
re-enabled.

Reported-by: David Lamparter <equinox@diac24.net>

909bd476

glsl/opt_cpe: Kill when the assignment isn't something we recognize. · 487debfd

Emma Anholt authored 13 years ago

A few GLES2 tests tripped over this when using array dereferences to
hit channels on the LHS (see piglit test
glsl-copy-propagation-vector-indexing).  We wouldn't find the
ir_dereference_variable, and assume that that meant that it wasn't an
assignment to a scalar/vector, and thus not notice that the variable
had been changed.

487debfd

svga: defined QSZ in terms of SVGA3D_MAX_DRAW_PRIMITIVE_RANGES · b9c8b2a1
Brian Paul authored 13 years ago

b9c8b2a1
svga: define SVGA3D_MAX_DRAW_PRIMITIVE_RANGES and update comments · 32aab51d
Brian Paul authored 13 years ago

32aab51d
st/mesa: minor clean-ups in update_textures() · 4cbb261e
Brian Paul authored 13 years ago

4cbb261e
mesa: 80-column wrapping and whitespace fixes · 032a7ef0
Brian Paul authored 13 years ago

032a7ef0
mesa: fix some comments in sampler object code · 75d585e5
Brian Paul authored 13 years ago

75d585e5

i965: Change assertion condition from implicit to explicit · d3cc3901

Lina Versace authored 13 years ago


... because grokking explicit assertions requires fewer neurons.

In brw_misc_state.c:emit_depthbuffer, change assertion condition
    tiling != I915_TILING_X && tiling != I915_TILING_NONE
to
    tiling == I915_TILING_Y

Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
Signed-off-by: Chad Versace <chad.versace@intel.com>

d3cc3901

i965: Define BRW_DEPTHFORMAT_D24_UNORM_X8_UINT · 4d7c1871

Lina Versace authored 13 years ago


This depth format was added in Gen5.

Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
Signed-off-by: Chad Versace <chad.versace@intel.com>

4d7c1871

i965: Document brw_context.state.depth_region · 05173c61

Lina Versace authored 13 years ago


Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
Signed-off-by: Chad Versace <chad.versace@intel.com>

05173c61

i965: Remove unnecessary release/reference of brw_context.state.depth_region · 9949d2a2

Lina Versace authored 13 years ago


Release the old depth region and reference the new one *only* if it has
changed.

Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
Signed-off-by: Chad Versace <chad.versace@intel.com>

9949d2a2

Apr 12, 2011

i965: Add comments about URB size units and limits. · 3f7318c1

Kenneth Graunke authored 13 years ago


Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>
Acked-by: Chris Wilson <chris@chris-wilson.co.uk>

3f7318c1

i965: Never enable the GS on Gen6. · 35b3f597

Kenneth Graunke authored 13 years ago


Prior to Gen6, we use the GS for breaking down quads, quad-strips,
and line loops.  On Gen6, earlier stages already take care of this,
so we never need the GS.

Since this code is likely completely untested, remove it for now.
We can write new code when enabling real geometry shaders.

Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>
Reviewed-by: Eric Anholt <eric@anholt.net>

35b3f597

Revert "i965: Reinstate max-index paranoia" · f703ba8c

Chris Wilson authored 13 years ago


This reverts commit b4cbd2b3.

It looked like a safe sanity check. It missed the issue of the start of
the buffer not being at 0, but even that was not enough to explain why
setting the max vertex index caused glyphs to be dropped from the game
'Achron'.

Instead, the issue appears to be related to the use of the vertex bias
and so we would need to re-emit the max-index every time we adjusted the
bias, so re-emitting the relocations and defeating the original
optimisation.

Reported-and-tested-by: Thomas Jones <thomas.jones@utoronto.ca>
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=35163


Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

f703ba8c

egl/wayland: Update to per-surface frame events · f05751aa
Benjamin Franzke authored 13 years ago

f05751aa
nouveau_vieux: fix build since sampler objects merge · b27f206b
Dave Airlie authored 13 years ago

b27f206b

st/wgl: Prevent spurious framebuffer sizes when the window is minimized. · 16d42af6

Jose Fonseca authored 13 years ago

When the window is minimized GetClientRect will return zeros.

Instead of creating a 1x1 framebuffer, simply preserve the current window
size, until the window is restored or maximized again.

16d42af6

st/wgl: Fix debug output format specifiers of stw_framebuffer_get_size(). · b5829c0d
Jose Fonseca authored 13 years ago

b5829c0d

svga: Rebind framebuffer and tss bindings strictly when necessary. · 6b95cfb0

Jose Fonseca authored 13 years ago

The earlier change to ensure rendertargets and textures are always
rebound at every command buffer start had the downside of making
successive flushes no longer no-ops, as a command buffer with merely
the rebinding commands were being unnecessarily sent to the vGPU.

This change only re-emits the bindings when necessary, by keeping track
of the need to rebind outside of the dirty state update mechanism.

6b95cfb0

texstore: fix regression stricter check for memcpy path for unorm88 and unorm1616 · e338a1b0

Hans de Goede authored 13 years ago

According to https://bugs.freedesktop.org/show_bug.cgi?id=34280


commit 5d1387b2 causes the font corruption
problems people have been seeing under various apps and gnome-shell on r200
cards.

This commit changed (loosened) the check for using the memcpy path in the
former al88 / al1616 texstore functions, which are now also used to
store rg texures. This patch restores the old strict check in case of
al textures. I've no idea why this fixes things, since I don't know the
code in question at all. But after seeing the bisect in bfdo34280 point
to this commit, I gave this fix a try and it fixes the font issues seen on
r200 cards.

[airlied:
r200 has no native working A8, so it does an internal storage format of AL88
however srcFormat == internalFormat == ALPHA when we get to this point,
so it copies, but it wants to store into an AL88 not ALPHA so fails,
I'll also push a piglit test for this on r200].

Many thanks to Nicolas Kaiser who did all the hard work of tracking this down!

Signed-off-by: Hans de Goede <hdegoede@redhat.com>
Signed-off-by: Dave Airlie <airlied@redhat.com>

e338a1b0

ir_to_mesa: silence signed/unsigned comparison warnings · 847f991a
Brian Paul authored 13 years ago

847f991a
configs: add r600 dir to DRI_DIRS · 482a64dc
Brian Paul authored 13 years ago

482a64dc
r600: silence various compiler warnings · 155a9670
Brian Paul authored 13 years ago

155a9670
Merge branch 'arb_sampler_objects' · 1ca55854
Brian Paul authored 13 years ago

1ca55854
Revert "i965: clear global offset to zero in m0.2 for VS DP read." · 2432ca1c
Nan Hai Zou authored 13 years ago
```
This reverts commit 66b66295.
it was already fixed by commit 9d60a7ce
```
2432ca1c

Apr 11, 2011

i965: Remove hint_gs_always and resulting dead code · a7fa203f

Ian Romanick authored 13 years ago


Reviewed-by: Eric Anholt <eric@anholt.net>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>

a7fa203f

intel: Fix ROUND_DOWN_TO macro · 7e809f0b

Ian Romanick authored 13 years ago


Previously the macro would (ALIGN(value - alignment - 1, alignment)).
At the very least, this was missing parenthesis around "alignment -
1".  As a result, if value was already aligned, it would be reduced by
alignment.  Condisder:

     x = ROUND_DOWN_TO(256, 128);

This becomes:

    x = ALIGN(256 - 128 - 1, 128);

Or:

    x = ALIGN(127, 128);

Which becomes:

    x = 128;

This macro is currently only used in brw_state_batch
(brw_state_batch.c).  It looks like the original version of this macro
would just use too much space in the batch buffer.  It's possible, but
not at all clear to me from the code, that the original behavior is
actually desired.

In any case, this patch does not cause any piglit regressions on my
Ironlake system.

I also think that ALIGN_FLOOR would be a better name for this macro,
but ROUND_DOWN_TO matches rounddown in the Linux kernel.

Reviewed-by: Eric Anholt <eric@anholt.net>
Reviewed-by: Keith Whitwell <keithw@vmware.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>

7e809f0b

glsl: Make GL_ARB_shader_stencil_export enable block be similar to other blocks · 03eade16
Ian Romanick authored 13 years ago
```
Tested-by: Brian Paul <brianp@vmware.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
```
03eade16

glsl: Only let a shader enable GL_ARB_draw_instanced if the driver supports it · f2bda1b5

Ian Romanick authored 13 years ago


Also make the GL_ARB_draw_instanced block follow the same pattern as
the other blocks.

Tested-by: Brian Paul <brianp@vmware.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>

f2bda1b5

mesa: fixup r600 DRI driver for sampler object changes · 65b024d6
Brian Paul authored 13 years ago

65b024d6

i965: Move the SF VP from state caching to state streaming. · 88022278

Emma Anholt authored 13 years ago

This is a 49.6% +/- 2.0% (n=9, IPS outlier removed) performance
improvement for the hacked-up-for-cache-misses scissor-many, and no
statistically significant performance difference for the
hacked-up-for-cache-hits version (n=9, IPS outlier removed).  No
statistically significant performance difference from ETQW (n=5) from
these last two commits.

88022278

i965: Change the SF unit from state caching to state streaming. · b1be5bd2

Emma Anholt authored 13 years ago

This is a 28.1% +/- 1.4% (n=10) performance improvement for the
hacked-up-for-cache-misses scissor-many (n=10), and no statistically
significant wall-time performance difference for the
hacked-up-for-cache-hits version (n=9, first outlier in each removed
since IPS was warming up.  User time increased by about 4.7%, but
kernel time decreased equivalently).

b1be5bd2

i965: Turn SF unit and viewport structs into pointers to prep for streaming. · aaf188e3
Emma Anholt authored 13 years ago
```
I wanted to separate this mechanical change from the actual work.
```
aaf188e3

Admin message