May 04, 2019
-
Rob Clark authored
Signed-off-by: Rob Clark <robdclark@chromium.org>
-
Alyssa Rosenzweig authored
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
-
Alyssa Rosenzweig authored
Basically, when the conditions of a csel diverge, we scalarize to avoid going down weird code paths during emit. We could be doing better, but this case can't occur organically from GLSL as far as I can tell, though it does fix lowered atan2.
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
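A hedged NIR-level sketch of the idea (the pass itself lives in the Midgard backend; the builder and variable names here are illustrative):

    /* Scalarize a vector bcsel whose condition diverges per component:
     * select each channel individually, then reassemble the vector. */
    nir_ssa_def *comps[4];
    for (unsigned i = 0; i < num_components; i++) {
        comps[i] = nir_bcsel(&b, nir_channel(&b, cond, i),
                                 nir_channel(&b, if_true, i),
                                 nir_channel(&b, if_false, i));
    }
    nir_ssa_def *result = nir_vec(&b, comps, num_components);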
-
Alyssa Rosenzweig authored
A previous commit by Tomeu aborted RA early, which solves the memory corruption issue, but then generates an incorrect compile. This fixes that.
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
-
Alyssa Rosenzweig authored
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
-
Alyssa Rosenzweig authored
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
-
Alyssa Rosenzweig authored
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
-
Alyssa Rosenzweig authored
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
-
Alyssa Rosenzweig authored
This handles the usual case. 8-bit register access parallels 16-bit access, but with one major caveat: in 8-bit mode, only half of the register file is actually (directly) accessible as sources. In particular, for each 16-bit integer register (hrN), we can only index a *single* 8-bit integer (qrN), corresponding to the lower 8 bits. To get the upper 8 bits, an explicit shift is required. For example, to add the bytes of a 16-bit integer hr0.x and get the result as an 8-bit qr0, you'd need to do something like:

    ilsr hr1.x, hr0.x, #8
    iadd qr0.x, qr0.x, qr1.x

This scheme diverges from 32-bit registers, in that both the upper and lower halves of a 32-bit register are individually accessible as a pair of half registers. For contrast, to add the lower and upper 16 bits of a 32-bit integer r0.x, you can just:

    iadd hr0.x, hr0.x, hr1.x

since hr1.x is the upper 16 bits of r0.x.
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
-
Alyssa Rosenzweig authored
Meanwhile, we're forced to disable dest_override, since it's not yet clear how this interacts with other bitnesses (it'll likely need to be overhauled in any case).
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
-
Alyssa Rosenzweig authored
Per OpenCL.
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
-
Alyssa Rosenzweig authored
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
-
Alyssa Rosenzweig authored
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
-
Alyssa Rosenzweig authored
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
-
Alyssa Rosenzweig authored
We silently ignored certain bits of the mask, which causes issues when disassembling 8/64-bit ops.
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
-
Alyssa Rosenzweig authored
In preparation for 8-bit and 64-bit operands, let's not reinforce the 32-bit-centric biases in the ISA.
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
-
Rob Clark authored
Signed-off-by: Rob Clark <robdclark@chromium.org>
-
Rob Clark authored
Since it is dependent on the tile mode (ie. disabled for smaller mipmap levels), we should handle it in a similar way to fd_resource_level_linear(). The code previously mostly did the right thing because the old helper took the tile mode.
Signed-off-by: Rob Clark <robdclark@chromium.org>
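A hedged sketch of the shape such a helper might take (the fd_resource field names here are assumptions, not the actual freedreno code):

    /* UBWC, like tiling, only applies to levels that are still tiled;
     * smaller mipmap levels fall back to linear and stay uncompressed. */
    static inline bool
    fd_resource_ubwc_enabled(struct fd_resource *rsc, int level)
    {
        return rsc->ubwc_size && rsc->tile_mode &&
               !fd_resource_level_linear(&rsc->base, level);
    }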
-
Rob Clark authored
Best to keep it encapsulated in the helper which returns layer/level offset (and actually use that helper everywhere) rather than spreading the logic around the code. Also add a helper to find UBWC offset, to complete the encapsulation.
Signed-off-by: Rob Clark <robdclark@chromium.org>
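A rough illustration of the encapsulation (the struct layout and field names here are assumptions; the real freedreno helpers may differ):

    /* Offset of (level, layer) within the resource's backing buffer. */
    static inline uint32_t
    fd_resource_offset(struct fd_resource *rsc, unsigned level, unsigned layer)
    {
        const struct fd_resource_slice *slice = &rsc->slices[level];
        return slice->offset + layer * slice->size0;
    }

    /* Companion helper for the UBWC meta data, completing the
     * encapsulation described above. */
    static inline uint32_t
    fd_resource_ubwc_offset(struct fd_resource *rsc, unsigned level, unsigned layer)
    {
        const struct fd_resource_slice *slice = &rsc->ubwc_slices[level];
        return slice->offset + layer * slice->size0;
    }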
-
Rob Clark authored
Small cleanup. They are just an array of data and only ever linear/uncompressed.
Signed-off-by: Rob Clark <robdclark@chromium.org>
-
Rob Clark authored
If someone is importing a buffer, we can't really know the state of its contents, so assume it is valid.
Signed-off-by: Rob Clark <robdclark@chromium.org>
-
Rob Clark authored
There are still some fallbacks we'll need to handle before we can enable UBWC by default. I think we may need to fall back to uncompressed if image atomic operations are used. And we still need to sort out how to handle image and sampler views of compressed resources if the image/sampler view is using a format that does not support compression. (I think the latter should hopefully be uncommon outside of deqp/piglit.) But at least this gets us to the point where supertuxkart works properly with UBWC enabled ;-)
Signed-off-by: Rob Clark <robdclark@chromium.org>
-
Rob Clark authored
A few fixes that get UBWC working for the games/benchmarks where I noticed problems before (in particular manhattan, and stk (modulo image support for UBWC when compute shaders are used for post-process effects)):

 + fix the size of the UBWC meta buffer (ie, the offset to color pixel data) that is returned by ->fill_ubwc_buffer_sizes()
 + correct size/layout for 8 and 16 byte per pixel formats
 + limit the supported formats.. note that not all formats that can be tiled can be compressed

Signed-off-by: Rob Clark <robdclark@chromium.org>
-
Rob Clark authored
Corrects tex state ubwc pitch/size.
Signed-off-by: Rob Clark <robdclark@chromium.org>
-
Rob Clark authored
Signed-off-by: Rob Clark <robdclark@chromium.org>
-
Rob Clark authored
Fixes dEQP-GLES31.functional.ubo.random.all_per_block_buffers.13 and .20. Commit ca3eb5db went from silently truncating the constant state, which was also the wrong thing to do, to an assert, which then showed up in a couple of dEQPs. Actually there is nothing wrong with a larger constant file, so just drop the assert.
Signed-off-by: Rob Clark <robdclark@chromium.org>
-
Karol Herbst authored
Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
-
Karol Herbst authored
With that we can simplify code where NIR vectors are created.

v2: merge both lines in nir_vec

Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
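A hedged sketch of the call-site simplification this enables (assuming the nir_vec(build, comps, num_components) helper introduced here; the surrounding variables are illustrative):

    nir_ssa_def *comps[4] = { x, y, z, w };
    nir_ssa_def *vec;

    /* Before: pick a builder per component count. */
    switch (num_components) {
    case 2: vec = nir_vec2(&b, comps[0], comps[1]); break;
    case 3: vec = nir_vec3(&b, comps[0], comps[1], comps[2]); break;
    case 4: vec = nir_vec4(&b, comps[0], comps[1], comps[2], comps[3]); break;
    }

    /* After: one call regardless of component count. */
    vec = nir_vec(&b, comps, num_components);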
-
Karol Herbst authored
v2: rename to nir_build_alu_src_arr

Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
-
Karol Herbst authored
v2: use vtn_push_ssa and vtn_ssa_value

Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
-
Mathias Fröhlich authored
Now that dlist compilation again knows whether it is inside glBegin/glEnd, we can leave the decision whether aliasing should occur to the vertex attribute setter functions instead of making it at glArrayElement time.
Reviewed-by: Brian Paul <brianp@vmware.com>
Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>
-
Mathias Fröhlich authored
We have to use _mesa_inside_dlist_begin_end instead of _mesa_inside_begin_end to see if we are inside a glBegin/glEnd block in the case of display lists. So split the is_vertex_position function used in vertex attribute processing into an imm and a dlist variant, and use the appropriate _mesa_inside_begin_end variant in each.
Reviewed-by: Brian Paul <brianp@vmware.com>
Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>
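A hedged sketch of the split (the actual Mesa predicates are more involved; this only shows the shape):

    /* Immediate-mode variant: consult the context's begin/end state. */
    static inline bool
    is_vertex_position_imm(const struct gl_context *ctx, GLuint index)
    {
        return index == 0 && _mesa_inside_begin_end(ctx);
    }

    /* Display-list variant: consult the dlist compiler's begin/end state. */
    static inline bool
    is_vertex_position_dlist(const struct gl_context *ctx, GLuint index)
    {
        return index == 0 && _mesa_inside_dlist_begin_end(ctx);
    }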
-
Mathias Fröhlich authored
That seems to have been lost somewhere. It is needed for correct outside-begin/end detection in display list compilation, and for the correct aliasing in dlists reestablished in the next changes.
Reviewed-by: Brian Paul <brianp@vmware.com>
Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>
-
Mathias Fröhlich authored
The value is now unused.
Reviewed-by: Brian Paul <brianp@vmware.com>
Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>
-
Mathias Fröhlich authored
Reviewed-by: Brian Paul <brianp@vmware.com>
Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>
-
Mathias Fröhlich authored
It is no longer used, so we have fewer occasions where NewState is non-zero.
Reviewed-by: Brian Paul <brianp@vmware.com>
Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>
-
Mathias Fröhlich authored
Now this part of gl_context state is unused and can be removed.
Reviewed-by: Brian Paul <brianp@vmware.com>
Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>
-
Mathias Fröhlich authored
In glArrayElement, use the bitmask trick to walk just the enabled VAO arrays. This should be about equivalent in execution time to walking the prepared aelt_context list. Finally, this will allow us to reduce the _mesa_update_state calls in a few patches.

v2: Add comments.

Reviewed-by: Brian Paul <brianp@vmware.com>
Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>
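A hedged sketch of the bitmask walk (the VAO field name is an assumption):

    GLbitfield mask = vao->Enabled;  /* assumed: bitmask of enabled arrays */
    while (mask) {
        /* u_bit_scan pops the lowest set bit and returns its index. */
        const int attr = u_bit_scan(&mask);
        /* ... look up and emit vertex array `attr` for this element ... */
    }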
-
Mathias Fröhlich authored
In the glArrayElement implementation, use glVertexAttrib*NV-type functions for fixed-function attributes. We do the same in display list execution when the list is replayed using immediate-mode attribute functions. Using a single set of function pointers enables us to use a unified loop to walk the vertex array attributes.
Reviewed-by: Brian Paul <brianp@vmware.com>
Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>
-
Mathias Fröhlich authored
For access to the glArrayElement methods, factor out a function that returns the table lookup index for normalized/integer/double access. The function will be used at least twice in the next patch.

v2: Use vertex_format_to_index instead of NORM_IDX.

Reviewed-by: Brian Paul <brianp@vmware.com>
Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>
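A hedged sketch of such a lookup-index function (the row encoding here is an assumption, not the actual Mesa values):

    /* Map a vertex format onto a row of the attribute-function table:
     * float/normalized, integer, or double entry points. */
    static inline unsigned
    vertex_format_to_index(const struct gl_vertex_format *vformat)
    {
        if (vformat->Doubles)
            return 2;
        else if (vformat->Integer)
            return 1;
        else
            return 0;
    }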
-