Commits · mesa-19.0-rc1 · Jocelyn Falempe / mesa

Jan 30, 2019

VERSION: bump to 19.0.0-rc1 · 2fddad9e
Dylan Baker authored 5 years ago

2 tags

2fddad9e

android,autotools,i965: Fix location of float64_glsl.h · 2b603ee4

Dylan Baker authored 5 years ago

Android.mk and autotools disagree about where generated files should
go, which wasn't a problem until we wanted to build a dist
tarball. This corrects the problme by changing the output and include
paths to be the same on android and autotools (meson already has the
correct include path).

Fixes: 7d7b3083
       ("automake: Fix path to generated source")

2b603ee4

automake: Add --enable-autotools to distcheck flags · e7f6a5d1
Dylan Baker authored 5 years ago
```
Fixes: e68777c8
       ("autotools: Deprecate the use of autotools")
```
e7f6a5d1

configure: Bump SWR LLVM requirement to 7 · 1f5f1268

Dylan Baker authored 5 years ago

It is currently impossible to build a dist tarball that works when SWR
requires LLVM 6. To generate the tarball we'd need to configure with
LLVM 6, which is fine. But to build the dist check we need LLVM 7, as
RadeonSI and RadV require that version. Unfortunately the headers
genererated with LLVM 6 don't compile with LLVM 7, the API has changed
between the two versions.

I weighed a couple of options here. One would be to ship an
unbootstrapped tarball generated with meson. This would fix the issue
by not bootstrapping, so whatever version of LLVM used would work
because the SWR headers would be generated at compile
time. Unfortunately this would involve some heavy modifications to the
infastructure used to upload the tarballs, and I've decided not to
persue this.

1f5f1268

Jan 29, 2019

automake: Add include dir for nir src directory · 90a7a9c9

Dylan Baker authored 5 years ago


Fixes: 6281f26f
       ("v3d: Add support for shader_image_load_store.")
Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>

90a7a9c9

automake: Add float64.glsl to dist tarball · 82365595

Dylan Baker authored 5 years ago


Fixes: b63a1f8e
       ("glsl: Create file to contain software fp64 functions")
Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>

82365595

automake: Fix path to generated source · 7d7b3083

Dylan Baker authored 5 years ago


Fixes: b63a1f8e
       ("glsl: Create file to contain software fp64 functions")
Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>

7d7b3083

nir: Optimize double-precision lower_round_even() · 9de90cac

Matt Turner authored 5 years ago


Use the trick of adding and then subtracting 2**52 (52 is the number of
explicit mantissa bits a double-precision floating-point value has) to
implement round-to-even.

Cuts the number of instructions on SKL of the piglit test
fs-roundEven-double.shader_test from 109 to 21.

Reviewed-by: Roland Scheidegger <sroland@vmware.com>

9de90cac

ac: use the correct LLVM processor name on Raven2 · 3e249b85
Marek Olšák authored 5 years ago
```
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
```
3e249b85
v3d: Fix the autotools build. · f7769b51
Emma Anholt authored 5 years ago
```
Noticed while looking at the gitlab-CI MR.
```
f7769b51

freedreno: fix sysmem rendering being used when clear is used · 31a1348a

Jonathan Marek authored 5 years ago and

Rob Clark committed 5 years ago

This batch->cleared value is only used to decide to use sysmem rendering
or not, so it should include any buffers that are affected by a clear.

This is required because the a2xx fast clear doesn't work with sysmem
rendering. The a22x "normal" clear path doesn't work with sysmem either.

Signed-off-by: Jonathan Marek <jonathan@marek.ca>

31a1348a

freedreno: fix depth usage logic · c93d7743

Jonathan Marek authored 6 years ago and

Rob Clark committed 5 years ago


Depth can be used even when there is no restore/resolve of depth. This
happens when the depth buffer is invalidated after rendering to avoid
the resolve operation.

Signed-off-by: Jonathan Marek <jonathan@marek.ca>

c93d7743

freedreno: fix invalidate logic · bcefa0f1

Jonathan Marek authored 6 years ago and

Rob Clark committed 5 years ago

Set dirty bits on invalidate to trigger invalidate logic in fd_draw_vbo.

Also, resource_written for color needs to be after the invalidate logic.

Signed-off-by: Jonathan Marek <jonathan@marek.ca>

bcefa0f1

mesa/st: wire up DiscardFramebuffer · 786f9639
Jonathan Marek authored 6 years ago and Rob Clark committed 5 years ago
```
Signed-off-by: Jonathan Marek <jonathan@marek.ca>
Reviewed-by: Eric Anholt <eric@anholt.net>
```
786f9639

mesa: wire up InvalidateFramebuffer · 0c42b5f3

Rob Clark authored 6 years ago


And before someone actually starts implementing DiscardFramebuffer()
lets rework the interface to something that is actually usable.

Signed-off-by: Rob Clark <robdclark@gmail.com>
Signed-off-by: Jonathan Marek <jonathan@marek.ca>
Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>
Reviewed-by: Eric Anholt <eric@anholt.net>

0c42b5f3

st/dri: invalidate_resource depth/stencil before flush_resource · e6855666

Jonathan Marek authored 6 years ago and

Rob Clark committed 5 years ago


This allows freedreno to be aware of the depth invalidate when flushing
batches on flush_resource.

AFAIK, the only other driver which might care about this change is vc4,
where I think it should help by allowing the depth invalidate to work with
GALLIUM_HUD.

Signed-off-by: Jonathan Marek <jonathan@marek.ca>
Reviewed-by: Eric Anholt <eric@anholt.net>

e6855666

egl/wayland-drm: Only announce formats via wl_drm which the driver supports. · 820dfcea

Mario Kleiner authored 6 years ago and

Adam Jackson committed 5 years ago


Check if a pixel format is supported by the Wayland servers gpu driver
before exposing it to the client via wl_drm, so we avoid reporting formats
to the client which the server gpu can't handle.

Restrict this reporting to the new color depth 30 formats for now, as the
ARGB/XRGB8888 and RGB565 formats are probably supported by every gpu under
the sun.

Atm. this is mostly useful to allow proper PRIME renderoffload for depth
30 formats on the typical Intel iGPU + NVidia dGPU "NVidia Optimus" laptop
combo.

Tested on Intel, AMD, NVidia with single-gpu setup and on a Intel + NVidia
Optimus setup.

Signed-off-by: Mario Kleiner <mario.kleiner.de@gmail.com>
Reviewed-by: Adam Jackson <ajax@redhat.com>
Reviewed-by: Daniel Stone <daniels@collabora.com>

820dfcea

egl/wayland: Allow client->server format conversion for PRIME offload. (v2) · a34b0d68

Mario Kleiner authored 6 years ago and

Adam Jackson committed 5 years ago


Support PRIME render offload between a Wayland server gpu and a Wayland
client gpu with different channel ordering for their color formats,
e.g., between Intel drivers which currently only support ARGB2101010
and XRGB2101010 import/display and nouveau which only supports ABGR2101010
rendering and display on nv-50 and later.

In the wl_visuals table, we also store for each format an alternate
sibling format which stores colors at the same precision, but with
different channel ordering, e.g., ARGB2101010 <-> ABGR2101010.

If a given client-gpu renderable format is not supported by the server
for import, but the alternate format is supported by the server, expose
the client-gpu renderable format as a valid EGLConfig to the client. At
eglSwapBuffers time, during the blitImage() detiling blit from the client
backbuffer to the linear buffer, the client format is converted to the
server supported format. As we have to do a copy for PRIME anyway,
this channel swizzling conversion comes essentially for free.

Note that even if a server gpu in principle does support sampling
from the clients native format, this conversion will be a performance
advantage if it allows to convert to the servers preferred format
for direct scanout, as the Wayland compositor may then be able to
directly page-flip a fullscreen client wl_buffer onto the primary
plane, or onto a hardware overlay plane, avoiding an extra data copy
for desktop composition.

Tested so far under Weston with: nouveau single-gpu, Intel single-gpu,
AMD single-gpu, "Optimus" Intel server iGPU for display + NVidia
client dGPU for rendering.

v2: Implement minor review comments by Eric Engestrom: Add some
    comment and assert, and some style fixes for clarity.
    No functional change.

Signed-off-by: Mario Kleiner <mario.kleiner.de@gmail.com>
Reviewed-by: Adam Jackson <ajax@redhat.com>
Reviewed-by: Daniel Stone <daniels@collabora.com>

a34b0d68

intel/fs: Use split sends for surface writes on gen9+ · a920979d

Faith Ekstrand authored 6 years ago


Surface reads don't need them because they just have the one address
payload.  With surface writes, on the other hand, we can put the address
and the data in the different halves and avoid building the payload all
together.

The decrease in register pressure and added freedom in register
allocation resulting from this change reduces spilling enough to improve
the performance of one customer benchmark by about 2x.

Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>

a920979d

intel/fs: Add interference between SENDS sources · 014edff0
Faith Ekstrand authored 5 years ago
```
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
```
014edff0
intel/fs: Support SENDS in SHADER_OPCODE_SEND · eab1c555
Faith Ekstrand authored 6 years ago
```
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
```
eab1c555
intel/disasm: Properly disassemble split sends · cca199fd
Faith Ekstrand authored 6 years ago
```
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
```
cca199fd
intel/eu: Add support for the SENDS[C] messages · 8babaa84
Faith Ekstrand authored 6 years ago
```
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
```
8babaa84

intel/inst: Indent some code · d6a6e103

Faith Ekstrand authored 6 years ago


We're about to add some more if cases so let's have the giant re-indent
in it's own patch to make review easier.

Acked-by: Iago Toral Quiroga <itoral@igalia.com>

d6a6e103

intel/inst: Fix the ia16_addr_imm helpers · d9696912

Faith Ekstrand authored 6 years ago


These have clearly never seen any use.... On gen8, the bottom 4 bits are
missing so we need to shift them off before we call set_bits and shift
again when we get the bits.  Found by inspection.

Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>

d9696912

intel/disasm: Rework SEND decoding to use descriptors · e46fb331

Faith Ekstrand authored 6 years ago

Instead of fetching the information out of the instruction directly,
fetch the descriptor and then pluck the information out of the
descriptor. The current scheme works ok for SEND but with SENDS, it all
falls to pieces because the descriptor is completely shuffled around.

This commit doesn't actually convert everything. One notable exception
is URB messages which don't even use descriptors in emit_urb_WRITE yet.

Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>

e46fb331

intel/eu: Add more message descriptor helpers · 13a6fabc

Faith Ekstrand authored 6 years ago

We want to be able to extract data from descriptors as well as unify a
bit of the descriptor construction.

One of the unifications we do is to unify the read/write and dataport
descriptors. On gen4-5, read/write are substantially different and the
read descriptors change between gen4 and gen4.x. On gen6, they unified
layouts between read, write, and dataport. Then, on gen8, they added
one bit to the message type field but left it reserved MBZ for
read/write messages. This commit chooses to treat that as if they
expanded the field everywhere and just didn't have enough enum values
for read/write to bother with the extra bit.

Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>

13a6fabc

intel/eu/validate: SEND restrictions also apply to SENDC · c3aa436b
Faith Ekstrand authored 6 years ago
```
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
```
c3aa436b
intel/eu: Use GET_BITS in brw_inst_set_send_ex_desc · fee6bd8d
Faith Ekstrand authored 6 years ago
```
It's a bit more readable

Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
```
fee6bd8d
intel/fs: Use SHADER_OPCODE_SEND for varying UBO pulls on gen7+ · b284d222
Faith Ekstrand authored 6 years ago
```
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
```
b284d222
intel/fs: Use SHADER_OPCODE_SEND for texturing on gen7+ · 8514eba6
Faith Ekstrand authored 6 years ago
```
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
```
8514eba6
intel/fs: Use a logical opcode for IMAGE_SIZE · f547cebb
Faith Ekstrand authored 6 years ago
```
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
```
f547cebb
intel/fs: Use SHADER_OPCODE_SEND for surface messages · d2d3e045
Faith Ekstrand authored 6 years ago
```
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
```
d2d3e045
intel/fs: Add a generic SEND opcode · 7f1cf046
Faith Ekstrand authored 6 years ago
```
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
```
7f1cf046

intel/eu: Rework surface descriptor helpers · ba3c5300

Faith Ekstrand authored 6 years ago

This commit pulls the surface descriptor helpers out into brw_eu.h and
makes them no longer depend on the codegen infrastructure. This should
allow us to use them directly from the IR code instead of the generator.
This change is unfortunately less mechanical than perhaps one would like
but it should be fairly straightforward.

Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>

ba3c5300

intel/eu: Add has_simd4x2 bools to surface_write functions · 5b173796
Faith Ekstrand authored 6 years ago
```
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
```
5b173796

intel/fs: Take an explicit exec size in brw_surface_payload_size() · 2ce93b88

Faith Ekstrand authored 6 years ago


Instead of magically falling back to SIMD8 for atomics and typed
messages on Ivy Bridge, explicitly figure out the exec size and pass
that into brw_surface_payload_size.

Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>

2ce93b88

intel/fs: Handle IMAGE_SIZE in size_read() and is_send_from_grf() · cf42b0f9

Faith Ekstrand authored 6 years ago


Like all the other sends, it's just mlen * REG_SIZE.

Fixes: 3cbc02e4 "intel: Use TXS for image_size when we have..."
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>

cf42b0f9

intel/defines: Explicitly cast to uint32_t in SET_FIELD and SET_BITS · 009c0bd8

Faith Ekstrand authored 6 years ago

If you pass a bool in as the value to set, the C standard says that it
gets converted to an int prior to shifting. If you try to set a bool to
bit 31, this lands you in undefined behavior. It's better just to add
the explicit cast and let the compiler delete it for us.

Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>

009c0bd8

intel/fs: Get rid of fs_inst::equals · 077b9557

Faith Ekstrand authored 5 years ago

There are piles of fields that it doesn't check so using it is a lie.
The only reason why it's not causing problem is because it has exactly
one user which only uses it for MOV instructions (which aren't very
interesting) and only on Sandy Bridge and earlier hardware. Just get
rid of it and inline it in the one place that it's actually used.

Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>

077b9557

Admin message