Commits · mesa-19.0.0-rc2 · ssbertilson / mesa

Feb 05, 2019

Version: Bump for rc2 · 131f12d4
Dylan Baker authored 5 years ago

mesa-19.0.0-rc2

131f12d4

anv: wire up the state_pool_padding test · f8f68c41

Emil Velikov authored 5 years ago and

Dylan Baker committed 5 years ago


Cc: Jason Ekstrand <jason@jlekstrand.net>
Fixes: 927ba12b ("anv/tests: Adding test for the state_pool padding.")
Signed-off-by: Emil Velikov <emil.velikov@collabora.com>
Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>
Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>
Reviewed-by: Dylan Baker <dylan@pnwbakers.com>
(cherry picked from commit 8943eb8f)

f8f68c41

loader/dri3: Use strlen instead of sizeof for creating VRR property atom · 15e2fc16

Michel Dänzer authored 5 years ago and

Dylan Baker committed 5 years ago


sizeof counts the terminating null character as well, so that also
contributed to the ID computed for the X11 atom. But the convention is
for only the non-null characters to contribute to the atom ID.

Fixes: 2e12fe42 "loader/dri3: Enable adaptive_sync via
                     _VARIABLE_REFRESH property"
Reviewed-by: Nicholas Kazlauskas <nicholas.kazlauskas@amd.com>
Reviewed-by: Eric Anholt <eric@anholt.net>
(cherry picked from commit c0a540f3)

15e2fc16
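
The sizeof/strlen difference described in this commit can be shown with a minimal sketch. The property name is taken from the Fixes line above; the helper names are hypothetical, and the real loader code presumably passes the length to an X11 atom request, which is omitted here.

```c
#include <stddef.h>
#include <string.h>

/* Property name from the commit's Fixes line; the array itself is only an
 * illustration, not the loader's actual definition. */
static const char prop_name[] = "_VARIABLE_REFRESH";

/* sizeof on a char array counts the terminating NUL as well. */
static size_t name_len_with_nul(void)
{
    return sizeof(prop_name);
}

/* strlen counts only the non-NUL characters, which is the convention
 * the commit message describes for atom names. */
static size_t name_len_without_nul(void)
{
    return strlen(prop_name);
}
```

Passing the first length where the second is expected makes the NUL byte part of the name, so the two lengths would name different atoms.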

radeonsi: fix crashing performance counters (division by zero) · 3f509918
Marek Olšák authored 6 years ago and Dylan Baker committed 5 years ago
```
Fixes: e2b9329f "radeonsi: move remaining perfcounter code into si_perfcounter.c"
(cherry picked from commit 742d6cdb)
```
3f509918

Feb 04, 2019

anv: Fix VK_EXT_transform_feedback working with varyings packed in PSIZ · 9667d89f

Danylo Piliaiev authored 5 years ago and

Dylan Baker committed 5 years ago


Transform feedback did not set correct SO_DECL.ComponentMask for
varyings packed in VARYING_SLOT_PSIZ:
 gl_Layer         - VARYING_SLOT_LAYER    in VARYING_SLOT_PSIZ.y
 gl_ViewportIndex - VARYING_SLOT_VIEWPORT in VARYING_SLOT_PSIZ.z
 gl_PointSize     - VARYING_SLOT_PSIZ     in VARYING_SLOT_PSIZ.w

Fixes: 36ee2fd6 "anv: Implement the basic form of VK_EXT_transform_feedback"

Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com>
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
(cherry picked from commit 64d3b148)

9667d89f
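
The packing listed in this commit can be sketched as a lookup from slot to component mask. The enum and function names are hypothetical stand-ins, not the anv driver's code; the sketch assumes bit i of the mask selects component i (x=0, y=1, z=2, w=3).

```c
#include <stdint.h>

/* Hypothetical stand-ins for the varying slots named in the commit message. */
enum example_slot {
    EXAMPLE_SLOT_LAYER,
    EXAMPLE_SLOT_VIEWPORT,
    EXAMPLE_SLOT_PSIZ,
    EXAMPLE_SLOT_OTHER,
};

/* Returns the component mask for a varying. Slots packed into the PSIZ
 * register get the single component listed in the commit message; every
 * other slot keeps the mask the caller already computed. */
static uint8_t packed_component_mask(enum example_slot slot, uint8_t default_mask)
{
    switch (slot) {
    case EXAMPLE_SLOT_LAYER:    return 1u << 1; /* .y */
    case EXAMPLE_SLOT_VIEWPORT: return 1u << 2; /* .z */
    case EXAMPLE_SLOT_PSIZ:     return 1u << 3; /* .w */
    default:                    return default_mask;
    }
}
```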

intel/fs: Do the grf127 hack on SIMD8 instructions in SIMD16 mode · c6649ca9

Faith Ekstrand authored 6 years ago and

Dylan Baker committed 5 years ago


Previously, we only applied the fix to shaders with a dispatch mode of
SIMD8 but the code it relies on for SIMD16 mode only applies to SIMD16
instructions.  If you have a SIMD8 instruction in a SIMD16 shader,
neither would trigger and the restriction could still be hit.

Fixes: 232ed898 "i965/fs: Register allocator shoudn't use grf127..."
Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
(cherry picked from commit b4f0d062)

c6649ca9

Feb 01, 2019

st/mesa: Fix topogun-1.06-orc-84k-resize.trace crash · 89f84f98

Neha Bhende authored 5 years ago and

Dylan Baker committed 5 years ago


We need to initialize all fields in rs->prim explicitly while
creating new rastpos stage.

Fixes: bac85342 ("st/mesa: allow glDrawElements to work with GL_SELECT
feedback")

v2: Initializing all fields in rs->prim as per Ilia.

Reviewed-by: Brian Paul <brianp@vmware.com>
Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>
(cherry picked from commit 69d736b1)

89f84f98

Jan 31, 2019

v3d: Fix leak in resource setup error path · c824f803

Ernestas Kulik authored 6 years ago and

Dylan Baker committed 5 years ago


Reported by Coverity: in the case of unsupported modifier request, the
code does not jump to the “fail” label to destroy the acquired resource.

CID: 1435704
Signed-off-by: Ernestas Kulik <ernestas.kulik@gmail.com>
Fixes: 45bb8f29 ("broadcom: Add V3D 3.3 gallium driver called "vc5", for BCM7268.")
(cherry picked from commit 90458bef)

c824f803
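
The error-path pattern this commit refers to can be sketched as follows. All names are hypothetical and the counter exists only to make the leak observable; this is not the v3d driver's code.

```c
#include <stdbool.h>
#include <stdlib.h>

/* Counts outstanding allocations so a leak would be visible. */
static int live_resources;

struct example_resource {
    int modifier;
};

static struct example_resource *
example_resource_create(int modifier, bool modifier_supported)
{
    struct example_resource *rsc = calloc(1, sizeof(*rsc));
    if (!rsc)
        return NULL;
    live_resources++;

    /* A plain "return NULL" here would leak rsc; jumping to the shared
     * cleanup label releases it instead. */
    if (!modifier_supported)
        goto fail;

    rsc->modifier = modifier;
    return rsc;

fail:
    free(rsc);
    live_resources--;
    return NULL;
}

static void example_resource_destroy(struct example_resource *rsc)
{
    free(rsc);
    live_resources--;
}
```

Routing every failure after the allocation through one label keeps the cleanup in a single place, which is the property the Coverity report found missing.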

v3d: Fix image_load_store clamping of signed integer stores. · 7fdb0837

Emma Anholt authored 5 years ago and

Dylan Baker committed 5 years ago

This was a copy-and-paste failure that, oddly, showed up in the CTS's
reinterprets of r32f, rgba8, and srgba8 to rgba8i, but not in r32ui and r32i
to rgba8i or in reinterprets to other signed int formats.

Fixes: 6281f26f ("v3d: Add support for shader_image_load_store.")
(cherry picked from commit ab4d5775)

7fdb0837

mesa: Skip partial InvalidateFramebuffer of packed depth/stencil. · 535cc4f1

Emma Anholt authored 5 years ago and

Dylan Baker committed 5 years ago


One of the CTS cases tries to invalidate just stencil of packed
depth/stencil, and we incorrectly lost the depth contents.

Fixes dEQP-GLES3.functional.fbo.invalidate.whole.unbind_read_stencil
Fixes: 0c42b5f3 ("mesa: wire up InvalidateFramebuffer")
Reviewed-by: Marek Olšák <marek.olsak@amd.com>

(cherry picked from commit db2ae511)

535cc4f1

freedreno: more fixing release tarball · 7f91ae20

Rob Clark authored 5 years ago and

Dylan Baker committed 5 years ago


Fixes: aa0fed10 freedreno: move ir3 to common location
Signed-off-by: Rob Clark <robdclark@gmail.com>
(cherry picked from commit 39cfdf99)

7f91ae20

freedreno: fix release tarball · 0a72505a

Rob Clark authored 5 years ago and

Dylan Baker committed 5 years ago


Fixes: b4476138 freedreno: move drm to common location
Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>
Signed-off-by: Rob Clark <robdclark@gmail.com>
(cherry picked from commit e252656d)

0a72505a

radv/winsys: fix hash when adding internal buffers · 31d0079a

Samuel Pitoiset authored 5 years ago and

Dylan Baker committed 5 years ago


This fixes serious stuttering in Shadow Of The Tomb Raider.

Fixes: 50fd253b ("radv/winsys: Add priority handling during submit.")
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
(cherry picked from commit 9c762c01)

31d0079a

vc4: Fix leak in HW queries error path · 4d1dd3b0

Ernestas Kulik authored 6 years ago and

Dylan Baker committed 5 years ago


Reported by Coverity: in the case where there exist hardware and
non-hardware queries, the code does not jump to err_free_query and leaks
the query.

CID: 1430194
Signed-off-by: Ernestas Kulik <ernestas.kulik@gmail.com>
Fixes: 9ea90ffb ("broadcom/vc4: Add support for HW perfmon")
(cherry picked from commit f6e49d5a)

4d1dd3b0

vc4: Declare the last cpu pointer as being modified in NEON asm. · 45d1aa2f

Emil Velikov authored 5 years ago and

Dylan Baker committed 5 years ago


An earlier commit addressed 7 of the 8 instances available.

v2: Rebase patch back to master (by anholt)

Cc: Carsten Haitzler (Rasterman) <raster@rasterman.com>
Cc: Eric Anholt <eric@anholt.net>
Fixes: 300d3ae8 ("vc4: Declare the cpu pointers as being modified in NEON asm.")
Signed-off-by: Emil Velikov <emil.velikov@collabora.com>
(cherry picked from commit 385843ac)

45d1aa2f

Jan 30, 2019

VERSION: bump to 19.0.0-rc1 · 2fddad9e
Dylan Baker authored 5 years ago


2fddad9e

android,autotools,i965: Fix location of float64_glsl.h · 2b603ee4

Dylan Baker authored 5 years ago

Android.mk and autotools disagree about where generated files should
go, which wasn't a problem until we wanted to build a dist
tarball. This corrects the problem by changing the output and include
paths to be the same on android and autotools (meson already has the
correct include path).

Fixes: 7d7b3083
       ("automake: Fix path to generated source")

2b603ee4

automake: Add --enable-autotools to distcheck flags · e7f6a5d1
Dylan Baker authored 5 years ago
```
Fixes: e68777c8
       ("autotools: Deprecate the use of autotools")
```
e7f6a5d1

configure: Bump SWR LLVM requirement to 7 · 1f5f1268

Dylan Baker authored 5 years ago

It is currently impossible to build a dist tarball that works when SWR
requires LLVM 6. To generate the tarball we'd need to configure with
LLVM 6, which is fine. But to build the dist check we need LLVM 7, as
RadeonSI and RadV require that version. Unfortunately the headers
generated with LLVM 6 don't compile with LLVM 7, because the API changed
between the two versions.

I weighed a couple of options here. One would be to ship an
unbootstrapped tarball generated with meson. This would fix the issue
by not bootstrapping, so whatever version of LLVM used would work
because the SWR headers would be generated at compile
time. Unfortunately this would involve some heavy modifications to the
infrastructure used to upload the tarballs, and I've decided not to
pursue this.

1f5f1268

Jan 29, 2019

automake: Add include dir for nir src directory · 90a7a9c9

Dylan Baker authored 5 years ago


Fixes: 6281f26f
       ("v3d: Add support for shader_image_load_store.")
Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>

90a7a9c9

automake: Add float64.glsl to dist tarball · 82365595

Dylan Baker authored 5 years ago


Fixes: b63a1f8e
       ("glsl: Create file to contain software fp64 functions")
Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>

82365595

automake: Fix path to generated source · 7d7b3083

Dylan Baker authored 5 years ago


Fixes: b63a1f8e
       ("glsl: Create file to contain software fp64 functions")
Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>

7d7b3083

nir: Optimize double-precision lower_round_even() · 9de90cac

Matt Turner authored 6 years ago


Use the trick of adding and then subtracting 2**52 (52 is the number of
explicit mantissa bits a double-precision floating-point value has) to
implement round-to-even.

Cuts the number of instructions on SKL of the piglit test
fs-roundEven-double.shader_test from 109 to 21.

Reviewed-by: Roland Scheidegger <sroland@vmware.com>

9de90cac
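
The add-then-subtract trick described in this commit can be sketched on the CPU in C. This is an illustration under assumptions, not the NIR lowering itself: it assumes IEEE 754 doubles, the default round-to-nearest-even rounding mode, and no fast-math style optimization. The function name is hypothetical, and negative zero is not preserved.

```c
/* Round to the nearest integer, ties to even, by adding and then
 * subtracting 2^52. For magnitudes below 2^52 the sum has no fraction
 * bits left, so the hardware's own rounding does the work. */
static double round_even_magic(double x)
{
    const double magic = 4503599627370496.0; /* 2^52 */
    double ax = x < 0.0 ? -x : x;

    /* Values at or above 2^52 are already integral; NaN also lands here
     * because the comparison is false. */
    if (!(ax < magic))
        return x;

    /* volatile keeps the compiler from folding (ax + magic) - magic to ax. */
    volatile double sum = ax + magic;
    double rounded = sum - magic;

    return x < 0.0 ? -rounded : rounded;
}
```

For example, 0.5 and 2.5 both sit exactly between two integers and round to the even neighbours 0 and 2, while 1.5 and 3.5 round to 2 and 4.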

ac: use the correct LLVM processor name on Raven2 · 3e249b85
Marek Olšák authored 5 years ago
```
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
```
3e249b85
v3d: Fix the autotools build. · f7769b51
Emma Anholt authored 5 years ago
```
Noticed while looking at the gitlab-CI MR.
```
f7769b51

freedreno: fix sysmem rendering being used when clear is used · 31a1348a

Jonathan Marek authored 6 years ago and

Rob Clark committed 5 years ago

This batch->cleared value is only used to decide to use sysmem rendering
or not, so it should include any buffers that are affected by a clear.

This is required because the a2xx fast clear doesn't work with sysmem
rendering. The a2xx "normal" clear path doesn't work with sysmem either.

Signed-off-by: Jonathan Marek <jonathan@marek.ca>

31a1348a

freedreno: fix depth usage logic · c93d7743

Jonathan Marek authored 6 years ago and

Rob Clark committed 5 years ago


Depth can be used even when there is no restore/resolve of depth. This
happens when the depth buffer is invalidated after rendering to avoid
the resolve operation.

Signed-off-by: Jonathan Marek <jonathan@marek.ca>

c93d7743

freedreno: fix invalidate logic · bcefa0f1

Jonathan Marek authored 6 years ago and

Rob Clark committed 5 years ago

Set dirty bits on invalidate to trigger invalidate logic in fd_draw_vbo.

Also, resource_written for color needs to be after the invalidate logic.

Signed-off-by: Jonathan Marek <jonathan@marek.ca>

bcefa0f1

mesa/st: wire up DiscardFramebuffer · 786f9639
Jonathan Marek authored 6 years ago and Rob Clark committed 5 years ago
```
Signed-off-by: Jonathan Marek <jonathan@marek.ca>
Reviewed-by: Eric Anholt <eric@anholt.net>
```
786f9639

mesa: wire up InvalidateFramebuffer · 0c42b5f3

Rob Clark authored 6 years ago


And before someone actually starts implementing DiscardFramebuffer()
let's rework the interface to something that is actually usable.

Signed-off-by: Rob Clark <robdclark@gmail.com>
Signed-off-by: Jonathan Marek <jonathan@marek.ca>
Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>
Reviewed-by: Eric Anholt <eric@anholt.net>

0c42b5f3

st/dri: invalidate_resource depth/stencil before flush_resource · e6855666

Jonathan Marek authored 6 years ago and

Rob Clark committed 5 years ago


This allows freedreno to be aware of the depth invalidate when flushing
batches on flush_resource.

AFAIK, the only other driver which might care about this change is vc4,
where I think it should help by allowing the depth invalidate to work with
GALLIUM_HUD.

Signed-off-by: Jonathan Marek <jonathan@marek.ca>
Reviewed-by: Eric Anholt <eric@anholt.net>

e6855666

egl/wayland-drm: Only announce formats via wl_drm which the driver supports. · 820dfcea

Mario Kleiner authored 6 years ago and

Adam Jackson committed 5 years ago


Check if a pixel format is supported by the Wayland server's GPU driver
before exposing it to the client via wl_drm, so we avoid reporting formats
to the client which the server GPU can't handle.

Restrict this reporting to the new color depth 30 formats for now, as the
ARGB/XRGB8888 and RGB565 formats are probably supported by every gpu under
the sun.

At the moment, this is mostly useful to allow proper PRIME render offload for depth
30 formats on the typical Intel iGPU + NVidia dGPU "NVidia Optimus" laptop
combo.

Tested on Intel, AMD, NVidia with single-gpu setup and on an Intel + NVidia
Optimus setup.

Signed-off-by: Mario Kleiner <mario.kleiner.de@gmail.com>
Reviewed-by: Adam Jackson <ajax@redhat.com>
Reviewed-by: Daniel Stone <daniels@collabora.com>

820dfcea

egl/wayland: Allow client->server format conversion for PRIME offload. (v2) · a34b0d68

Mario Kleiner authored 6 years ago and

Adam Jackson committed 5 years ago


Support PRIME render offload between a Wayland server gpu and a Wayland
client gpu with different channel ordering for their color formats,
e.g., between Intel drivers which currently only support ARGB2101010
and XRGB2101010 import/display and nouveau which only supports ABGR2101010
rendering and display on nv-50 and later.

In the wl_visuals table, we also store for each format an alternate
sibling format which stores colors at the same precision, but with
different channel ordering, e.g., ARGB2101010 <-> ABGR2101010.

If a given client-gpu renderable format is not supported by the server
for import, but the alternate format is supported by the server, expose
the client-gpu renderable format as a valid EGLConfig to the client. At
eglSwapBuffers time, during the blitImage() detiling blit from the client
backbuffer to the linear buffer, the client format is converted to the
server supported format. As we have to do a copy for PRIME anyway,
this channel swizzling conversion comes essentially for free.

Note that even if a server gpu in principle does support sampling
from the client's native format, this conversion will be a performance
advantage if it allows conversion to the server's preferred format
for direct scanout, as the Wayland compositor may then be able to
directly page-flip a fullscreen client wl_buffer onto the primary
plane, or onto a hardware overlay plane, avoiding an extra data copy
for desktop composition.

Tested so far under Weston with: nouveau single-gpu, Intel single-gpu,
AMD single-gpu, "Optimus" Intel server iGPU for display + NVidia
client dGPU for rendering.

v2: Implement minor review comments by Eric Engestrom: Add some
    comment and assert, and some style fixes for clarity.
    No functional change.

Signed-off-by: Mario Kleiner <mario.kleiner.de@gmail.com>
Reviewed-by: Adam Jackson <ajax@redhat.com>
Reviewed-by: Daniel Stone <daniels@collabora.com>

a34b0d68
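
The ARGB2101010 <-> ABGR2101010 conversion described in this commit amounts to swapping the two outer 10-bit channels. The sketch below does that for one pixel on the CPU, assuming the usual DRM fourcc layout of a 32-bit little-endian word (alpha in bits 31:30, then three 10-bit channels). The function name is hypothetical; per the commit message, the actual conversion happens during the blit rather than in code like this.

```c
#include <stdint.h>

/* Swap the red and blue channels of a 2:10:10:10 pixel, converting
 * ARGB2101010 to ABGR2101010 or back. Alpha (bits 31:30) and green
 * (bits 19:10) stay where they are. */
static uint32_t swap_red_blue_2101010(uint32_t pixel)
{
    uint32_t alpha_green = pixel & 0xC00FFC00u;
    uint32_t high10 = (pixel >> 20) & 0x3FFu; /* bits 29:20 */
    uint32_t low10 = pixel & 0x3FFu;          /* bits 9:0 */

    return alpha_green | (low10 << 20) | high10;
}
```

Because the operation is its own inverse, the same helper covers both directions of the sibling-format pairing.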

intel/fs: Use split sends for surface writes on gen9+ · a920979d

Faith Ekstrand authored 6 years ago


Surface reads don't need them because they just have the one address
payload.  With surface writes, on the other hand, we can put the address
and the data in the different halves and avoid building the payload all
together.

The decrease in register pressure and added freedom in register
allocation resulting from this change reduces spilling enough to improve
the performance of one customer benchmark by about 2x.

Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>

a920979d

intel/fs: Add interference between SENDS sources · 014edff0
Faith Ekstrand authored 5 years ago
```
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
```
014edff0
intel/fs: Support SENDS in SHADER_OPCODE_SEND · eab1c555
Faith Ekstrand authored 6 years ago
```
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
```
eab1c555
intel/disasm: Properly disassemble split sends · cca199fd
Faith Ekstrand authored 6 years ago
```
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
```
cca199fd
intel/eu: Add support for the SENDS[C] messages · 8babaa84
Faith Ekstrand authored 6 years ago
```
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
```
8babaa84

intel/inst: Indent some code · d6a6e103

Faith Ekstrand authored 6 years ago


We're about to add some more if cases so let's have the giant re-indent
in its own patch to make review easier.

Acked-by: Iago Toral Quiroga <itoral@igalia.com>

d6a6e103

intel/inst: Fix the ia16_addr_imm helpers · d9696912

Faith Ekstrand authored 6 years ago


These have clearly never seen any use.... On gen8, the bottom 4 bits are
missing so we need to shift them off before we call set_bits and shift
again when we get the bits.  Found by inspection.

Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>

d9696912
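
The shift-on-set, shift-on-get fix described in this commit can be sketched with a made-up instruction layout. The field position and width below are assumptions chosen for illustration and do not match the real gen8 encoding; only the handling of the four missing low bits follows the commit message.

```c
#include <stdint.h>

/* Hypothetical layout: a 6-bit field at bit 8 of a 64-bit instruction
 * word that holds an immediate with its bottom 4 bits dropped. */
#define EX_FIELD_SHIFT      8
#define EX_FIELD_BITS       6
#define EX_DROPPED_LOW_BITS 4

/* Store the immediate: shift off the low bits the encoding cannot hold,
 * then place what remains into the field. */
static uint64_t ex_set_addr_imm(uint64_t inst, uint32_t value)
{
    uint64_t mask = ((1ull << EX_FIELD_BITS) - 1) << EX_FIELD_SHIFT;
    uint64_t field = (uint64_t)(value >> EX_DROPPED_LOW_BITS) << EX_FIELD_SHIFT;

    return (inst & ~mask) | (field & mask);
}

/* Read the immediate: extract the field, then shift back up so the
 * caller sees the original byte value. */
static uint32_t ex_get_addr_imm(uint64_t inst)
{
    uint64_t mask = (1ull << EX_FIELD_BITS) - 1;

    return (uint32_t)((inst >> EX_FIELD_SHIFT) & mask) << EX_DROPPED_LOW_BITS;
}
```

A value only round-trips if its low four bits are zero, which is why forgetting either shift goes unnoticed until the helpers are actually used.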
