Commits · mesa-20.1.10 · Deepti Patil / mesa

Oct 14, 2020

VERSION: bump to release 20.1.10 · 5fa3aae8
Eric Engestrom authored 4 years ago

mesa-20.1.10

5fa3aae8
docs: add release notes for 20.1.10 · 8e440c86
Eric Engestrom authored 4 years ago

8e440c86

aco/isel: Always export position data from VS/NGG · 830dc7c7

Tony Wasserka authored 4 years ago and

Eric Engestrom committed 4 years ago

AMD ISA docs explicitly require this for VS, and this likely extends to
NGG too.

Cc: mesa-stable
Closes: mesa/mesa#3615


Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!7102>
(cherry picked from commit bf51b11c)

830dc7c7

radv: Fix mipmap extent adjustment on GFX9+. · 8e6aa247

Bas Nieuwenhuizen authored 4 years ago and

Eric Engestrom committed 4 years ago

With arrays we really have to use the correct size for the base
mipmap to get the right array pitch. In particular, using
surf_pitch results in pitch that is bigger than the base mipmap
and hence results in wrong pitches computed by the HW.

It seems that on GFX9 this has mostly been hidden by the epitch
provided in the descriptor but this is not something we do on
GFX10 anymore.

Now this has some draw-backs:

1. normalized coordinates don't work
2. Bounds checking uses slightly bigger bounds.

2 mostly is not an issue as we still ensure that they're within
the texture memory and not overlapping other layers/mips, but
we can't properly ignore writes.

1 is kinda dead in the water ... On the other hand I'd argue that
using normalized coords & a filter for sampling a block view of
a compressed format is extraordinarily useless.

The old method we employed already had these drawbacks for everything
except the base miplevel of the imageview.

AFAICT this is the same tradeoff AMDVLK makes and no CTS test hits
this. (once it does I think the HW is dead in the water ... Only
workaround I can think of is shader processing which is hard because
we don't know texture formats at compile time.)

I also removed the extra calculations when the image has only 1 mip
level because they ended up being a no-op in that case.

CC: mesa-stable
Closes: mesa/mesa#2292
Closes: mesa/mesa#2266
Closes: mesa/mesa#2483
Closes: mesa/mesa#2906
Gitlab: mesa/mesa#3607


Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!7090>
(cherry picked from commit 1fb3e1fb)

8e6aa247

scons: fix SPIR-V -> NIR build · b6d88656

Rhys Perry authored 4 years ago and

Eric Engestrom committed 4 years ago


Signed-off-by: Rhys Perry <pendingchaos02@gmail.com>
Tested-by: Vinson Lee <vlee@freedesktop.org>
Reviewed-by: Roland Scheidegger <sroland@vmware.com>
Fixes: 18f9fc91 ('spirv: add and use a generator id enum')
Part-of: <mesa/mesa!7096>
(cherry picked from commit 044d2130)

b6d88656

android: fix SPIR-V -> NIR build · 36dc0d83

Rhys Perry authored 4 years ago and

Eric Engestrom committed 4 years ago


Signed-off-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Mauro Rossi <issor.oruam@gmail.com>
Fixes: 18f9fc91 ('spirv: add and use a generator id enum')
Part-of: <mesa/mesa!7097>
(cherry picked from commit 1070bba1)

36dc0d83

spirv: replace discard with demote for incorrect HLSL->SPIR-V translations · 7bf69260

Rhys Perry authored 4 years ago and

Eric Engestrom committed 4 years ago


Fixes artifacts on decals in Path of Exile.

Signed-off-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>
Closes: mesa/mesa#3610
Cc: mesa-stable
Part-of: <mesa/mesa!7062>
Cherry picked from 037d9fb2

7bf69260

spirv: add and use a generator id enum · fe379368

Rhys Perry authored 4 years ago and

Eric Engestrom committed 4 years ago


Signed-off-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>
Cc: mesa-stable
Part-of: <mesa/mesa!7062>
(cherry picked from commit 18f9fc91)

fe379368

pan/bi: Handle vector moves · 77ba26fc

Alyssa Rosenzweig authored 4 years ago and

Eric Engestrom committed 4 years ago


And fix the bad assertion that let this slip.

Like combines, nir_op_vec can be vector, and we need to lower this
ourselves. Thankfully, the lowering is simple.

Fixes
dEQP-GLES2.functional.shaders.loops.for_uniform_iterations.nested_tricky_dataflow_1_*

Fixes: b2c6cf2b ("pan/bi: Eliminate writemasks in the IR")
Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com>
Part-of: <!7081>
(cherry picked from commit a204eac7)

77ba26fc

anv: Enable multi-layer aux-map init for HIZ+CCS · 3d194ae1

Nanley Chery authored 4 years ago and

Eric Engestrom committed 4 years ago


Fixes rendering corruption in the shadowmappingcascade Sascha Willems
Vulkan demo. To see the corruption, I adjusted the demo options as
follows:

 1. Enable "Display depth map"
 2. Set "Split lambda" to 0.100
 3. Make "Cascade" non-zero.

Fixes: 80ffbe91 ("anv: Add support for HiZ+CCS")
Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com>
Part-of: <mesa/mesa!7046>
(cherry picked from commit cce6fc3b)

3d194ae1

intel/nir: Don't try to emit vector load_scratch instructions · 7412517b

Faith Ekstrand authored 4 years ago and

Eric Engestrom committed 4 years ago


In 53bfcdee, we added load/store_scratch instructions which deviate
a little bit from most memory load/store instructions in that we can't
use the normal untyped read/write instructions which can read and write
up to a vec4 at a time.  Instead, we have to use the DWORD scattered
read/write instructions which are scalar.  To handle this, we added code
to brw_nir_lower_mem_access_bit_sizes to cause them to be scalarized.
However, one case was missing: the load-as-larger-vector case.  In this
case, we take small bit-sized constant-offset loads replace it with a
32-bit load and shuffle the result around as needed.

For scratch, this case is much trickier to get right because it often
emits vec2 or wider which we would then have to lower again.  We did
this for other load and store ops because, for lower bit-sizes we have
to scalarize thanks to the byte scattered read/write instructions being
scalar.  However, for scratch we're not losing as much because we can't
vectorize 32-bit loads and stores either.  It's easier to just disallow
it whenever we have to scalarize.

Fixes: 53bfcdee "intel/fs: Implement the new load/store_scratch..."
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
Part-of: <!6872>
(cherry picked from commit fd04f858)

7412517b

glsl/xxd.py: fix imports · a3de6a58

Dylan Baker authored 4 years ago and

Eric Engestrom committed 4 years ago


sys and string are unused, os is needed but not imported

fixes: 412472da
       ("glsl: Add utility to convert text files to C strings")

Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
Reviewed-by: Jesse Natalie <jenatali@microsoft.com>
Part-of: <mesa/mesa!7034>
(cherry picked from commit 3ff513ee)

a3de6a58

etnaviv: stop leaking the dummy texure descriptor BO · 2e1e97c6

Lucas Stach authored 4 years ago and

Eric Engestrom committed 4 years ago


Free the dummy texture descriptor BO on context destroy.

Fixes: eda73d71 (etnaviv: GC7000: Texture descriptors)
Signed-off-by: Lucas Stach <l.stach@pengutronix.de>
Reviewed-by: Guido Günther <agx@sigxcpu.org>
Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>
Cc: <mesa-stable@lists.freedesktop.org>
Part-of: <mesa/mesa!6986>
(cherry picked from commit 9d5ec7f6)

2e1e97c6

omx/tizonia: fix build · e68b2669

Pierre-Eric Pelloux-Prayer authored 4 years ago and

Eric Engestrom committed 4 years ago

Fixes: 24f2b0a8 ("gallium/video: remove pipe_video_buffer.chroma_format")
Closes: mesa/mesa#3595


Reviewed-by: Leo Liu <leo.liu@amd.com>
Part-of: <mesa/mesa!7026>
(cherry picked from commit 8b205402)

e68b2669

iris: Fix a fast-clear skipping optimization · 93447c73

Nanley Chery authored 4 years ago and

Eric Engestrom committed 4 years ago


When support for multi-slice fast-clears was introduced for color
surfaces, an existing optimization for skipping fast-clears was not
updated (this optimization assumed single-slice fast-clears). As a
result, the driver began to skip multi-layer fast-clears if just the
first slice was in the CLEAR state (ignoring the state of the others).

A Civilization VI trace was the only workload I found to make use of
this optimization and it did so for 2D, non-array textures. Therefore,
this fix simply checks that the depth of the clear box is 1. It also
moves the single-slice aux-state query closer to the optimization to
clarify the need for the depth check.

Enables iris to pass a case of the fcc-write-after-clear piglit test,
[fast-clear tracking across layers 0 -> 1 -> (0,1)].

Fixes: 393f659e ("iris: Enable fast clears on other miplevels and layers than 0.")
Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <mesa/mesa!6973>
(cherry picked from commit 3f3a5f34)

93447c73

intel/perf: fix crash when no perf queries are supported · fe747abc

Lionel Landwerlin authored 4 years ago and

Eric Engestrom committed 4 years ago


Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Fixes: ec1fa1d5 ("intel/perf: fix raw query kernel metric selection")
Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com>
Part-of: <mesa/mesa!7024>
(cherry picked from commit 79f35444)

fe747abc

freedreno: Move rsc NULL check to before rsc dereferences. · bd5400ce

Vinson Lee authored 4 years ago and

Eric Engestrom committed 4 years ago


Fix defect reported by Coverity Scan.

Dereference before null check (REVERSE_INULL)
check_after_deref: Null-checking rsc suggests that it may be
null, but it has already been dereferenced on all paths leading
to the check.

Fixes: 6173cc19 ("freedreno: gallium driver for adreno")
Signed-off-by: Vinson Lee <vlee@freedesktop.org>
Reviewed-by: Rob Clark <robdclark@chromium.org>
Part-of: <mesa/mesa!6903>
(cherry picked from commit 0a7bd14d)

bd5400ce

intel/fs: Don't use NoDDClk/NoDDClr for split SHUFFLEs · fec868f6

Faith Ekstrand authored 4 years ago and

Eric Engestrom committed 4 years ago

When I copied and pasted the code from MOV_INDIRECT for handling the
dependency controls, I missed a subtle difference between MOV_INDIRECT
and SHUFFLE.  Specifically, MOV_INDIRECT gets lowered to a narrow
instruction on Gen7 by the SIMD width lowering whereas SHUFFLE has to
split it in the generator.  Therefore, the check safety check for
whether or not we can use dependency control has to be based on the
lowered width rather than the width of the original instruction.

Fixes: a8ac61b0 "intel/fs: NoMask initialize the address..."
Closes: mesa/mesa#3593


Reviewed-by: Matt Turner <mattst88@gmail.com>
Part-of: <mesa/mesa!6989>
(cherry picked from commit 8427e560)

fec868f6

radv: Use atomics to read query results. · ebb438a3

Bas Nieuwenhuizen authored 4 years ago and

Eric Engestrom committed 4 years ago


The volatile pattern gives me flaky results for 32-bit builds on
ChromeOS Android. This is because on 32-bit the volatile 64-bit
loads gets split into 2 32-bit loads each.

So if we read the lower dword first and then the upper dword, it
can happen that the upper dword is already changed but the lower
dword isn't yet. In particular for occlusion queries this gives
false readings, as the upper dword commonly only constains the
ready bit.

With the GCC atomic intrinsics we get a call to __atomic_load_8
in libatomic.so which does the right thing.

An alternative fix would be to  explicitly split the 32-bit loads
in the right order and do a bunch of retries if things change, though
that gets messy quickly and for 32-bit builds only doesn't feel worth
it that much.

CC: mesa-stable
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!6933>
(cherry picked from commit 7568c97d)

ebb438a3

nir/opt_load_store_vectorize: Use bit sizes when checking mask compatibility · f6dfeec5

Faith Ekstrand authored 4 years ago and

Eric Engestrom committed 4 years ago


Without this, it was checking bit size compatibility with bit sizes such
as 96 which is clearly invalid.

No shader-db changes on Ice Lake

Fixes: ce9205c0 "nir: add a load/store vectorization pass"
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Jesse Natalie <jenatali@microsoft.com>
Part-of: <mesa/mesa!6871>
(cherry picked from commit 57e7c5f0)

f6dfeec5

glsl: don't duplicate state vars as uniforms in the NIR linker · 07ba04a3

Timothy Arceri authored 4 years ago and

Eric Engestrom committed 4 years ago


The linker was adding all state vars as uniforms, doubling the storage size
for shaders using only builtin uniforms, which increased CPU overhead for
constant buffer uploads.

When this code was originally ported from the GLSL IR linker we forgot
to exclude builtins because the check was not done in the
add_uniform_to_shader class but rather a check was done when passing
variables to this class for processing.

Fixes: 664e4a61 ("glsl/nir: Fill in the Parameters in NIR linker")

Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>
Tested-by: Marek Olšák <marek.olsak@amd.com>
Part-of: <mesa/mesa!6958>
(cherry picked from commit 038fcbca)

07ba04a3

intel/fs: NoMask initialize the address register for shuffles · 55eed088

Faith Ekstrand authored 4 years ago and

Eric Engestrom committed 4 years ago

Cc: mesa-stable@lists.freedesktop.org
Closes: mesa/mesa#2979


Tested-by: Iván Briano <ivan.briano@intel.com>
Reviewed-by: Matt Turner <mattst88@gmail.com>
Reviewed-by: Francisco Jerez <currojerez@riseup.net>
Part-of: <mesa/mesa!6825>
(cherry picked from commit a8ac61b0)

55eed088

intel/gen9: Enable MSC RAW Hazard Avoidance · a4a64aba

Anuj Phogat authored 4 years ago and

Eric Engestrom committed 4 years ago


Workaround # 22011374674
Applied to i965, iris and anv drivers
No performance impact is observed with WA.

Cc: mesa-stable
Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
(cherry picked from commit 545d852a)

a4a64aba

radeonsi: Fix dead lock with aux_context_lock in si_screen_clear_buffer. · 0be1a042

Marek Olšák authored 4 years ago and

Eric Engestrom committed 4 years ago


After disable SDMA on Arcturus(gfx9), dead lock with aux_context_lock is
detected since si_screen_clear_buffer is called recursively before
release lock.

The call trace is:
si_clear_render_target->si_compute_clear_render_target->
si_launch_grid_internal->si_launch_grid->si_emit_cache_flush->
si_prim_discard_signal_next_compute_ib_start->u_suballocator_alloc->
si_resource_create->si_buffer_create->si_alloc_resource->
si_screen_clear_buffer->simple_mtx_lock->
si_sdma_clear_buffer->si_pipe_clear_buffer->
si_clear_buffer->si_compute_do_clear_or_copy->
si_launch_grid_internal->si_launch_grid->si_emit_cache_flush->
si_prim_discard_signal_next_compute_ib_start->u_suballocator_alloc->
si_resource_create->si_buffer_create->si_alloc_resource->
si_screen_clear_buffer->simple_mtx_lock

Fixes: 07a49bf5 "radeonsi: disable SDMA on gfx9"
Signed-off-by: James Zhu <James.Zhu@amd.com>
Reviewed-by: Marek Olšák <marek.olsak@amd.com>
Part-of: <mesa/mesa!6941>
(cherry picked from commit 5e8791a0)

0be1a042

nir/cf: Better handle intra-block splits · 8c8bc24b

Faith Ekstrand authored 4 years ago and

Eric Engestrom committed 4 years ago


In the case where end was a instruction-based cursor, we would mix up
our blocks and end up with block_begin pointing after the second split.
This causes a segfault as the cf_node list walk at the end of the
function never terminates properly.  There's also a possibility of
mix-up if begin is an instruction-based cursor which was found by
inspection.

Fixes: fc7f2d23 "nir/cf: add new control modification API's"
Reviewed-by: Connor Abbott <cwabbott0@gmail.com>
Acked-by: Matt Turner <mattst88@gmail.com>
Part-of: <mesa/mesa!6866>
(cherry picked from commit 7dbb1f74)

8c8bc24b

.pick_status.json: Mark c02e933d as applied · 6ea486fe
Eric Engestrom authored 4 years ago
```
It was already part of the backport of 7568c97d, e7dc7f2a.
```
6ea486fe
.pick_status.json: Mark d78df70c as denominated · 0b80b9f1
Eric Engestrom authored 4 years ago

0b80b9f1
.pick_status.json: Update to 68daac28 · 6fff65b3
Eric Engestrom authored 4 years ago

6fff65b3

Sep 30, 2020

docs/relnotes: add sha256 sums to 20.1.9 · 1329a289
Eric Engestrom authored 4 years ago

1329a289
VERSION: bump to release 20.1.9 · 0a443eb1
Eric Engestrom authored 4 years ago

mesa-20.1.9

0a443eb1
docs: add release notes for 20.1.9 · bc6fd91e
Eric Engestrom authored 4 years ago

bc6fd91e

nir/lower_io_arrays: Fix xfb_offset bug · e1f6000b

Connor Abbott authored 4 years ago and

Eric Engestrom committed 4 years ago


I noticed this once I started gathering xfb_info after
nir_lower_io_arrays_to_elements_no_indirect.

Fixes: b2bbd978 ("nir: fix lowering arrays to elements for XFB outputs")
Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>
Part-of: <mesa/mesa!6514>
(cherry picked from commit 5a88db68)

e1f6000b

st/mesa: use roundf instead of floorf for lod-bias rounding · 30b256c2

Erik Faye-Lund authored 4 years ago and

Eric Engestrom committed 4 years ago


There's no good reason not to use a symmetric rounding mode here. This
fixes the following GL CTS case for me:

GTF-GL33.gtf21.GL3Tests.texture_lod_bias.texture_lod_bias_all

Fixes: 132b69c4 ("st/mesa: round lod_bias to a multiple of 1/256")
Reviewed-by: Marek Olšák <marek.olsak@amd.com>
Part-of: <mesa/mesa!6892>
(cherry picked from commit 7685c37b)

30b256c2

Sep 29, 2020

gallium/vl: add chroma_format arg to vl_video_buffer functions · 71b3582e

Pierre-Eric Pelloux-Prayer authored 4 years ago and

Eric Engestrom committed 4 years ago


vl_mpeg12_decoder needs to override the chroma_format value to get the
correct size calculated (chroma_format is used by vl_video_buffer_adjust_size).

I'm not sure why it's needed, but this is needed to get correct mpeg decode.

Fixes: 24f2b0a8 ("gallium/video: remove pipe_video_buffer.chroma_format")
Acked-by: Leo Liu <leo.liu@amd.com>
Part-of: <mesa/mesa!6817>
(cherry picked from commit 2584d48b)

71b3582e

gallium/vl: do not call transfer_unmap if transfer is NULL · fc21ef6b
Pierre-Eric Pelloux-Prayer authored 4 years ago and Eric Engestrom committed 4 years ago
```
CC: mesa-stable
Acked-by: Leo Liu <leo.liu@amd.com>
Part-of: <mesa/mesa!6817>
(cherry picked from commit b121b1b8)
```
fc21ef6b
.pick_status.json: Update to efaea653 · d74c2e74
Eric Engestrom authored 4 years ago

d74c2e74

Sep 28, 2020

.pick_status.json: Mark 89401e58 as denominated · 0dbec6b9
Eric Engestrom authored 4 years ago

0dbec6b9

spirv: fix emitting switch cases that directly jump to the merge block · db4a29d0

Samuel Pitoiset authored 4 years ago and

Eric Engestrom committed 4 years ago

As shown in the valid SPIR-V below, if one switch case statement
directly jumps to the merge block, it has no branches at all and
we have to reset the fall variable. Otherwise, it creates an
unintentional fallthrough.

       OpSelectionMerge %97 None
       OpSwitch %96 %97 1 %99 2 %100
%100 = OpLabel
%102 = OpAccessChain %_ptr_StorageBuffer_v4float %86 %uint_0 %uint_37
%103 = OpLoad %v4float %102
%104 = OpBitcast %v4uint %103
%105 = OpCompositeExtract %uint %104 0
%106 = OpShiftLeftLogical %uint %105 %uint_1
       OpBranch %97
 %99 = OpLabel
       OpBranch %97
 %97 = OpLabel
%107 = OpPhi %uint %uint_4 %75 %uint_5 %99 %106 %100

This fixes serious corruption in Horizon Zero Dawn.

v2: Changed the code to skip the entire if-block instead of resetting
    the fallthrough variable.

Closes: mesa/mesa#3460


Cc: mesa-stable
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>
Part-of: <mesa/mesa!6590>
(cherry picked from commit 57fba85d)

db4a29d0

spirv: extract switch parsing into its own function · 4bff9ca6

Karol Herbst authored 4 years ago and

Eric Engestrom committed 4 years ago


v2 (Jason Ekstrand):
 - Construct a list of vtn_case objects

Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
Part-of: <mesa/mesa!2401>
(cherry picked from commit 467b90fc)

4bff9ca6

.pick_status.json: Mark 6b1a56b9 as denominated · 9dcc7d4d
Eric Engestrom authored 4 years ago

9dcc7d4d

Admin message