Commits · mesa-23.1.1 · Mart Raudsepp / mesa

May 25, 2023

VERSION: bump for 23.1.1 · fa55e3c0
Eric Engestrom authored 1 year ago

mesa-23.1.1

fa55e3c0
docs: add release notes for 23.1.1 · 42789565
Eric Engestrom authored 1 year ago

42789565

vulkan/pipeline_cache: remove a bogus assert when inserting objects · e1626805

Samuel Pitoiset authored 1 year ago and

Eric Engestrom committed 1 year ago


If two threads deserialize the raw object at the same time, the
refcount could be more than 1 temporarily.

This can be reproduced with Granite during the multi-threaded pipeline
cache pre-warm on startup, and also with Dota2.

Fixes: cbab396f ("vulkan/pipeline_cache: replace raw data objects on cache insertion of real objects")
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <!22853>
(cherry picked from commit 8126e028)

e1626805

iris: rework Wa_14017076903 to only apply with occlusion queries · ca9cfa13

Lionel Landwerlin authored 1 year ago and

Eric Engestrom committed 1 year ago


Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Fixes: 415b824b ("iris: implement occlusion query related Wa_14017076903")
Reviewed-by: Tapani Pälli <tapani.palli@intel.com>
Part-of: <!22807>
(cherry picked from commit 1d13f221)

ca9cfa13

radv/video: use correct h264 levels · d4f5fc70

Dave Airlie authored 1 year ago and

Eric Engestrom committed 1 year ago


This should be set to the enum, ffmpeg has it wrong so far, but the sample decoder has it right.

convert radv to the proper answer.

Fixes: 1693c03a ("radv/video: add initial h264 decoder for VCN")
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Part-of: <!23225>
(cherry picked from commit ea2eade5)

d4f5fc70

winsys/radeon: fix the scratch buffer on gfx6-7 · 1a86a603

Marek Olšák authored 1 year ago and

Eric Engestrom committed 1 year ago


I'm sure this was broken.

Fixes: 1bf39b1f - ac,radeonsi: rework how scratch_waves is used and move it to ac_gpu_info.c

Reviewed-by: Dave Airlie <airlied@redhat.com>
Part-of: <!23221>
(cherry picked from commit 474f9fbe)

1a86a603

winsys/radeon: set has_image_opcodes to unbreak gfx6-7 · d86101f0

Marek Olšák authored 1 year ago and

Eric Engestrom committed 1 year ago

Fixes: 96913bbf - ac/surface: force linear image layout for chips not supporting image opcodes

Closes: #9073



Reviewed-by: Dave Airlie <airlied@redhat.com>
Part-of: <!23221>
(cherry picked from commit fe03351b)

d86101f0

dzn: Fix src/dest confusion for some non-bindless descriptor copies · 600c9dff
Jesse Natalie authored 1 year ago and Eric Engestrom committed 1 year ago
```
Fixes: 5d2b4ee4 ("dzn: Allocate descriptor sets in buffers for bindless mode")
Part-of: <!23218>
(cherry picked from commit 6674f04f)
```
600c9dff

dzn: Partial revert of · a59bb82c

Jesse Natalie authored 1 year ago and

Eric Engestrom committed 1 year ago

Turns out there was a good reason for having both buffer count
and desc_count. They served different purposes.

Fixes: 8887852d ("dzn: Add some docs around descriptor sets and remove redundant/unused data")
Part-of: <!23218>
(cherry picked from commit b4852c4e)

a59bb82c

glthread: fix typo related to upload_vertices() · 04affaa5

Patrick Lerda authored 1 year ago and

Eric Engestrom committed 1 year ago


Fixes: 68a926a1 ("glthread: set GL_OUT_OF_MEMORY if we fail to upload vertices")
Signed-off-by: Patrick Lerda <patrick9876@free.fr>
Reviewed-by: Tapani Pälli <tapani.palli@intel.com>
Reviewed-by: Marek Olšák <marek.olsak@amd.com>
Part-of: <!23166>
(cherry picked from commit 39a9ebde)

04affaa5

mesa: fix a VBO buffer reference leak in _mesa_bind_vertex_buffer · a8da5a0c

Marek Olšák authored 1 year ago and

Eric Engestrom committed 1 year ago


Fixes: 03ba57c6 - mesa: extend _mesa_bind_vertex_buffer to take ownership of the buffer reference

Reviewed-by: Yogesh Mohan Marimuthu <yogesh.mohanmarimuthu@amd.com>
Part-of: <!23112>
(cherry picked from commit ce3edf51)

a8da5a0c

zink: use the per-context track_renderpasses flag in more places · c54295e9

Mike Blumenkrantz authored 1 year ago and

Eric Engestrom committed 1 year ago

this should fix some erroneous zsbuf invalidation

Fixes: 215beee1 ("zink: more explicitly track/check rp optimizing per-context")
Part-of: <!23189>
(cherry picked from commit 32b7659f)

c54295e9

zink: don't wait on queue thread if disabled · c517ab7c
Mike Blumenkrantz authored 1 year ago and Eric Engestrom committed 1 year ago
```
Fixes: 270f9c0b ("zink: add ZINK_DEBUG=flushsync")
Part-of: <!23189>
(cherry picked from commit f58594cd)
```
c517ab7c

mesa: fix refcnt imbalance related to egl_image_target_texture() · 46afe552

Patrick Lerda authored 1 year ago and

Eric Engestrom committed 1 year ago


Indeed, the locally allocated "stimg" reference was not freed
on a specific code path.

For instance, this issue is triggered on radeonsi or r600 with:
"piglit/bin/egl-ext_egl_image_storage -auto -fbo"
while setting GALLIUM_REFCNT_LOG=refcnt.log.

Fixes: 6a3f5c65 ("mesa: simplify st_egl_image binding process for texture storage")
Signed-off-by: Patrick Lerda <patrick9876@free.fr>
Reviewed-by: Tapani Pälli <tapani.palli@intel.com>
Part-of: <!23165>
(cherry picked from commit 83cd7d23)

46afe552

radv: do not enable VRS flat shading if the VRS builtin is read · d3ae7a75

Samuel Pitoiset authored 1 year ago and

Eric Engestrom committed 1 year ago


When the fragment shader reads the VRS builtin, VRS flat shading
shouldn't be enabled, otherwise the value might not be what the FS
expects.

Fixes dEQP-VK.fragment_shading_rate.renderpass2.monolithic.multipass.*
on RDNA2 (VRS flat shading isn't yet enabled on RDNA3).

Cc: mesa-stable
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <!23187>
(cherry picked from commit b439bd5a)

d3ae7a75

ac/nir: fix slots in clamping legacy colors · caa1ba76
Filip Gawin authored 1 year ago and Eric Engestrom committed 1 year ago
```
fixes: 7c41cdb8

Reviewed-by: Qiang Yu <yuq825@gmail.com>
Part-of: <!23178>
(cherry picked from commit e3676176)
```
caa1ba76

frontends/va: remove private member and update target buffer · cc5a2ca0

Ruijing Dong authored 1 year ago and

Eric Engestrom committed 1 year ago

use update_decoder_target to update the target buffer to
let decoder obtain correct reference frame.

remove the previous logic which failed to update reference
info in time.

fixes: #8996
fixes: #8387


Cc: mesa-stable

Reviewed-by: Sil Vilerino <sivileri@microsoft.com>
Reviewed-by: Boyuan Zhang <Boyuan.Zhang@amd.com>
Signed-off-by: Ruijing Dong <ruijing.dong@amd.com>
Part-of: <!23061>
(cherry picked from commit 799665c9)

cc5a2ca0

radeonsi/vcn: apply update_decoder_target logic · 744d5524

Ruijing Dong authored 1 year ago and

Eric Engestrom committed 1 year ago


implement update_decoder_target and
remove corresponding obsolete logic.

Cc: mesa-stable
Reviewed-by: Boyuan Zhang <Boyuan.Zhang@amd.com>
Signed-off-by: Ruijing Dong <ruijing.dong@amd.com>
Part-of: <!23061>
(cherry picked from commit a89f740e)

744d5524

gallium/pipe: add interface update_decoder_target · 9d4bc470

Ruijing Dong authored 1 year ago and

Eric Engestrom committed 1 year ago


reason:
decoder uses the target buffer address in record
to indentify the reference frames. When target
buffer has changed outside of decoding process,
it has to be updated back to decoder, otherwise
the outdated reference will cause image corruption.

Cc: mesa-stable
Reviewed-by: Boyuan Zhang <Boyuan.Zhang@amd.com>
Reviewed-by: Sil Vilerino <sivileri@microsoft.com>
Signed-off-by: Ruijing Dong <ruijing.dong@amd.com>
Part-of: <!23061>
(cherry picked from commit 5b2544f8)

9d4bc470

.pick_status.json: Update to 4af6b601 · d2e028f6
Eric Engestrom authored 1 year ago

d2e028f6

virgl: Make query result resource as dirty before requesting result · e3937abb

Gert Wollny authored 1 year ago and

Eric Engestrom committed 1 year ago


The query result resource will be written to by the host, so we have to
declare it as dirty if we want to see the change.

Fixes: 9279a28f (virgl: ARB_query_buffer_object support)

v2: Update expectations in CI

Signed-off-by: Gert Wollny <gert.wollny@collabora.com>
Part-of: <mesa/mesa!23121>
(cherry picked from commit 330a1db0)

e3937abb

intel/fs: fix size_read() for LOAD_PAYLOAD · 28f89e96

Lionel Landwerlin authored 1 year ago and

Eric Engestrom committed 1 year ago


With Anv/Zink, the piglit test :

  arb_shader_storage_buffer_object-max-ssbo-size -auto -fbo fsexceed

is failing validation after copy propagation :

load_payload(8) vgrf15:F, vgrf1+0.12<0>:F, vgrf1+0.0<0>:F, vgrf1+0.4<0>:F, vgrf1+0.8<0>:F, vgrf1+0.12<0>:F
../src/intel/compiler/brw_fs_validate.cpp:191: A <= B failed
  A = inst->src[i].offset / REG_SIZE + regs_read(inst, i) = 2
  B = alloc.sizes[inst->src[i].nr] = 1

In most cases it works because src[0] would be at offset 0 and so
reading a full reg passes validation, but Anv/Zink started emitting
slightly different code adding an offset maybe the size read 2 GRFs.

Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Reviewed-by: Francisco Jerez <currojerez@riseup.net>
Cc: mesa-stable
Part-of: <mesa/mesa!23126>
(cherry picked from commit 21c7b55f)

28f89e96

nir: Fix serializing pointer initializers. · cb9d82bc

Tatsuyuki Ishi authored 1 year ago and

Eric Engestrom committed 1 year ago


Found by manual inspection.

Reviewed-by: Caio Oliveira <caio.oliveira@intel.com>

Fixes: 7acc8105 ("compiler/nir: Add support for variable initialization from a pointer")
Part-of: <mesa/mesa!22355>
(cherry picked from commit 1546a9de)

cb9d82bc

zink: also declare int size caps inline with signed int type usage · ae7dacac
Mike Blumenkrantz authored 1 year ago and Eric Engestrom committed 1 year ago
```
Fixes: 854fd242 ("zink: declare int/float size caps inline with type usage")
Part-of: <mesa/mesa!22934>
(cherry picked from commit 5d8103b1)
```
ae7dacac

zink: flag 'has_work' on batch when promoting a cmd · 3b23995a

Mike Blumenkrantz authored 1 year ago and

Eric Engestrom committed 1 year ago

has_work controls whether a flush can be deferred, i.e., when unset
a flush may be deferred

since a promoted cmd must still be flushed to take effect, ensure this
is always set when promoted cmds are pending

cc: mesa-stable

Part-of: <mesa/mesa!23035>
(cherry picked from commit 0f510040)

3b23995a

zink: explicitly disable reordering after restricted swapchain readback blits · c60508ba

Mike Blumenkrantz authored 1 year ago and

Eric Engestrom committed 1 year ago

when needs_present_readback is set, reordering is disabled without hitting
the path that would normally disable promotion for the resource, so this
needs to be changed manually to avoid layout desync on the swapchain

cc: mesa-stable

Part-of: <mesa/mesa!23035>
(cherry picked from commit 3c010319)

c60508ba

zink: disable unordered blits when swapchain images need aqcuire · 66ed3b4f

Mike Blumenkrantz authored 1 year ago and

Eric Engestrom committed 1 year ago

this is consistent with other cmdbuf reordering for blits

Fixes: 3a9f7d70 ("zink: implement unordered u_blitter calls")
Part-of: <mesa/mesa!23035>
(cherry picked from commit ab3914a1)

66ed3b4f

zink: add special-casing for (not) reordering certain image barriers · f38120d6

Mike Blumenkrantz authored 1 year ago and

Eric Engestrom committed 1 year ago

in a scenario where an ordered read op occurs for an image,
successive read-only barriers SHOULD be able to be promoted

...but they can't, because there isn't yet a mechanism for handling layout
transitions between the unordered cmdbuf and the ordered cmdbuf,
meaning that promoting e.g., a SHADER_READ_ONLY barrier after a TRANSFER_SRC
barrier will leave the image with the wrong layout for the transfer op:

TRANSFER_SRC(unordered) -> COPY(ordered) -> SHADER_READ_ONLY(unordered)

becomes

TRANSFER_SRC(unordered) -> SHADER_READ_ONLY(unordered) -> COPY(ordered)

ideally I'll get around to figuring this out at some point

affects:
dEQP-GLES31.functional.copy_image.non_compressed.viewclass_32_bits.r32i_r32i.texture2d_array_to_renderbuffer

Fixes: bf0af0f8 ("zink: move all barrier-related functions to c++")
Part-of: <mesa/mesa!23035>
(cherry picked from commit 9c8b6754)

f38120d6

zink: try update fb resource refs when starting new renderpass · 3c786a96

Mike Blumenkrantz authored 1 year ago and

Eric Engestrom committed 1 year ago

in the case where a draw is triggered after a flush, zink_update_descriptor_refs
will be called to set batch tracking for descriptors. this function also
handles refs for fb attachments, and everything is usually fine there

the problem with this approach is that tracking is no longer set on view
objects at renderpass begin, which makes them susceptible to early deletion
if a rp isn't started from a draw call

instead, apply batch tracking to fb attachment resources on renderpass
begin if the BATCH_CHANGED flag is set (need to rename this at some point)
in order to guarantee that the resource (object) lifetime will match the
cmdbuf runtime [since imageviews are now only freed upon batch completion]

fixes #9059

Fixes: f6bbd787 ("zink: remove batch tracking/usage from view types"
Part-of: <mesa/mesa!23132>
(cherry picked from commit 62961b17)

3c786a96

anv: fix push descriptor deferred surface state packing · 7a6aef60

Lionel Landwerlin authored 1 year ago and

Eric Engestrom committed 1 year ago


Yuzu is running into a segfault because it writes the push descriptor
twice with 2 different layouts, but without a draw/dispatch in
between.

First vkCmdPushDescriptorSetKHR() writes descriptor 0 & 1 with a
uniform buffer. We toggle the 2 first bits of
anv_descriptor_set::generate_surface_states.

Second vkCmdPushDescriptorSetKHR() writes descriptor 0 with uniform
buffer and descriptor 1 with an image view. The first bit of
anv_descriptor_set::generate_surface_states stays, but the second bit
was already set before and it should now be off.

When we finally flush the push descriptor, we try to generate a
surface state for descriptor 1, but there is no valid buffer view for
it, we access an invalid pointer and segfault.

This fix resets the anv_descriptor_set::generate_surface_states when
the descriptor layout changes.

Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Fixes: b49b18f0 ("anv: reduce BT emissions & surface state writes with push descriptors")
Reviewed-by: Tapani Pälli <tapani.palli@intel.com>
Part-of: <mesa/mesa!23156>
(cherry picked from commit cab7ba00)

7a6aef60

radv: fix radv_emit_userdata_vertex for vertex offset -1 · ab570dd3

Yiwei Zhang authored 1 year ago and

Eric Engestrom committed 1 year ago


-1 is a legit vertex offset upon vkCmdDrawIndexed and other cmds. This
change fixes to track last_vertex_offset with an additional valid bit.

Cc: mesa-stable
Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org>
Part-of: <mesa/mesa!23157>
(cherry picked from commit 4c8be22c)

ab570dd3

radv: disable IMAGE_USAGE_STORAGE with depth-only and stencil-only formats · d61e3764

Samuel Pitoiset authored 1 year ago and

Eric Engestrom committed 1 year ago


This shouldn't have been enabled at all. Depth-stencil formats were
accidentally disabled but not depth-only or stencil-only formats.

This doesn't seem allowed by DX12 and both AMD/NVIDIA don't enable it.

Cc: mesa-stable
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!23122>
(cherry picked from commit dda7400c)

d61e3764

radv: bump the global VRS image size to maximum supported FB dimensions · b8334c49

Samuel Pitoiset authored 1 year ago and

Eric Engestrom committed 1 year ago


Super sampling on a 4K screen could hit this. 16k seems pretty big
but this image is only created on RDNA2 and on-demand if VRS attachments
are used without depth-stencil attachments, which should be rare
enough to care.

Cc: mesa-stable
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!23105>
(cherry picked from commit 3adc9b67)

b8334c49

.pick_status.json: Update to 1f586f94 · 2913aae4
Eric Engestrom authored 1 year ago

2913aae4

util: add Pixel Game Maker MV workaround · bc19d9bd

Timothy Arceri authored 1 year ago and

Eric Engestrom committed 1 year ago

Closes: mesa/mesa#8918


Cc: mesa-stable
Reviewed-by: Tapani Pälli <tapani.palli@intel.com>
Part-of: <mesa/mesa!23095>
(cherry picked from commit 5be8acc1)

bc19d9bd

intel/compiler: Fix 64-bit ufind_msb, find_lsb, and bit_count · fb39004b

Kenneth Graunke authored 1 year ago and

Eric Engestrom committed 1 year ago


We only support 32-bit versions of ufind_msb, find_lsb, and bit_count,
so we need to lower them via nir_lower_int64.

Previously, we were failing to do so on platforms older than Icelake
and let those operations fall through to nir_lower_bit_size, which
used a callback to determine it should lower them for bit_size != 32.
However, that pass only emulates small bit-size operations by promoting
them to supported, larger bit-sizes (i.e. 16-bit using 32-bit).  It
doesn't support emulating larger operations (i.e. 64-bit using 32-bit).

So nir_lower_bit_size would just u2u32 the 64-bit source, causing us to
flat ignore half of the bits.

Commit 78a195f2 (intel/compiler: Postpone most int64 lowering to
brw_postprocess_nir) provoked this bug on Icelake and later as well,
by moving the nir_lower_int64 handling for ufind_msb until late in
compilation, allowing it to reach nir_lower_bit_size which broke it.

To fix this, we always set int64 lowering for these opcodes, and also
correct the nir_lower_bit_size callback to ignore 64-bit operations.

Cc: mesa-stable
Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Reviewed-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com>
Part-of: <mesa/mesa!23123>
(cherry picked from commit a2d384a5)

fb39004b

nir: Add find_lsb lowering to nir_lower_int64. · 7cc32b0e

Kenneth Graunke authored 1 year ago and

Eric Engestrom committed 1 year ago


Some GPUs can only handle 32-bit find_lsb.

Cc: mesa-stable
Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com>
Part-of: <mesa/mesa!23123>
(cherry picked from commit 9293d8e6)

7cc32b0e

anv: Fix ANV_BO_ALLOC_NO_LOCAL_MEM flag · 1b4720b3

José Roberto de Souza authored 1 year ago and

Eric Engestrom committed 1 year ago


VK_MEMORY_PROPERTY_DEVICE_LOCAL_BIT is also set in all memory types of
integrated GPUs.
This flag means that memory will be allocated in the most efficient
place for the GPU to access, which is true in integrated GPUs.

However, this was causing ANV_BO_ALLOC_WRITE_COMBINE to be set in
integrated GPUs in the block right below when allocating in the non-cached memory type.
But the comment only talks about lmem, so to still keep the write
combine behavior for iGPUs it was used VkMemoryPropertyFlags in mmap_calc_flags().

Additionally, this was causing anv_bo.has_implicit_ccs to always be
set, which could change the expected behavior of
anv_BindImageMemory2() in MTL.

Fixes: fbd32a04 ("anv: add a third memory type for LLC configuration") added a new heap
Fixes: 582bf4d9 ("anv: flag BO for write combine when CPU visible and potentially in lmem")
Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Signed-off-by: José Roberto de Souza <jose.souza@intel.com>
Part-of: <mesa/mesa!22483>
(cherry picked from commit a6c5746b)

1b4720b3

radv: reserve cmdbuf space in radv_flush_gfx2ace_semaphore() · 433e4e8f

Samuel Pitoiset authored 1 year ago and

Eric Engestrom committed 1 year ago


Fixes an assertion with test_amplification_shader in vkd3d-proton.

Cc: mesa-stable
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!23057>
(cherry picked from commit b83ce03a)

433e4e8f

radv: fix a sync issue with primitives generated query and NGG/legacy · d5541307

Samuel Pitoiset authored 1 year ago and

Eric Engestrom committed 1 year ago


On RDNA1&2, the driver needs to support both NGG and legacy for
primitives generated query because we can't know that before starting
queries.

To get the query pool results, we check the availability bit wrote by
the SAMPLE_STREAMOUTSTATS packet but the GDS copy was emitted after,
which means the availability bit might be TRUE before the GDS copy is
actually done.

Fix this by emitting the GDS copy before to ensure the availability is
TRUE for both results.

This fixes recent updates in
dEQP-VK.transform_feedback.primitives_generated_query.* because the
tests no longer wait for the fence.

Cc: mesa-stable
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!23080>
(cherry picked from commit 9ba41ed7)

d5541307

Admin message