Commits · main · André Almeida / mesa

Apr 01, 2023

ac/nir: When task->mesh dispatch Y or Z are 0, also set X to 0. · 4de9a4b2

Timur Kristóf authored 1 year ago and

Marge Bot committed 1 year ago


AMD recommends doing this to speed up the CP when it processes
the draw ring entries. LLPC also does this.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!22211>

4de9a4b2

ac/nir: Store only lowest 8 bits for task draw ring DWORD3. · 4683b213

Timur Kristóf authored 1 year ago and

Marge Bot committed 1 year ago


When writing the draw ready bit, don't write the high 24 bits
of DWORD3, because that is used by the HW for something else
according to LLPC.

Cc: mesa-stable
Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!22211>

4683b213

aco: fix nir_var_shader_out barriers for task shaders · 6974e547

Rhys Perry authored 1 year ago and

Marge Bot committed 1 year ago


These will be used in a future commit.

Signed-off-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Timur Kristóf <timur.kristof@gmail.com>
Cc: mesa-stable
Part-of: <mesa/mesa!22211>

6974e547

freedreno: Support the disable_throttling=true driconf option · d698bf05

Rob Clark authored 1 year ago and

Marge Bot committed 1 year ago


At this point, mostly just to make it easier to disable throttling for
performance debugging.

Signed-off-by: Rob Clark <robdclark@chromium.org>
Part-of: <mesa/mesa!22224>

d698bf05

freedreno: Move driconf settings into sub-struct · 77a57788

Rob Clark authored 1 year ago and

Marge Bot committed 1 year ago


Organize all one of them in a single place before adding more.

Signed-off-by: Rob Clark <robdclark@chromium.org>
Part-of: <mesa/mesa!22224>

77a57788

freedreno: Avoid looping shader stages if nothing dirty · 8620b649

Rob Clark authored 1 year ago and

Marge Bot committed 1 year ago


We have corresponding global dirty bits for each of the per-stage dirty
bits.  We can use this to skip iterating over shader stages when there
is no per-stage dirty state to handle.

Signed-off-by: Rob Clark <robdclark@chromium.org>
Part-of: <mesa/mesa!22224>

8620b649

freedreno: Re-work dirty-resource tracking · 0a62a874

Rob Clark authored 1 year ago and

Marge Bot committed 1 year ago


If a resource is dirty but already tracked by the current batch, no need
to process it at draw time.

Note that the batch could change (ie. new fb state bound, etc) after the
check if we need resource dirty tracking, but in these cases all the
dirty-resource state is marked dirty.

Signed-off-by: Rob Clark <robdclark@chromium.org>
Part-of: <mesa/mesa!22224>

0a62a874

freedreno: Inline single-use helpers · 4c0fdef4

Rob Clark authored 1 year ago and

Marge Bot committed 1 year ago


bind_sampler_states() / set_sampler_views() have just a single caller.
So inline them.  Needed for next commit.

Signed-off-by: Rob Clark <robdclark@chromium.org>
Part-of: <mesa/mesa!22224>

4c0fdef4

freedreno: Extract out a helper · 7099f628
Rob Clark authored 1 year ago and Marge Bot committed 1 year ago
```
Signed-off-by: Rob Clark <robdclark@chromium.org>
Part-of: <mesa/mesa!22224>
```
7099f628

freedreno: Hoist dirty vars · 0408ddcd

Rob Clark authored 1 year ago and

Marge Bot committed 1 year ago


Prep to re-work how we track dirty-resource.

Signed-off-by: Rob Clark <robdclark@chromium.org>
Part-of: <mesa/mesa!22224>

0408ddcd

freedreno: Stop being too clever by half · 19a138ad

Rob Clark authored 1 year ago and

Marge Bot committed 1 year ago


This wasn't taking into account a change in corresponding bit in
writeable_bitmask, causing problem if an SSBO was first bound for
read, and then rebound for write, we wouldn't update the buffers
valid range.  Instead just drop the premature optimization.

Signed-off-by: Rob Clark <robdclark@chromium.org>
Part-of: <mesa/mesa!22224>

19a138ad

freedreno: Fix or/and'ing two BitmaskEnums · b123ee70

Connor Abbott authored 1 year ago and

Marge Bot committed 1 year ago


Previously when there was an & or | with two BitmaskEnums, the compiler
would try to cast the RHS and find a matching overload, but there were
many different casts (to the enum itself, to an integer, to a boolean,
etc.) each with a matching overload which meant that it couldn't pick
one and errored out due to an ambiguous overload. Fix this by
explicitly providing an overload that takes a BitmaskEnum on the RHS.
It has to also provide a BitmaskEnum output, so that subsequent
operators with the result on the LHS (e.g. when or'ing together three
BitmaskEnums without any parentheses tricks) also get the right
overload.

Signed-off-by: Rob Clark <robdclark@chromium.org>
Part-of: <mesa/mesa!22224>

b123ee70

nine: use separate register for aL emulation · 5825f9dd

Pavel Ondračka authored 2 years ago and

Marge Bot committed 1 year ago


NIR loop unrolling is only working if the loop counter is a scalar.
So keep the loop counter separate and move the aL emulation and
the aL increment to a new register.

This allows loop unrolling with vec4 backends where unconditional
scalarizing of phi nodes is undesirable, like for example r300.

Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com>
Reviewed-by: Axel Davy <davyaxel0@gmail.com>
Closes: mesa/mesa#7222
Part-of: <mesa/mesa!21243>

5825f9dd

Mar 31, 2023

rusticl/kernel: make use of cso info · ac993ae8

Karol Herbst authored 2 years ago and

Marge Bot committed 1 year ago


Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Part-of: <mesa/mesa!19855>

ac993ae8

panfrost: implement get_compute_state_info · c7dd3677

Karol Herbst authored 2 years ago and

Marge Bot committed 1 year ago


Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Part-of: <mesa/mesa!19855>

c7dd3677

panfrost: move max_thread_count and take reg_count into account · 87aeea20

Karol Herbst authored 2 years ago and

Marge Bot committed 1 year ago


We'll need it to report proper thread counts for OpenCL.

Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Part-of: <mesa/mesa!19855>

87aeea20

nvc0: implement get_compute_state_info · 3212ac46

Karol Herbst authored 2 years ago and

Marge Bot committed 1 year ago


Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Part-of: <mesa/mesa!19855>

3212ac46

nv50: implement get_compute_state_info · 52f03f63

Karol Herbst authored 2 years ago and

Marge Bot committed 1 year ago


Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Part-of: <mesa/mesa!19855>

52f03f63

iris: implement get_compute_state_info · c1c0362d

Karol Herbst authored 2 years ago and

Marge Bot committed 1 year ago


Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Part-of: <mesa/mesa!19855>

c1c0362d

lp: implement get_compute_state_info · 5fa297da

Karol Herbst authored 2 years ago and

Marge Bot committed 1 year ago


Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Part-of: <mesa/mesa!19855>

5fa297da

gallium: add get_compute_state_info · 6305d1cb

Karol Herbst authored 2 years ago and

Marge Bot committed 1 year ago


Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Part-of: <mesa/mesa!19855>

6305d1cb

rusticl/kernel: set has_variable_shared_mem on the nir · 87147e2b

Karol Herbst authored 2 years ago and

Marge Bot committed 1 year ago


Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Part-of: <mesa/mesa!19855>

87147e2b

nir: track existence of variable shared memory · 0e5722cd

Karol Herbst authored 2 years ago and

Marge Bot committed 1 year ago


Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Part-of: <mesa/mesa!19855>

0e5722cd

Revert "d3d12: Honor suggested driver profile/level for H264/HEVC encode" · 1995762d
Sil Vilerino authored 1 year ago and Marge Bot committed 1 year ago
```
This reverts commit 37652da6.

Part-of: <mesa/mesa!22239>
```
1995762d

aco: don't optimize s_or_b64(v_cmp_u_f32(a, b), cmp(a, a)) · 0f60c18f

Rhys Perry authored 1 year ago and

Marge Bot committed 1 year ago


Signed-off-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Georg Lehmann <dadschoorse@gmail.com>
Part-of: <mesa/mesa!22214>

0f60c18f

docs: add a few vulkan extensions supported by multiple drivers · 46e7a127
Charlie Birks authored 3 years ago and Marge Bot committed 1 year ago
```
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
Reviewed-by: Eric Engestrom <eric@igalia.com>
Part-of: <mesa/mesa!11445>
```
46e7a127
radv/ci: Update ray tracing pipeline fail/skip lists · 7b837531
Konstantin Seurer authored 1 year ago and Marge Bot committed 1 year ago
```
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!22018>
```
7b837531

radv: fix binding raytracing/compute pipelines · e98aded5

Samuel Pitoiset authored 1 year ago and

Marge Bot committed 1 year ago


If a compute pipeline is bound after a raytracing pipeline, the
computes shader slot (aka RT prolog) will be overwritten.

To fix this, move the RT prolog outside of the compute shader slot.

Fixes: d109362a ("radv: copy bound shaders to the cmdbuf state")
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!22235>

e98aded5

radv: add the raygen shader BO to the cmdbuf list · 56493a5f
Samuel Pitoiset authored 1 year ago and Marge Bot committed 1 year ago
```
Found by inspection.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!22235>
```
56493a5f

ac/nir/ngg: Slightly improve attribute ring offset calculation. · 115958b6

Timur Kristóf authored 1 year ago and

Marge Bot committed 1 year ago


Inspired by Nicolai Hähnle's commit in LLPC.
Instead of using a SALU instruction to add to the scalar
offset, rely on the buffer swizzling and use constant offset.

Fossil DB stats on GFX1100:

Totals from 47910 (35.51% of 134913) affected shaders:
CodeSize: 87927612 -> 86968136 (-1.09%)
Instrs: 17584007 -> 17440094 (-0.82%)
Latency: 97232173 -> 97126311 (-0.11%)
InvThroughput: 9904586 -> 9905288 (+0.01%); split: -0.02%, +0.02%
VClause: 544430 -> 542566 (-0.34%)

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Marek Olšák <marek.olsak@amd.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!22227>

115958b6

radv: Use radv_get_shader to get vertex shader when binding pipeline. · 61003e36

Timur Kristóf authored 1 year ago and

Marge Bot committed 1 year ago

The shaders[MESA_SHADER_VERTEX] can be NULL for merged shaders.

Fixes: b2ac40e7
Closes: mesa/mesa#8749


Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!22229>

61003e36

radv: configure PA_SC_MODE_CNTL_1 during cmdbuf recording · f8558d1f

Samuel Pitoiset authored 1 year ago and

Marge Bot committed 1 year ago


Two graphics pipeline parameters need to be copied to the cmdbuf
state.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!22218>

f8558d1f

radv: set PS_ITER_SAMPLE(1) for sample shading during cmdbuf recording · 66da73e8
Samuel Pitoiset authored 1 year ago and Marge Bot committed 1 year ago
```
This shouldn't be configured in the pipeline.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!22218>
```
66da73e8

radv: copy db_render_control to the cmdbuf state · b750fe4c

Samuel Pitoiset authored 1 year ago and

Marge Bot committed 1 year ago


This register is only used for meta operations.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!22218>

b750fe4c

iris: Implement Xe version of bo_madvise() and bo_set_caching() · e6c9b6ed

José Roberto de Souza authored 2 years ago and

Marge Bot committed 1 year ago


Signed-off-by: José Roberto de Souza <jose.souza@intel.com>
Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <mesa/mesa!22060>

e6c9b6ed

iris: Place scanout buffers only into lmem for discrete GPUs · c10ff197

Maarten Lankhorst authored 2 years ago and

Marge Bot committed 1 year ago


Signed-off-by: Maarten Lankhorst <maarten.lankhorst@linux.intel.com>
Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <mesa/mesa!22060>

c10ff197

iris: Handle allocation of scanout buffers in Xe · d72705ce

José Roberto de Souza authored 2 years ago and

Marge Bot committed 1 year ago


Bos that will be scanout in display need to be allocated with
flags = XE_GEM_CREATE_FLAG_SCANOUT in Xe and that implies to different
caching rules for this buffer.

So here not allowing to get scanout buffer from cache or allow it
to be placed in a cache bucket for reuse.

Signed-off-by: José Roberto de Souza <jose.souza@intel.com>
Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <mesa/mesa!22060>

d72705ce

iris: Handle allocation of exported buffers in Xe kmd · ccffcec0

José Roberto de Souza authored 2 years ago and

Marge Bot committed 1 year ago


Bos that will be exported need to be allocated with vm_id = 0 in Xe,
so don't try to get a bo from cache that was allocated with a
valid vm_id.

Signed-off-by: José Roberto de Souza <jose.souza@intel.com>
Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <mesa/mesa!22060>

ccffcec0

iris: Add BO_ALLOC_SHARED · 41ddecc8

José Roberto de Souza authored 2 years ago and

Marge Bot committed 1 year ago


Xe KMD requires special handling for exported buffers during creation.

Signed-off-by: José Roberto de Souza <jose.souza@intel.com>
Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <mesa/mesa!22060>

41ddecc8

anv: Use the new vk_device_memory base struct · 4b0b75c2
Faith Ekstrand authored 1 year ago and Marge Bot committed 1 year ago
```
Reviewed-by: Lina Versace <lina@kiwitree.net>
Part-of: <mesa/mesa!22038>
```
4b0b75c2

Admin message

Admin message