Commits · main · Piotr Kocia / mesa

Aug 23, 2023

asahi: Fix shader stage dirtying · 4b84e769

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


Now this is actually doing what I expect. drawoverhead #1 score more than
doubles (6091->13375).

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

4b84e769

asahi: Dirty the shader stage when the shader changes · bb663b85

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


We need to re-emit all descriptors in this case for correctness. Avoids
regressions from the following commit.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

bb663b85

asahi: Dirty track VBOs + blend const separately · 581514d9
Alyssa Rosenzweig authored 1 year ago and Marge Bot committed 1 year ago
```
We're staging everything anyway.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>
```
581514d9
asahi: Use proper dirty tracking for VBOs · 24238cc5
Alyssa Rosenzweig authored 1 year ago and Marge Bot committed 1 year ago
```
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>
```
24238cc5
asahi: Use finer dirty tracking for blend constant · 0a5ca3f3
Alyssa Rosenzweig authored 1 year ago and Marge Bot committed 1 year ago
```
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>
```
0a5ca3f3

asahi: Decouple sysval lowering from uniform assignment · d6ca887f

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago

For merging shader states, we'll need to lower sysvals separately for each
shader but assign uniforms together for the final merged shader. The easiest way
to do that is to decouple the lowering of sysvals to driver uniform reads, from
the assignment of driver uniform reads to actual uniform registers.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

d6ca887f

asahi: Put unuploaded uniforms on the batch · 17563210
Alyssa Rosenzweig authored 1 year ago and Marge Bot committed 1 year ago
```
Less copying needed this way.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>
```
17563210
asahi: Extract sampler upload · 871d97f7
Alyssa Rosenzweig authored 1 year ago and Marge Bot committed 1 year ago
```
Dirty track it.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>
```
871d97f7

asahi: Add real per-stage dirty flags · 9fa5dec7

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


Instead of just using ~0 as a stub todo.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

9fa5dec7

asahi: Upload a single draw_uniforms per draw · 9a604789

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


Not per stage per draw. This is less frequent.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

9a604789

asahi: Extract agx_upload_textures · 4717b08f

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


By uploading textures ahead-of-time, we can upload uniforms ahead-of-time too.
This will also allow some overhead shaving optimizations, I guess.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

4717b08f

asahi: Collapse grid_info · 0e6cb6d8
Alyssa Rosenzweig authored 1 year ago and Marge Bot committed 1 year ago
```
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>
```
0e6cb6d8
asahi: Split out per-stage sysvals · b049b1c9
Alyssa Rosenzweig authored 1 year ago and Marge Bot committed 1 year ago
```
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>
```
b049b1c9

asahi: Add sysval tables for each shader stage · 31afce2f

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


So we can model the descriptors of each shader stage independently, as required
for merged shaders.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

31afce2f

asahi: Move UBO lowering into GL driver · 5189bae5

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


In Vulkan, UBOs are lowered by nir_lower_explicit_io, and the ubo_base_agx
sysval is unused (since it doesn't handle descriptor sets). That makes the UBO
lowering GL-only and hence belongs with the GL driver rather than the compiler.
This lets us delete the ubo_base_agx sysval.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

5189bae5

nir,asahi: Remove texture_base_agx · 1d77fb96

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago

Doing a descriptor crawl with binding tables requires a real binding table in
the shader, which won't work for VK or merged shader stages in GL. Instead,
let's lower anything that needs a crawl to bindless in the driver, so the
compiler code doesn't need to know anything about descriptor binding models.
That gets rid of the texture_base_agx sysval, which is problematic when there
are multiple descriptor sets worth of textures.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

1d77fb96

agx: Add helper returning if a descriptor crawl is needed · cd25f753

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago

For agx_nir_lower_texture to lower to a descriptor crawl, the driver needs to
make sure the address of the descriptor is available. This means a slightly
different code path should be used in the driver. Rather than the drivers
needing to know what exactly will be lowered, add a helper in the same file as
agx_nir_lower_texture that returns whether descriptor-based lowering will be
needed so the driver can act appropriately.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

cd25f753

agx: Do some texture lowering early · 1e118627

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


We want to make the implicit txs in operations explicit before lower_bindings so
lower_bindings knows to force bindless.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

1e118627

asahi: Add missing LOD source for agx_meta's txfs · 6e1bdc12

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


These would be inserted by nir_lower_tex anyway, but we shouldn't be relying on
that behaviour for the meta shaders when we can just create the correct thing
from the start.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

6e1bdc12

agx: Do not fence write-only images · 176484d7
Alyssa Rosenzweig authored 1 year ago and Marge Bot committed 1 year ago
```
Reduces fencing significantly.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>
```
176484d7
agx/fence_images: Use intrinsics_pass · d49ed63d
Alyssa Rosenzweig authored 1 year ago and Marge Bot committed 1 year ago
```
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>
```
d49ed63d

asahi: Add get_query_address helper · d42bb650

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


This is the counterpart of get_oq_index for non-occlusion hardware queries.
These are not tracked with occlusion queries, since occlusion query allocations
are limited, and they are not based on indexing but rather general
batch-allocated space.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

d42bb650

asahi: Add non-occlusion query tracking · a620e86f

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


For other GPU queries, handled similarly.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

a620e86f

asahi: Sync when beginning a query · 9845814c

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


Otherwise batch->writer might be non-null. Fixes Piglit occlusion_query_conform
(which I think regressed when we added proper syncing).

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

9845814c

asahi: Only touch batch->occlusion_queries for occlusion · a13f2332

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


We will soon have other types of queries with non-null writers.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

a13f2332

asahi: Refactor agx_get_query_result · dfde9345

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


In preparation for other types of GPU queries.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

dfde9345

asahi: Simplify occlusion query batch tracking · e5dd0536

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


Yes, this means we now lie to the app. There's nothing more in the spirit of
dumb OpenGL features than lying!

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

e5dd0536

asahi: Generalize query logic · e72facab

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


We will need to do the same flushing dance for non-occlusion GPU queries.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

e72facab

agx: Use 16-bit reg for pixel_coord · 542a317a

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


Mistake during IR translation, this is 16-bit in NIR.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

542a317a

asahi: Preserve atomic ops when rewriting image to bindless · 58efa64c

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


Bug fix on its own, and prevents regressions from using bindless more.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

58efa64c

agx: Clear image_array after lowering · 8ae3eebb

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


We lower to access to a non-array 2D image, so we need to update the image_array
flag when we lower or otherwise we get an incorrect 2D Array store to a 2D image
which the hardware doesn't want.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

8ae3eebb

agx: Clear sample count after lowering MSAA · c8ea02a8
Alyssa Rosenzweig authored 1 year ago and Marge Bot committed 1 year ago
```
Pedantic.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>
```
c8ea02a8

asahi: Pass layer stride in pixels, not elements · a51c3f63

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


We do all the math in pixels and only multiply by the sample count at the end,
meaning the layer stride needs to be in terms of pixels (not samples) for
correct addressing of multisample array images in our texture lowering. This is
particularly used for lowering the multisample array stores we get from eMRT
with multisampled layered framebuffers.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

a51c3f63

asahi: Use local_size from compiler directly · 486fb759

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


This avoids an unnecessary trip through agx_uncompiled_shader.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

486fb759

asahi: Report local_size from compiler · 6247e617

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


So we can add more shared in the compiler.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

6247e617

asahi/decode: Turn assert into error · 5b3f4cf6

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


To allow us to debug broken fetches.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

5b3f4cf6

asahi: Advertise OpenGL ES 3.1! · 6aa1cf6e
Alyssa Rosenzweig authored 1 year ago and Marge Bot committed 1 year ago
```
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>
```
6aa1cf6e

agx: Implement imul_high · c8b44eb4

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


Like umul_high. Fixes
dEQP-VK.spirv_assembly.instruction.compute.mul_extended.signed_16bit

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

c8b44eb4

agx: Convert 8-bit comparisons · cf12429c

Alyssa Rosenzweig authored 1 year ago and

Marge Bot committed 1 year ago


Fixes dEQP-VK.spirv_assembly.type.vec3.i8.slessthan_frag

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>

cf12429c

agx: Handle b2i8 · 72231b04
Alyssa Rosenzweig authored 1 year ago and Marge Bot committed 1 year ago
```
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Part-of: <mesa/mesa!24847>
```
72231b04

Admin message

Admin message