Commits · ci-rust · Emma Anholt / mesa

Oct 12, 2020
- DO NOT MERGE: add a hello-world under src/ to test the rust integration · ea683d8a
  Emma Anholt authored 4 years ago
  
  ea683d8a
- ci: Add support for building rust code, with linting before merge. · 6009fdc7
  Emma Anholt authored 4 years ago
  
  It's typical in rust code to maintain the standard style, as set by rustfmt. That way, there can be fewer style arguments and nitpicking, since the tool just tells you.
  6009fdc7
Oct 09, 2020

CI: build our own spirv tools · b0df97b5

Dave Airlie authored 4 years ago


This causes a lot of hiccups on the CL tests, but I've got most of
them fixed in another MR in pieces.

This should at least give a much more realistic baseline.

v2: use script in both places

Reviewed-by: Michel Dänzer <mdaenzer@redhat.com>
Part-of: <mesa/mesa!7073>

b0df97b5

ci: fix deqp clone + fetch · d166188b

Dave Airlie authored 4 years ago


This was taking > 10 minutes and I got bored. Don't do a depth 1 fetch
in the first place just to do a proper fetch later.

Acked-by: Eric Anholt <eric@anholt.net>
Acked-by: Daniel Stone <daniels@collabora.com>
Part-of: <mesa/mesa!7073>

d166188b

disk_cache: build option for disabled-by-default · 5de56937

John Bates authored 4 years ago


On some systems it is problematic to have the shader cache enabled
by default. This adds a build option to support the disk cache but
keep it disabled unless the environment variable
MESA_GLSL_CACHE_DISABLE=false is set.

For example, on Chrome OS, Chrome already has its own shader
disk cache implementation so it disables the mesa feature. Tests
do not want the shader disk cache enabled because it can cause
inconsistent performance results and the default 1GB for the
disk cache could lead to problems that require more effort to
work around. The Mesa shader disk cache is useful for VMs though,
where it is easy to configure the feature with environment
variables. With the current version of Mesa, Chrome OS would need
to have a system-wide environment variable to disable the disk
cache everywhere except where needed. It is more elegant to just build
Mesa with the cache feature disabled by default.
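
The resulting decision can be modelled roughly as below. This is a minimal sketch, not Mesa code: the function name `disk_cache_enabled` and the `disabled_by_default` parameter are hypothetical stand-ins for the new build option, and the accepted true/false spellings are an assumption. Only the variable name MESA_GLSL_CACHE_DISABLE comes from the commit message.

```python
def disk_cache_enabled(env, disabled_by_default):
    """Return True if the shader disk cache should be used.

    env: mapping of environment variables (for example os.environ).
    disabled_by_default: hypothetical stand-in for the new build option.
    """
    value = env.get("MESA_GLSL_CACHE_DISABLE")
    if value is None:
        # Variable unset: the build-time default decides.
        return not disabled_by_default
    value = value.strip().lower()
    # Assumed spellings; the real parser may accept others.
    if value in ("1", "true", "yes"):
        return False
    if value in ("0", "false", "no"):
        return True
    # Unrecognised value: fall back to the build-time default.
    return not disabled_by_default
```

With the option set, a VM that wants the cache would export MESA_GLSL_CACHE_DISABLE=false, while everything else on the system gets no cache without any system-wide variable.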

Reviewed-by: Rob Clark <robdclark@chromium.org>
Reviewed-by: Eric Anholt <eric@anholt.net>
Part-of: <mesa/mesa!6967>

5de56937

radv: use radv_optimize_nir() less in radv_link_shaders() · 8e981453

Rhys Perry authored 4 years ago


fossil-db (Navi):
Totals from 11 (0.01% of 137413) affected shaders:
CodeSize: 99372 -> 99480 (+0.11%)
Instrs: 19119 -> 19110 (-0.05%)
Cycles: 222144 -> 222000 (-0.06%)

Signed-off-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!6891>

8e981453

radv: move optimizations in shader_compile_to_nir() to after io_to_scalar · 55254f24

Rhys Perry authored 4 years ago


This results in at least one less radv_optimize_nir() iteration.

No fossil-db changes.

Signed-off-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!6891>

55254f24

nir: return progress from nir_lower_io_to_scalar_early · 5f2671bc

Rhys Perry authored 4 years ago


Signed-off-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!6891>

5f2671bc

panfrost: Move the blend shader cache at the context level · fd4d0b44

Boris Brezillon authored 4 years ago


Blend shaders can be shared among blend states, so let's move the blend
shader one level up so we don't have to re-create/re-compile shaders
when another blend state already asked for it.
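
The idea of sharing compiled shaders one level up can be sketched as follows. This is an illustrative model only: `Context`, `BlendState`, `get_blend_shader`, and the cache key (equation plus format) are hypothetical names and do not mirror the panfrost data structures.

```python
class Context:
    """Hypothetical stand-in for a driver context owning the shared cache."""

    def __init__(self):
        self.blend_shaders = {}  # key -> compiled shader
        self.compile_count = 0   # only here to make the sharing visible

    def get_blend_shader(self, key):
        # key is any hashable description of the shader variant.
        shader = self.blend_shaders.get(key)
        if shader is None:
            self.compile_count += 1
            shader = f"binary for {key}"  # placeholder for a real compile
            self.blend_shaders[key] = shader
        return shader


class BlendState:
    """Hypothetical blend state that asks its context for shaders."""

    def __init__(self, ctx, equation):
        self.ctx = ctx
        self.equation = equation

    def shader(self, fmt):
        return self.ctx.get_blend_shader((self.equation, fmt))
```

Two blend states with the same equation and format then resolve to one cached entry instead of compiling twice.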

Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Part-of: <mesa/mesa!7066>

fd4d0b44

panfrost: Get rid of the constant patching done on blend shader binaries · a5005c34

Boris Brezillon authored 4 years ago


When constants are used in the blend equation we simply recompile the
shader.

Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Part-of: <mesa/mesa!7066>

a5005c34

panfrost: Let compile_blend_shader() allocate the blend shader object · c9739941

Boris Brezillon authored 4 years ago


This way we avoid an extra copy in panfrost_get_blend_shader().
Note that the allocation is attached to the blend state object
which simplifies the delete_blend_state() path.

Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Part-of: <mesa/mesa!7066>

c9739941

panfrost: Don't leak NIR blend shaders · dbc33e88

Boris Brezillon authored 4 years ago


Right now we create shaders that are not attached to any memory
context, leading to memory leaks. Ideally, we should free the NIR
shader as soon as we've turned it into a binary, but there's no
function to explicitly destroy a shader. Let's attach those to the blend
state so they get destroyed when this state is freed.

Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Part-of: <mesa/mesa!7066>

dbc33e88

panfrost: Allocate blit_blend with ralloc() · 8a5b885c

Boris Brezillon authored 4 years ago


This way we can use blend states as memory context which will help
simplify the blend shader creation/destruction logic.

Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Part-of: <mesa/mesa!7066>

8a5b885c

panfrost: Pass compile arguments through a struct · 0a74a04b

Boris Brezillon authored 4 years ago


So we can extend it more easily without having to patch all callers.

Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Part-of: <mesa/mesa!7066>

0a74a04b

panfrost: Move the blend constant mask extraction out of make_fixed_blend_mode() · 78ec5225

Boris Brezillon authored 4 years ago


This way we can get a constant mask for the blend shader case too.

Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Part-of: <mesa/mesa!7066>

78ec5225

panfrost: Constify the rt_fmts arg passed to pan_lower_framebuffer() · 4441e803

Boris Brezillon authored 4 years ago


Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Part-of: <mesa/mesa!7066>

4441e803

radv: Set fce metadata correctly on DCC initialization. · da132d80

Bas Nieuwenhuizen authored 4 years ago


The fce metadata can always be set to false as we don't care about
the compressed clear color.

Avoiding useless fast clear eliminations improves basemark performance by
1%-1.5%.

Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!7005>

da132d80

aco/ngg: Calculate workgroup size of NGG shaders. · 5ae36568

Timur Kristóf authored 4 years ago


Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

5ae36568

aco/ngg: Allocate NGG GS space early for const vertex/primitive counts. · 61280bb4

Timur Kristóf authored 4 years ago


Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

61280bb4

aco/ngg: Use more efficient LDS layout to help reduce bank conflicts. · e8a0409d

Timur Kristóf authored 4 years ago


The LLVM backend has a trick which helps reduce LDS bank conflicts
by swizzling the LDS address where each vertex is emitted.
This commit implements the same thing for ACO.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

e8a0409d

radv/aco: Enable NGG GS by default. · 9bf92d43

Timur Kristóf authored 4 years ago


ACO NGG GS now supports everything we need except streamout
(aka transform feedback), but we don't use NGG anyway when
streamout is needed.

Also add a note to the new features txt.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <mesa/mesa!6964>

9bf92d43

aco/ngg: Add shader query support to NGG GS. · dd737198

Timur Kristóf authored 4 years ago


In each GS thread, we calculate the number of "real" primitives that
were emitted (points, lines, triangles, not strips). Then we
accumulate the number of "real" primitives emitted by the
entire threadgroup in GDS.
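
The counting can be illustrated with a small model. It assumes the usual strip decomposition (a strip of n vertices yields n - 2 triangles, n - 1 lines, or n points); the function names are hypothetical and the plain sum merely stands in for the accumulation in GDS.

```python
def real_primitive_count(vertices_per_strip, verts_per_prim):
    """Count primitives one thread emitted, after decomposing strips.

    vertices_per_strip: vertex count of each strip the thread emitted.
    verts_per_prim: 1 for points, 2 for lines, 3 for triangles.
    """
    return sum(max(n - (verts_per_prim - 1), 0) for n in vertices_per_strip)


def threadgroup_primitive_count(per_thread_strips, verts_per_prim):
    # Stands in for the accumulation into GDS: one sum over all threads.
    return sum(real_primitive_count(s, verts_per_prim) for s in per_thread_strips)
```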

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

dd737198

aco/ngg: Place workgroup barrier outside control flow for NGG GS. · df62c8fb

Timur Kristóf authored 4 years ago


Merged shaders have a workgroup barrier which makes sure that
the first half is completed in every wave before the 2nd half
is started.

This barrier is located in divergent control flow, so that waves
that don't have any invocations in the 2nd half can finish as early
as possible. This is problematic for NGG GS because it has more
workgroup barriers after the 2nd half.

So, for NGG GS we need to put the barrier outside
control flow because otherwise the waves that have 0 GS threads
won't be able to wait for the waves which have non-zero GS threads.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

df62c8fb

aco/ngg: Implement NGG GS output. · 1129575d

Timur Kristóf authored 4 years ago


We store emitted GS vertices in LDS.
Then, at the end of the shader, the emitted vertices are compacted
and each thread loads a single vertex from LDS in order to export
a primitive as needed, and the vertex attributes.

The reason this is done is because there is an impedance mismatch
between how API GS and the NGG HW work. API GS can emit an arbitrary
number of vertices and primitives in each thread, but NGG HW can only
export one vertex per thread.
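
A simplified model of the compaction step is sketched below. It is not the ACO implementation: the function names are hypothetical, and Python lists stand in for the per-thread vertex storage and the compacted region of LDS.

```python
def compact_vertices(emitted_per_thread):
    """Pack each thread's emitted vertices into one dense list.

    emitted_per_thread[t] is the list of vertices GS thread t emitted.
    """
    compacted = []
    for vertices in emitted_per_thread:
        compacted.extend(vertices)
    return compacted


def export_assignments(emitted_per_thread):
    """Map each export thread index to the single vertex it loads."""
    return dict(enumerate(compact_vertices(emitted_per_thread)))
```

Here a thread that emitted three vertices and a thread that emitted none end up as three export threads, each handling exactly one vertex.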

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

1129575d

aco/ngg: Implement workgroup reduce / exclusive scan for NGG GS. · 62b5012e

Timur Kristóf authored 4 years ago


This function calculates two things at once:

1. The total number of vertices emitted by the threadgroup.
2. Exclusive scan of emitted vertex count across the threadgroup.
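
The two results listed above can be sketched with a sequential loop. This only shows what is computed, assuming one count per thread; the hardware version works in parallel across waves, and the function name is hypothetical.

```python
def workgroup_reduce_and_exclusive_scan(counts):
    """Return (total, offsets) for per-thread emitted vertex counts.

    offsets[i] is the sum of counts[0..i-1], i.e. where thread i's
    vertices would start in a densely packed array.
    """
    offsets = []
    running = 0
    for c in counts:
        offsets.append(running)  # exclusive: excludes thread i's own count
        running += c
    return running, offsets
```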

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

62b5012e

aco/ngg: Create LDS layout for NGG GS. · c29e288f

Timur Kristóf authored 4 years ago


For NGG GS, we need to store the following in LDS:

1. The ESGS ring, similarly to legacy ESGS.
2. Emitted vertices from the GS threads.
3. Temporary space used by the workgroup scan.
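
One way to picture the three regions is as consecutive byte ranges. The sketch below assumes they are laid out back to back in the listed order with no alignment padding, which the commit message does not state; the function name, region names, and sizes are all hypothetical.

```python
def ngg_gs_lds_layout(esgs_ring_bytes, emitted_vertex_bytes, scratch_bytes):
    """Return (offset, size) per region, assuming back-to-back placement."""
    layout = {}
    offset = 0
    for name, size in (
        ("esgs_ring", esgs_ring_bytes),
        ("emitted_vertices", emitted_vertex_bytes),
        ("scan_scratch", scratch_bytes),
    ):
        layout[name] = (offset, size)
        offset += size
    layout["total"] = offset  # total LDS bytes needed under this assumption
    return layout
```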

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

c29e288f

aco/ngg: Setup NGG GS. · 2680329f

Timur Kristóf authored 4 years ago


Make it possible for ACO to recognize when to use HW NGG GS.
Also add a few notes about the various GS stages in the comments.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

2680329f

aco/ngg: Allow NGG GS to create VS exports. · 9c3d8404

Timur Kristóf authored 4 years ago


NGG GS needs to use the same instructions to export vertex
attributes at the end.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

9c3d8404

aco/ngg: Allow NGG GS to load per-vertex GS inputs. · b67878f3

Timur Kristóf authored 4 years ago


They work the same way as in legacy GS, so we can reuse that.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

b67878f3

aco/ngg: Allow NGG GS to store ES outputs. · 8f25d9f8

Timur Kristóf authored 4 years ago


We can reuse the existing ES output code.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

8f25d9f8

aco/ngg: Clean up and reorganize NGG VS/TES code. · b57b1a06

Timur Kristóf authored 4 years ago


Make the NGG VS/TES code easier to follow, give better names to
some functions and make ngg_nogs_early_prim_export a variable.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

b57b1a06

aco/ngg: Make primitive export packing less prone to error. · 3645a310

Timur Kristóf authored 4 years ago


Use lshl_or instead of lshl_add, which makes it more robust in
handling -1 and -2 indices which will now just become null
exports, which is what we want.
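
The difference can be reproduced with 32-bit arithmetic. The sketch below assumes that vertex indices are packed 10 bits apart and that bit 31 marks a null primitive; both are assumptions about the export format and are not stated in the commit message. The helper names are hypothetical.

```python
MASK32 = 0xFFFFFFFF
NULL_PRIM_BIT = 1 << 31  # assumed position of the "null primitive" flag


def pack_with(op, indices):
    """Pack vertex indices at bit offsets 0, 10, 20 using op(shifted, acc)."""
    acc = indices[0] & MASK32
    for i, idx in enumerate(indices[1:], start=1):
        shifted = ((idx & MASK32) << (10 * i)) & MASK32
        acc = op(shifted, acc) & MASK32  # emulate 32-bit wraparound
    return acc


def lshl_add(shifted, acc):
    return shifted + acc


def lshl_or(shifted, acc):
    return shifted | acc
```

With a first index of -1, addition can carry out of bit 31 and clear the flag, while OR keeps every bit set, so the packed value stays a null export. For valid indices both variants produce the same result.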

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

3645a310

aco/ngg: Refactor ngg_emit_prim_export in preparation for NGG GS. · 0bfe0495

Timur Kristóf authored 4 years ago


Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

0bfe0495

aco/ngg: Refactor gs_alloc_req in preparation for NGG GS. · b08ced08

Timur Kristóf authored 4 years ago


Previously, this function inferred the vertex and primitive counts
from the gs_tg_info shader argument, but in case of NGG GS, they will
need to be calculated at runtime.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

b08ced08

aco: Add wave-specific opcode for s_lshl and s_flbit. · ecfabfd6

Timur Kristóf authored 4 years ago


Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

ecfabfd6

aco: Optimize thread_id_in_threadgroup when there is just one wave. · 57d87992

Timur Kristóf authored 4 years ago


Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

57d87992

aco: Use thread_id_in_threadgroup helper for ES outputs. · 5e31fb49

Timur Kristóf authored 4 years ago


Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

5e31fb49

aco: Extract thread_id_in_threadgroup to a separate function. · 924f816f

Timur Kristóf authored 4 years ago


Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

924f816f

aco: Extract lanecount_to_mask to a separate function. · b1964ad4

Timur Kristóf authored 4 years ago


Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

b1964ad4

aco: Clarify missing export error message in assembler. · 0b8e7be0

Timur Kristóf authored 4 years ago


Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <mesa/mesa!6964>

0b8e7be0
