Commits · lava-ci-t720-III · Tomeu Vizoso / mesa

Nov 15, 2019

0x400 · 5533293d
Tomeu Vizoso authored Nov 15, 2019

5533293d
TMP: Comment assert out · 579e5230
Tomeu Vizoso authored Nov 12, 2019

579e5230
rerun · c6e2d45a
Tomeu Vizoso authored Nov 08, 2019

c6e2d45a

gitlab-ci: Test Panfrost on T720 GPUs · 91604617

Tomeu Vizoso authored Oct 25, 2019



Work is starting on supporting the Mali T720 GPU, so test it on PINE64
H64 boards.

Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>

91604617

panfrost: White list the Mali T720 · cfb0b2f6

Tomeu Vizoso authored Oct 28, 2019



Support for this GPU is close to that of T760, so whitelist it now.

Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>

cfb0b2f6

panfrost: Make sure the shader descriptor is in sync with the GL state · 9747aee8

Tomeu Vizoso authored Nov 12, 2019



State was leaking from previous frames as we weren't updating the
descriptor in all cases.

Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>

9747aee8

pan/midgard: Prioritize texture registers · fa038239

Alyssa Rosenzweig authored Nov 13, 2019 and

Tomeu Vizoso committed Nov 15, 2019



On newer GPUs, this is a no-op. On older GPUs, this prevents needless
spilling since texture registers are shared with a subset of work
registers.

Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>

fa038239

pan/midgard: Disassemble with old pipeline always on T720 · 16796550
Alyssa Rosenzweig authored Nov 13, 2019 and Tomeu Vizoso committed Nov 15, 2019
```
Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
```
16796550
pan/midgard: Use texture, not textureLod, on early Midgard · 5c1c8329
Alyssa Rosenzweig authored Nov 11, 2019 and Tomeu Vizoso committed Nov 15, 2019
```
We have to disable the fixup.

Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
```
5c1c8329

pan/midgard: Fix vertex texturing on early Midgard · 15ac263a

Alyssa Rosenzweig authored Nov 11, 2019 and

Tomeu Vizoso committed Nov 15, 2019



We use a different set of texture registers, probably to save hardware.

Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>

15ac263a

pan/midgard: Generalize texture registers across GPUs · f77be5aa

Alyssa Rosenzweig authored Nov 11, 2019 and

Tomeu Vizoso committed Nov 15, 2019



Early Midgard uses a different set of texture registers; let's not
hardcode.

Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>

f77be5aa

panfrost: Multiply offset_units by 2 · 7d24cef2

Tomeu Vizoso authored Nov 13, 2019



Per the spec, the units passed to glPolygonOffset are to be multiplied
by an implementation-defined constant.

On Midgard, this constant seems to be 2.

Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>

7d24cef2

intel/perf: add EHL performance query support · c061185e

Lionel Landwerlin authored Oct 30, 2019



Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Acked-by: Rafael Antognolli <rafael.antognolli@intel.com>

c061185e

intel/dev: flag the Elkhart Lake platform · 39fd11a9

Lionel Landwerlin authored Oct 30, 2019

We'll use this for performance metrics which are different from ICL.

Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>

39fd11a9

gitlab-ci: update Piglit commit, update skips · 7a893a0d

Tapani Pälli authored Nov 15, 2019



Signed-off-by: Tapani Pälli <tapani.palli@intel.com>
Reviewed-by: Michel Dänzer <mdaenzer@redhat.com>

7a893a0d

mesa: allow bit queries for EXT_disjoint_timer_query · 1d970f15

Tapani Pälli authored Nov 12, 2019

Closes: mesa/mesa#2090


Signed-off-by: Tapani Pälli <tapani.palli@intel.com>
Reviewed-by: Marek Olšák <marek.olsak@amd.com>

1d970f15

radv: make sure to not clear the ds attachment after resolves · 41a1152c

Samuel Pitoiset authored Nov 06, 2019



To not overwrite the resolve if there is pending clear aspects,
same as color resolves.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>

41a1152c

radv: remove useless RADV_DEBUG=unsafemath debug option · 519d9b30

Samuel Pitoiset authored Nov 08, 2019



This option is useless and shouldn't be used at all.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>

519d9b30

llvmpipe: Check thread creation errors · 9a80b7fd

Nathan Kidd authored Nov 15, 2019



In the case of glibc, pthread_t is internally a pointer.  If
lp_rast_destroy() passes a 0-value pthread_t to pthread_join(), the
latter will SEGV dereferencing it.

pthread_create() can fail if either the user's ulimit -u or Linux
kernel's /proc/sys/kernel/threads-max is reached.

Choosing to continue, rather than fail, on theory that it is better to
run with the one main thread, than not run at all.

Keeping as many threads as we got, since lack of threads severely
degrades llvmpipe performance.

Signed-off-by: Nathan Kidd <nkidd@opentext.com>
Reviewed-by: Roland Scheidegger <sroland@vmware.com>

9a80b7fd

Nov 14, 2019

llvmpipe: use ppc64le/ppc64 Large code model for JIT-compiled shaders · 9c3be6d2

Ben Crocker authored Nov 13, 2019 and

Dave Airlie committed Nov 14, 2019

Large programs, e.g. gnome-shell and firefox, may tax the
addressability of the Medium code model once a (potentially unbounded)
number of dynamically generated JIT-compiled shader programs are
linked in and relocated.  Yet the default code model as of LLVM 8 is
Medium or even Small.

The cost of changing from Medium to Large is negligible:
- an additional 8-byte pointer stored immediately before the shader entrypoint;
- change an add-immediate (addis) instruction to a load (ld).

Testing with WebGL Conformance 
(https://www.khronos.org/registry/webgl/sdk/tests/webgl-conformance-tests.html)
yields clean runs with this change (and crashes without it).

Testing with glxgears shows no detectable performance difference.

Bugzilla: https://bugzilla.redhat.com/show_bug.cgi?id=1753327, 1753789, 1543572, 1747110, and 1582226

Closes: #223



Co-authored by: Nemanja Ivanovic <nemanjai@ca.ibm.com>, Tom Stellard <tstellar@redhat.com>

CC: mesa-stable@lists.freedesktop.org

Signed-off-by: Ben Crocker <bcrocker@redhat.com>

9c3be6d2

iris: Wrap iris_fix_edge_flags in NIR_PASS · 4242c572

Kenneth Graunke authored Nov 14, 2019



So nir_validate happens properly.  Unfortunately this means we have
to play the metadata song and dance, so walk over all impls and say
that we didn't hurt anything.

Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>

4242c572

iris: Properly move edgeflag_out from output list to global list · 39c23fd1

Kenneth Graunke authored Nov 14, 2019

When demoting it from an output to a global, we need to actually move
it to the correct list.  While here, we also refactor so it's clear
we aren't mutating the list while iterating.

Closes: mesa/mesa#2106


Fixes: f9fd04ac ("nir: Fix non-determinism in lower_global_vars_to_local")
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>

39c23fd1

mesa: Move compile of common Mesa core files to a static lib. · 790d0ebe

Emma Anholt authored Nov 12, 2019



We were compiling them twice, costing extra build time.  Reduces my
ccache-hot clean build time by a second (24.3s to 23.3s, 3 runs each).

The windows args are a little strange -- it's not clear to me that
they're actually used for building these files, but keep them in place
just in case, since we don't have a good windows CI story yet.  We
should want them on both gallium and classic regardless: Only osmesa
could be built for windows in classic, and classic OSMesa's scons
build defines these flags too.

Closes: #2052
Acked-by: Dylan Baker <dylan@pnwbakers.com>

790d0ebe

Appveyor: Quickly fix meson build. · cc758f12

Prodea Alexandru-Liviu authored Nov 14, 2019


As this required use of Python 3.8, mako module also had to be updated.

v2 - Unbind mako module version when using Meson.
Signed-off-by: Prodea Alexandru-Liviu <liviuprodea@yahoo.com>
Reviewed-by: Dylan Baker <dylan@pnwbakers.com>

cc758f12

intel/fs: Do not lower large local arrays to scratch on gen7 · 0904ee0c

Danylo Piliaiev authored Nov 12, 2019

On gen7 and earlier the scratch space size is limited to 12kB.
By enabling this optimization we may easily exceed this limit
without having any fallback.

arb_compute_shader/linker/bug-93840.shader_test crashes with
this lowering on IVB due to exceeding scratch size limit.

Closes: #2092


Fixes: 69244fc7
Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com>
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>

0904ee0c

util: Move gallium's PIPE_FORMAT utils to /util/format/ · 882ca6df

Emma Anholt authored Jun 27, 2019



To make PIPE_FORMATs usable from non-gallium parts of Mesa, I want to
move their helpers out of gallium.  Since u_format used
util_copy_rect(), I moved that in there, too.

I've put it in a separate directory in util/ because it's a big chunk
of related code, and it's not clear to me whether we might want it as
a separate library from libmesa_util at some point.

Closes: #1905
Acked-by: Marek Olšák <marek.olsak@amd.com>
Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>

882ca6df

gitlab-ci: auto-cancel CI runs when a newer commit is pushed to the same branch · ac78ca4b

Eric Engestrom authored Nov 12, 2019



Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>
Reviewed-by: Eric Anholt <eric@anholt.net>
Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Reviewed-by: Michel Dänzer <mdaenzer@redhat.com>

ac78ca4b

aco: Optimize out trivial code from uniform bools. · 9b8dc692

Timur Kristóf authored Nov 05, 2019



This should remove most of the excess code size that was
introduced by making all booleans per-lane.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>

9b8dc692

aco: Treat all booleans as per-lane. · 8995c0b3

Timur Kristóf authored Nov 04, 2019



Previously, instruction selection had two kinds of booleans:
1. divergent which was per-lane and stored in s2 (VCC size)
2. uniform which was stored in s1
Additionally, uniform booleans were made per-lane when they resulted
from operations which were supported only by the VALU.

To decide which type was used, we relied on the destination size,
which was not reliable due to the per-lane uniform bools, but it
mostly works on wave64.
However, in wave32 mode (where VCC is also s1) this approach
makes it impossible keep track of which boolean is uniform and
which is divergent.

This commit makes all booleans per-lane.
The resulting excess code size will be taken care of by the optimizer.

v2 (by Daniel Schürmann):
- Better names for some functions
- Use s_andn2_b64 with exec for nir_op_inot
- Simplify code due to using s_and_b64 in bool_to_scalar_condition

v3 (by Timur Kristóf):
- Fix several subgroups regressions

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>

8995c0b3

aco: use s_and_b64 exec to reduce uniform booleans to one bit · a1622c1a
Daniel Schürmann authored Nov 12, 2019 and Timur Kristóf committed Nov 14, 2019
```
Reviewed-By: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
```
a1622c1a

aco: Make sure not to mistakenly propagate 64-bit constants. · 94e35514

Timur Kristóf authored Nov 06, 2019



ACO's optimizer would try to propagate 64-bit constants, but
does so in such a way that wouldn't work due to how the 64-bit
constants are handled in the IR.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>

94e35514

aco: value number instructions using the execution mask · 9d3e0705

Daniel Schürmann authored Nov 11, 2019 and

Timur Kristóf committed Nov 14, 2019



This patch tries to give instructions with the same execution
mask also the same pass_flags and enables VN for SALU instructions
using exec as Operand.
This patch also adds back VN for VOPC instructions and removes VN for phis.

v2 (by Timur Kristóf):
- Fix some regressions.
v3 (by Daniel Schürmann):
- Fix additional issues

Reviewed-By: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>

9d3e0705

aco: check if SALU instructions are predeceeded by exec when calculating WQM needs · 8657eede
Daniel Schürmann authored Nov 11, 2019 and Timur Kristóf committed Nov 14, 2019
```
Reviewed-By: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
```
8657eede

ac: fix build with recent LLVM · ee9811a0

Samuel Pitoiset authored Nov 14, 2019

Build is broken since "Move CodeGenFileType enum to Support/CodeGen.h".

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>

ee9811a0

Revert "mesa: allow bit queries for EXT_disjoint_timer_query" · 94cb4916
Tapani Pälli authored Nov 14, 2019
```
This reverts commit 66d24a9e.

This commit made Mesa CI red because commit depends on a Piglit test
change.
```
94cb4916

nir: Fix non-determinism in lower_global_vars_to_local · f9fd04ac

Connor Abbott authored Oct 22, 2019



Using a hash-table walk means that variables will get inserted in
different orders on different runs. Just walk the list of globals
instead, even if some of them can't be turned into locals.

Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>
Reviewed-by: Eric Anholt <eric@anholt.net>

f9fd04ac

mesa/st: make sure we remove dead IO variables before handing NIR to backends · f512965b

Iago Toral authored Nov 13, 2019

Commit "1c2bf82d glsl: disable lower_fragdata_array() for NIR drivers"
disabled the GLSL IR lowering that turned gl_FragData from an array into a
collection of scalar outputs under the assumption that this was already being
handled properly elsewhere, however there are some corner cases where NIR
would fail to do this, leaving gl_FragData[] as an array variable. This can
break backends that assume that all their outputs will be scalar and use the
variable definitions from the shader to do their output setup, such as the
case of V3D.

At least one corner case was found in some Portal shaders from shader-db, where
NIR would optimize out the full body of a fragment shader. In this scenario,
the empty shader would keep the original array definition of gl_FragData[],
causing the backend to assert.

We need to do this late enough for it to be effective, since doing it in
st_nir_preprocess does not fix the original problem.

Closes: #2091

Fixes: 1c2bf82d ("glsl: disable lower_fragdata_array() for NIR drivers")
Reviewed-by: Marek Olšák <marek.olsak@amd.com>

f512965b

mesa: allow bit queries for EXT_disjoint_timer_query · 66d24a9e

Tapani Pälli authored Nov 12, 2019

Closes: #2090


Signed-off-by: Tapani Pälli <tapani.palli@intel.com>
Reviewed-by: Marek Olšák <marek.olsak@amd.com>

66d24a9e

Revert "dri_interface: add interface for EGL_EXT_image_flush_external" · 1a093a06

Tapani Pälli authored Nov 12, 2019



This reverts commit 75204784.

This series caused unexpected flickering artifacts with Iris driver on
Chrome OS and EGL_EXT_image_flush_external spec has not been published
yet.

Acked-by: Eric Engestrom <eric@engestrom.ch>
Acked-by: Kristian H. Kristensen <hoegsberg@google.com>

1a093a06

Revert "st/dri: assume external consumers of back buffers can write to the buffers" · 7951eb14

Tapani Pälli authored Nov 12, 2019



This reverts commit 1d1b4578.

This series caused unexpected flickering artifacts with Iris driver on
Chrome OS and EGL_EXT_image_flush_external spec has not been published
yet.

Acked-by: Eric Engestrom <eric@engestrom.ch>
Acked-by: Kristian H. Kristensen <hoegsberg@google.com>

7951eb14

Admin message