Commits · 19.2-branchpoint · nina / mesa

Aug 20, 2019

lima/ppir: use ra_get_best_spill_node to select spill node · 71fb721c

Erico Nunes authored 5 years ago


ra_get_best_spill_node is what other users of the mesa register
allocator use.
Switching to it now also fixes an infinite loop issue with ppir regalloc
with the ppir control flow patchset, and also provides a small gain over
the previous herusitic on number of spilled nodes testing with
shader-db.

Signed-off-by: Erico Nunes <nunes.erico@gmail.com>
Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com>

71fb721c

tgsi: Remove unused tgsi_check_soa_dependencies(). · c1dc84e7

Emma Anholt authored 5 years ago


Acked-by: Eric Engestrom <eric@engestrom.ch>
Reviewed-By: Gert Wollny <gert.wollny@collabora.com>

c1dc84e7

tgsi: Drop the SSE2 constants setup that's been dead code since 2011. · 4ebe6b2e

Emma Anholt authored 5 years ago


The SSE2 executor was removed in 4eb3225b ("Remove tgsi_sse2.")

Acked-by: Eric Engestrom <eric@engestrom.ch>
Reviewed-By: Gert Wollny <gert.wollny@collabora.com>

4ebe6b2e

tgsi: drop a stale comment · 98c58355

Emma Anholt authored 5 years ago


This was fixed in 912ed84f ("tgsi: move to using vector for system
values.")

Acked-by: Eric Engestrom <eric@engestrom.ch>
Reviewed-By: Gert Wollny <gert.wollny@collabora.com>

98c58355

gitlab-ci: Enable the GLES2/3 CTS on softpipe. · 553cd82d

Emma Anholt authored 5 years ago

The GLES2 CTS takes about 8 minutes of total runtime (at parallel 4 is
~2 minutes in the test stage if runners are free), while GLES3 takes
about 25.  Since the GLES3 run is pretty expensive, just do a cheap
touch test of 1 out of every 10 tests in the test list on MRs, until
we can get the runtime down.

v2: Drop the full run for now until we can bring runtime down or bring
    up a dedicated mesa runner.

Reviewed-by: Eric Engestrom <eric@engestrom.ch> (v1)
Reviewed-By: Gert Wollny <gert.wollny@collabora.com> (v1)

553cd82d

mesa: reverse no_error on compressed_tex_sub_image for TEX_MODE_CURRENT · 6c904773

José María Casanova Crespo authored 5 years ago


This fixes the regression introduced on "mesa: refactor
compressed_tex_sub_image function" that started to crash
KHR-GLES2.texture_3d.compressed_texture.negative_compressed_tex_sub_image

Fixes: 7df233d6 ("mesa: refactor compressed_tex_sub_image function")
Reviewed-by: Eric Anholt <eric@anholt.net>

6c904773

glx: Eliminate glx_config::{rgb,float,colorIndex}Mode · b2839193
Adam Jackson authored 5 years ago
```
These are redundant with glx_config::renderType, let's just use that
consistently.
```
b2839193
glx: Remove unused glx_config::pixmapMode · 74ca87e4
Adam Jackson authored 5 years ago
```
Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>
```
74ca87e4

glx: convert glx_config_create_list to one big calloc · 35fc7bdf

Adam Jackson authored 5 years ago


Simpler, less failure prone, less malloc overhead, what's not to like.

Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>

35fc7bdf

glx: convert a malloc+memset to calloc · 97d58eab
Adam Jackson authored 5 years ago
```
Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>
```
97d58eab

glx: Fix parameter documentation of glx_config_create_list · cabd09c9

Adam Jackson authored 5 years ago


'minimum_size' is not, in fact, an argument to this function.

Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>

cabd09c9

anv: inline uniforms blocks don't count toward descriptor set limits · 38355355

Arcady Goldmints-Orlov authored 5 years ago


In a descriptor set inline uniform blocks don't use up any bindings.
However, the presence of any inline uniform blocks doed require the
use of the descriptor buffer, which takes up one binding.

Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>

38355355

nir: add divergence analysis pass. · df86c5ff

Daniel Schürmann authored 5 years ago

This pass expects the shader to be in LCSSA form.
The algorithm is based on 'The Simple Divergence Analysis' from
Diogo Sampaio, Rafael De Souza, Sylvain Collange, Fernando Magno Quintão Pereira.
Divergence Analysis. ACM Transactions on Programming Languages and Systems (TOPLAS)

Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
Reviewed-by: Connor Abbott <cwabbott0@gmail.com>

df86c5ff

nir/subgroups: Lower clustered reductions with cluster_size >= subgroup_size into reductions · 7b070349
Rhys Perry authored 5 years ago and Daniel Schürmann committed 5 years ago
```
The behavior for reductions with cluster_size >= subgroup_size is implementation defined.

Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
```
7b070349

nir/lcssa: allow to create LCSSA phis for loop-invariant booleans · 911a1dfa

Rhys Perry authored 5 years ago and

Daniel Schürmann committed 5 years ago


ACO depends on LCSSA phis for divergent booleans to work correctly.

Reviewed-by: Connor Abbott <cwabbott0@gmail.com>
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>

911a1dfa

nir/lcssa: Skip loop invariant variables when converting to LCSSA. · 9c40ad49

Daniel Schürmann authored 5 years ago and

Daniel Schürmann committed 5 years ago


Co-authored-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Connor Abbott <cwabbott0@gmail.com>
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>

9c40ad49

nir: make nir_to_lcssa() a general NIR pass. · 8a6cfaa1
Rhys Perry authored 5 years ago and Daniel Schürmann committed 5 years ago
```
Reviewed-by: Connor Abbott <cwabbott0@gmail.com>
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
```
8a6cfaa1

nir/lcssa: handle deref instructions properly · 204846ad

Daniel Schürmann authored 5 years ago


Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
Fixes: 414148cd "nir: Support deref instructions in loop_analyze"

204846ad

tgsi_to_nir: only update TGSI properties of the current shader stage · 7c56a68c

José María Casanova Crespo authored 5 years ago and

José María Casanova Crespo committed 5 years ago

The implementation introduced in "tgsi_to_nir: be careful about not
losing any TGSI properties silently (v2)" updates all the TGSI properties,
but it didn't take into account that the shader_info structure uses a union
to store the different attributes for each shader stage.

Now we only update the attributes if they affect current shader stage,
avoiding to overwrite members of the union that should be overwritten.
This has created hundreds of regressions in v3d.

For example the TGSI_PROPERTY_VS_BLIT_SGPRS_AMD was overwritting the
same position used by TGSI_PROPERY_CS_FIXED_BLOCK_DEPTH.

Fixes: e3003651 ("tgsi_to_nir: be careful about not losing any TGSI properties silently (v2)")

Reviewed-by: Marek Olšák <marek.olsak@amd.com>

7c56a68c

radv/gfx10: do not emit PA_SC_TILE_STEERING_OVERRIDE twice · 83a63a5b

Samuel Pitoiset authored 5 years ago


CLEAR_STATE emits it for us.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>

83a63a5b

radv: do not emit PKT3_CONTEXT_CONTROL with AMDGPU 3.6.0+ · 2ca8629f

Samuel Pitoiset authored 5 years ago


It's emitted by the kernel.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>

2ca8629f

mesa/program: Take ARB_framebuffers_no_attachments into account in wpos correction · 6a094053

Gert Wollny authored 5 years ago


If a drawbuffer is an fbo without an attachment then its 'Height' will be zero,
and we have to take its 'DefaultGeometry.Height' into account.

Fixes on softpipe (with the exception of tests that use multisample):
  dEQP-GLES31.functional.fbo.no_attachments.*

Signed-off-by: Gert Wollny <gert.wollny@collabora.com>
Reviewed-by: Marek Olšák <marek.olsak@amd.com>

6a094053

iris: Enable non coherent framebuffer fetch on broadwell · fe0e9db7

Sagar Ghuge authored 5 years ago and

Kenneth Graunke committed 5 years ago


v2: Use GEN_GEN in iris_state (Kenneth Graunke)

Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>
Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>

fe0e9db7

iris: Free resource if failed to allocate surface state · 57ce422e
Sagar Ghuge authored 5 years ago and Kenneth Graunke committed 5 years ago
```
Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
```
57ce422e

iris: Pass isl_surf to fill_surface_state · 02244bc5

Sagar Ghuge authored 5 years ago and

Kenneth Graunke committed 5 years ago


Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
Suggested-by: Kenneth Graunke <kenneth@whitecape.org>

02244bc5

iris: Add infrastructure to support non coherent framebuffer fetch · 638a157e

Sagar Ghuge authored 5 years ago and

Kenneth Graunke committed 5 years ago


Create separate SURFACE_STATE for render target read in order to support
non coherent framebuffer fetch on broadwell.

Also we need to resolve framebuffer in order to support CCS_D.

v2: Add outputs_read check (Kenneth Graunke)

v3: 1) Import Curro's comment from get_isl_surf
    2) Rename get_isl_surf method
    3) Clean up allocation in case of failure

Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>

638a157e

iris: Add helper functions to get tile offset · 61c0637a

Sagar Ghuge authored 5 years ago and

Kenneth Graunke committed 5 years ago


All helper functions are ported from i965 driver.

Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>

61c0637a

iris: Add helper function to get isl dim layout · 7e816991

Sagar Ghuge authored 5 years ago and

Kenneth Graunke committed 5 years ago


v2: Add missing space (Caio)

Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>

7e816991

iris: Add render target read entry in binding table · 58471e20

Sagar Ghuge authored 5 years ago and

Kenneth Graunke committed 5 years ago


This will be used in next patches for supporting non coherent
framebuffer fetch on Broadwell.

v2: Fix comment (Kenneth Graunke)

v3: 1) Fix a few nits (Caio)
    2) Add comment (Caio)

Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>
Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>

58471e20

build: Bump C++ standard requirement to C++14 to fix FTBFS with LLVM 10 · 1abe8738

Kai Wasserbäch authored 5 years ago and

Dave Airlie committed 5 years ago


When building Mesa against a recent LLVM 10 with C++11, the build fails
if the AMD common code is built as well due to "std::index_sequence"
being undeclared.

LLVM requires a minimum of C++14.

Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org>
Acked-by: Eric Engestrom <eric@engestrom.ch>

1abe8738

panfrost: Add madvise support to BO cache · d0ec5d38

Rob Herring authored 5 years ago


The kernel now supports madvise ioctl to indicate which BOs can be freed
when there is memory pressure. Mark BOs purgeable when they are in the
BO cache. The BOs must also be munmapped when they are in the cache or
they cannot be purged.

We could optimize avoiding the madvise ioctl on older kernels once the
driver version bump lands, but probably not worth it given the other
driver features also being added.

Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>
Signed-off-by: Rob Herring <robh@kernel.org>

d0ec5d38

panfrost: Sync UAPI header from kernel · c45c2d79

Rob Herring authored 5 years ago


Sync the panfrost_drm.h UAPI header with the latest from the kernel.
This adds madvise ioctl and GPU feature params.

Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>
Signed-off-by: Rob Herring <robh@kernel.org>

c45c2d79

Aug 19, 2019
- mesa: add ext_dsa GetMultiTexLevelParameterEXT · 0f07d18e
  Pierre-Eric Pelloux-Prayer authored 5 years ago
  
  Reviewed-by: Marek Olšák <marek.olsak@amd.com>
  0f07d18e
- mesa: add EXT_dsa glCompressedMultiTex* functions display list support · e8c5dc9c
  Pierre-Eric Pelloux-Prayer authored 5 years ago
  
  Reviewed-by: Marek Olšák <marek.olsak@amd.com>
  e8c5dc9c
- mesa: add EXT_dsa glCompressedMultiTex* functions · 1cb8e127
  Pierre-Eric Pelloux-Prayer authored 5 years ago
  
  Reviewed-by: Marek Olšák <marek.olsak@amd.com>
  1cb8e127
- mesa: add EXT_dsa glCompressedTex* functions display list support · a886025e
  Pierre-Eric Pelloux-Prayer authored 5 years ago
  
  Reviewed-by: Marek Olšák <marek.olsak@amd.com>
  a886025e
- mesa: add EXT_dsa glCompressedTexture(Sub)Image1D/2D/3D functions · 8c762218
  Pierre-Eric Pelloux-Prayer authored 5 years ago
  
  Reviewed-by: Marek Olšák <marek.olsak@amd.com>
  8c762218
- mesa: refactor compressed_tex_sub_image function · 7df233d6
  Pierre-Eric Pelloux-Prayer authored 5 years ago
  
  Combine compressed_tex_sub_image, compressed_tex_sub_image_error and compressed_tex_sub_image_no_error in a single function. The added "enum tex_mode mode" parameter allows to implement the DSA / non-DSA variants and their error/no_error combination. Reviewed-by: Marek Olšák <marek.olsak@amd.com>
  7df233d6
- radv: Add Renoir support. · 6c5d9838
  Bas Nieuwenhuizen authored 5 years ago
  
  Took the freedom to enable dfsm even though I don't have benchmark results yet, but it seems Raven-like. Rest is from radeonsi. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
  6c5d9838
- radeonsi/nir: always lower ballot masks as 64-bit, codegen handles it · 223b3174
  Marek Olšák authored 5 years ago
  
  This fixes KHR-GL45.shader_ballot_tests.ShaderBallotBitmasks. This solution is better, because the IR isn't dependent on wave32.
  223b3174

Admin message