Commits · fixes-32 · Alyssa Rosenzweig / mesa

Feb 13, 2019
- Totally unite SFBD/MFBD · 9d97f27f
  Alyssa Rosenzweig authored Feb 13, 2019
  
- Split up SFBD/MFBD for clears · 3e76db1d
  Alyssa Rosenzweig authored Feb 13, 2019
  
- Fix issue for MFBD · adb991e9
  Alyssa Rosenzweig authored Feb 13, 2019
  
- Remove some preprocessed blocked SFBD/MFBD distinctions · b3b9be1d
  Alyssa Rosenzweig authored Feb 13, 2019
  
- Move SFBD definition inward · 1a9a9020
  Alyssa Rosenzweig authored Feb 13, 2019
  
- Totally dynamic version of T6XX/T8XX checks · 831c1cc3
  Alyssa Rosenzweig authored Feb 13, 2019
  
- Set random unknown bit on both arches · 13820c2c
  Alyssa Rosenzweig authored Feb 13, 2019
  
- Explain why we do it the way we do · 7821d5ac
  Alyssa Rosenzweig authored Feb 13, 2019
  
- Explain some of the ifdefs we keep · d356953e
  Alyssa Rosenzweig authored Feb 13, 2019
  
- Remove point of divergence · 8bcbff13
  Alyssa Rosenzweig authored Feb 13, 2019
  
- Remove unknown T6XX dead code · f86a6461
  Alyssa Rosenzweig authored Feb 13, 2019
  
- Let include/panfrost-job.h be Midgard version independent · 09347ad9
  Alyssa Rosenzweig authored Feb 13, 2019
  
- Attribute some T6XX/T8XX differences to bitness · 64e48bbf
  Alyssa Rosenzweig authored Feb 13, 2019
  
- Remove prints · 16edad6f
  Alyssa Rosenzweig authored Feb 13, 2019
  
- Handle blending on SFBD · 30ea904b
  Alyssa Rosenzweig authored Feb 13, 2019
  
Feb 12, 2019
- UBOs are fine here · bfda5f3c
  Alyssa Rosenzweig authored Feb 12, 2019
  
- Fix tiler issue on SFBD · e779b131
  Alyssa Rosenzweig authored Feb 12, 2019
  
- Fix blend descriptor on SFBD · e8f30656
  Alyssa Rosenzweig authored Feb 12, 2019
  
- Rule out UBO issues for now · 244a403e
  Alyssa Rosenzweig authored Feb 12, 2019
  
Feb 11, 2019
- It goes · 25f0ae3c
  Alyssa Rosenzweig authored Feb 11, 2019
  
- SFBD resurrect · 727aa60b
  Alyssa Rosenzweig authored Feb 11, 2019
  
- Fuffle · b12e6101
  Alyssa Rosenzweig authored Feb 11, 2019
  
Feb 10, 2019

Repo · beaa45a2
Alyssa Rosenzweig authored Feb 10, 2019

panfrost: Fix build when using overlay · 107c5c84
Alyssa Rosenzweig authored Feb 10, 2019
```
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
```
panfrost: Elucidate texture op scheduling comment · 24587972
Alyssa Rosenzweig authored Feb 09, 2019
```
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
```
panfrost: Remove speculative if 0'd format bit code · 658961ae
Alyssa Rosenzweig authored Feb 09, 2019
```
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
```
panfrost: Remove if 0'd dead code · b1213a39
Alyssa Rosenzweig authored Feb 08, 2019
```
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
```

panfrost: Add kernel-agnostic resource management · e91e1786

Alyssa Rosenzweig authored Feb 07, 2019



Various methods relating to resource management were previously marked
as kernel-specific, forcing them to stay downstream in the vendor
overlay and eventually be duplicated for DRM code. This patch adds back
this code in kernel-neutral space, allowing for code sharing and
minimising the diff to downstream.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>


panfrost: Don't hardcode number of nir_ssa_defs · 4ed23b19
Alyssa Rosenzweig authored Feb 07, 2019
```
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
```

panfrost: Clean-up one-argument passing quirk · 97dcad8d

Alyssa Rosenzweig authored Feb 07, 2019

Most Midgard instructions take two arguments logically; there are always
two arguments at the assembly level. For the few instructions that take
only a single argument, the second argument slot is generally unused,
with a zero inline constant occupying the space. fmov/imov are the
exception, where the first argument is filled with r24 and the logical
argument is in the second slot.

Previously, these constraints were handled by a delicate, buggy series
of hacks. This commit removes these hacks. Instead, we look at the
logical number of arguments (from NIR), switching between two-argument
and one-argument-one-zero style. We then introduce a quirk for the
flipped style, which applies to fmov/imov.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
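
The three layouts described in this commit message can be sketched in C as follows. This is only an illustration: the names `arg_slots`, `pack_alu_args`, `SLOT_ZERO_CONST` and `REG_R24` are hypothetical and are not taken from the Panfrost compiler; only the layouts themselves (two arguments, one argument plus a zero constant, and the flipped fmov/imov style with r24 first) come from the text above.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical encodings, for illustration only. */
enum { SLOT_ZERO_CONST = -1, REG_R24 = 24 };

struct arg_slots {
    int slot0; /* first assembly-level argument */
    int slot1; /* second assembly-level argument */
};

/* Pick the assembly-level layout from the logical argument count.
 * `flipped` marks the fmov/imov-style quirk. */
static struct arg_slots
pack_alu_args(unsigned nr_logical, int src0, int src1, bool flipped)
{
    assert(nr_logical == 1 || nr_logical == 2);

    /* Two logical arguments map directly onto the two slots. */
    if (nr_logical == 2)
        return (struct arg_slots){ src0, src1 };

    /* Flipped style: r24 in the first slot, logical argument second. */
    if (flipped)
        return (struct arg_slots){ REG_R24, src0 };

    /* Default one-argument style: zero inline constant in the second slot. */
    return (struct arg_slots){ src0, SLOT_ZERO_CONST };
}
```

Keying the decision on the logical argument count keeps the special case confined to a single flag instead of spreading it across per-opcode hacks.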


Feb 09, 2019

glsl_type: initialize offset and location to -1 for glsl_struct_field · 49397a3c
Karol Herbst authored Feb 09, 2019
```
Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
```

nouveau: Silence unhandled cap warnings · 55e00a2e

Kenneth Graunke authored Feb 08, 2019



Nouveau apparently uses the u_screen helper but prints a warning in the
default case, so running any GL program would start grumbling.

Fixes: 8fa54bc5 gallium: Add a PIPE_CAP_NIR_COMPACT_ARRAYS capability bit.

Reviewed-by: Karol Herbst <kherbst@redhat.com>
Acked-by: Ilia Mirkin <imirkin@alum.mit.edu>


Feb 08, 2019

intel/compiler: use 0 as sampler in emit_mcs_fetch · ee670d09

Caio Oliveira authored Feb 07, 2019



The sampler will be ignored since the underlying 'ld_mcs' operation
won't use it, so just fill the field with 0 instead of the texture to
make it clearer that's the case.

This will also keep is_high_sampler() from kicking in unnecessarily when
the operation is used for a texture with index >= 16.

Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
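
The effect described above can be illustrated with a small C sketch. The functions below are hypothetical stand-ins, not the real `is_high_sampler()` or `emit_mcs_fetch()`; the only detail taken from the commit message is that a texture index of 16 or more would otherwise trigger the high-sampler path.

```c
#include <stdbool.h>

/* Hypothetical stand-in: treats any index >= 16 as a "high" sampler. */
static bool
sketch_is_high_sampler(unsigned sampler_index)
{
    return sampler_index >= 16;
}

/* Before the change: the texture index is reused as the sampler. */
static unsigned
sketch_mcs_sampler_before(unsigned texture_index)
{
    return texture_index;
}

/* After the change: the sampler field is always 0, since it is ignored. */
static unsigned
sketch_mcs_sampler_after(unsigned texture_index)
{
    (void)texture_index;
    return 0;
}
```

With a texture index of 20, the "before" variant would be classified as a high sampler while the "after" variant would not.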


wsi: query the ICD's max dimensions instead of hard-coding them · e8e54443

Eric Engestrom authored Nov 25, 2018



anv and radv both happened to already return 2^14 for these, but
querying the ICD is safer and will help if vdreno (or whatever it's
called) doesn't have the same max.

Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
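
A minimal C sketch of the idea, deriving the maximum extent from a queried limit instead of a constant. The struct and function names are hypothetical, and the commit message does not say which field is queried; the sketch assumes something like Vulkan's `maxImageDimension2D` limit.

```c
#include <stdint.h>

/* Hypothetical stand-in for the limits reported by an ICD. */
struct sketch_device_limits {
    uint32_t max_image_dimension_2d;
};

struct sketch_extent {
    uint32_t width;
    uint32_t height;
};

/* Use the queried limit for both dimensions rather than a hard-coded 2^14. */
static struct sketch_extent
sketch_max_image_extent(const struct sketch_device_limits *limits)
{
    return (struct sketch_extent){
        limits->max_image_dimension_2d,
        limits->max_image_dimension_2d,
    };
}
```

A driver that reports 2^14 gets the same result as before, while a driver with a smaller or larger limit is handled without touching the shared code.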


nir: Convert a bcsel with only phi node sources to a phi node · b031c643

Ian Romanick authored Jan 17, 2019

v2: Remove the original ALU instruction after all of its readers are
modified to read the new ALU instruction.

v3: Fix an issue where a bcsel that may not be executed on a loop
iteration due to a break statement is converted to a phi (and therefore
incorrectly "executed").  Noticed by Tim.

Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109216


Fixes: 8fb8ebfb ("intel/compiler: More peephole select")
Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>


nir: Split ALU instructions in loops that read phis · 0881e90c

Ian Romanick authored Jan 17, 2019



A single shader in Unigine Superposition is affected by this change.
A single iadd is moved to the end of a loop.  This iadd is involved in
a complex set of logic to terminate the loop, and an extra mov
instruction is inserted.  This shader really needs the optimization
suggested by bugzilla #94747, and I expect that to make this tiny
regression go away.

All Gen7+ platforms had similar results. (Skylake shown)
total instructions in shared programs: 15047543 -> 15047545 (<.01%)
instructions in affected programs: 565 -> 567 (0.35%)
helped: 0
HURT: 2

total cycles in shared programs: 369977253 -> 369978253 (<.01%)
cycles in affected programs: 127910 -> 128910 (0.78%)
helped: 0
HURT: 2

v2: Skip nir_op_vec{2,3,4} and nir_op_[fi]mov instructions to avoid
infinite optimization loops.  Remove the original ALU instruction after
all of its readers are modified to read the new ALU instruction.

v3: Extend to the more general case.  If the prev-block value from
the phi is not undef, this means the ALU instruction has to be
duplicated in both the prev-block and the continue-block.

Fixes: 8fb8ebfb ("intel/compiler: More peephole select")
Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>
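
The transformation can be pictured with ordinary C loops standing in for NIR. This is a hand-written analogy, not output of the pass: `x` plays the role of the phi, `x + 1` is the ALU instruction that reads it, and the function names are made up for the illustration.

```c
/* Before: the add reads the loop-carried value (the phi) inside the loop. */
static unsigned
sketch_sum_before(unsigned init, unsigned n)
{
    unsigned total = 0;
    unsigned x = init;              /* phi(init, next) */

    for (unsigned i = 0; i < n; i++) {
        unsigned y = x + 1;         /* ALU instruction reading the phi */
        total += y;
        x = x * 2;                  /* next value of the phi */
    }
    return total;
}

/* After: the add is duplicated into the block before the loop and into
 * the end of the loop body, so y itself becomes the loop-carried value. */
static unsigned
sketch_sum_after(unsigned init, unsigned n)
{
    unsigned total = 0;
    unsigned x = init;
    unsigned y = init + 1;          /* copy in the prev-block */

    for (unsigned i = 0; i < n; i++) {
        total += y;
        x = x * 2;
        y = x + 1;                  /* copy in the continue-block */
    }
    return total;
}
```

Both functions compute the same result; the second form also shows where the add at the end of the loop mentioned above comes from.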


nir: Select phi nodes using prev_block instead of continue_block · 0c0c6972

Ian Romanick authored Jan 14, 2019



This simplifies some changes coming later.

Fixes: 8fb8ebfb ("intel/compiler: More peephole select")
Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>


nir: Refactor code that checks phi nodes in opt_peel_loop_initial_if · 8d8f80af

Ian Romanick authored Jan 14, 2019



This will be used in a couple more places soon.

The function name is... horribly long.  Neither Matt nor I could think
of anything that was shorter and still more descriptive than
"is_phi_foo".  I'm willing to entertain suggestions.

Fixes: 8fb8ebfb ("intel/compiler: More peephole select")
Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>


nir: Document some fields of nir_loop_terminator · 4d65d2b1
Ian Romanick authored Jan 16, 2019
```
Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>
```

intel/compiler: Silence warning about value that may be used uninitialized · 28ef5bb7

Ian Romanick authored Jan 24, 2019



For some reason, this warning only occurs for me in release builds.

In file included from src/intel/compiler/brw_nir_lower_mem_access_bit_sizes.c:25:0:
src/intel/compiler/brw_nir_lower_mem_access_bit_sizes.c: In function ‘brw_nir_lower_mem_access_bit_sizes’:
src/compiler/nir/nir_builder.h:501:26: warning: ‘src_swiz[2]’ may be used uninitialized in this function [-Wmaybe-uninitialized]
       alu_src.swizzle[i] = swiz[i];
       ~~~~~~~~~~~~~~~~~~~^~~~~~~~~
src/intel/compiler/brw_nir_lower_mem_access_bit_sizes.c:225:16: note: ‘src_swiz[2]’ was declared here
       unsigned src_swiz[4];
                ^~~~~~~~

Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>
Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>
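
The commit message does not show the change itself. As a hedged illustration only, one common way to silence this kind of `-Wmaybe-uninitialized` warning is to zero-initialise the array, so that entries the loop never writes still hold a defined value. The function below is hypothetical and merely reuses the `src_swiz` name from the warning.

```c
/* Only the first num_components entries are written by the first loop;
 * zero-initialising the array keeps the remaining entries defined. */
static unsigned
sketch_sum_swizzle(unsigned num_components)
{
    unsigned src_swiz[4] = { 0 };   /* assumption: zero-init as the fix */

    for (unsigned i = 0; i < num_components && i < 4; i++)
        src_swiz[i] = i + 1;

    /* Reads all four entries, including any the loop above skipped. */
    unsigned sum = 0;
    for (unsigned i = 0; i < 4; i++)
        sum += src_swiz[i];

    return sum;
}
```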

