Commits · iris-rebind-vb · Kenneth Graunke / mesa

Sep 24, 2019
- iris: Fix iris_rebind_buffer() for VBOs with non-zero offsets. · d304e879
  Kenneth Graunke authored Sep 23, 2019
  
  d304e879
- glx: Move vertex array protocol state into the indirect backend · 01e43798
  Adam Jackson authored Nov 14, 2017
```
Only relevant for indirect contexts, so let's get that code out of the
common path.
```
  01e43798
Sep 23, 2019

intel: Increase Gen11 compute shader scratch IDs to 64. · b9e93db2

Kenneth Graunke authored Aug 22, 2019



From the MEDIA_VFE_STATE docs:

   "Starting with this configuration, the Maximum Number of Threads must
    be set to (#EU * 8) for GPGPU dispatches.

    Although there are only 7 threads per EU in the configuration, the
    FFTID is calculated as if there are 8 threads per EU, which in turn
    requires a larger amount of Scratch Space to be allocated by the
    driver."

It's pretty clear that we need to increase this for scratch address
calculations, because the FFTID has a certain bit-pattern.  The quote
above seems to indicate that we should increase the actual thread count
programmed in MEDIA_VFE_STATE as well, but we think the intention is to
only bump the scratch space.

Fixes GPU hangs in Bioshock Infinite and Synmark's CSDof on Icelake 8x8.

Fixes: 5ac804bd ("intel: Add a preliminary device for Ice Lake")
Reviewed-by: Matt Turner <mattst88@gmail.com>

b9e93db2

Revert "intel/gen11+: Enable Hardware filtering of Semi-Pipelined State in WM" · 50c0dd86

Kenneth Graunke authored Sep 23, 2019

This reverts commit 729de148.

It turns out that, although the register is in the logical context,
it isn't whitelisted, so we can't actually write it from userspace
batch buffers.  The write just becomes a noop, which is why we saw
no performance changes.

I manually whitelisted it, and still observed no performance gains, but
it did regress KHR-GL46.texture_cube_map_array.color_depth_attachments
on the iris driver.  So we might need to fix something before enabling
this.  To prevent it randomly getting turned on should the kernel ever
whitelist this register, we revert the patch for now.

50c0dd86

util/rb_tree: Replace useless ifs with asserts · 03911195
Faith Ekstrand authored Sep 23, 2019
```
Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>
```
03911195

broadcom/genxml: Stop manually scrubbing 'α' -> "alpha" · a733423d

Kenneth Graunke authored Sep 20, 2019



'α' has never appeared in any genxml files, so there's no need to
replace it with the word "alpha".

Reviewed-by: Eric Anholt <eric@anholt.net>

a733423d

intel/genxml: Stop manually scrubbing 'α' -> "alpha" · 8489206e

Kenneth Graunke authored Aug 22, 2019



'α' has never appeared in any genxml files, so there's no need to
replace it with the word "alpha".

Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>

8489206e

freedreno/a6xx: do streamout only in binning pass · d8cbf1ad

Rob Clark authored Sep 20, 2019 and

Rob Clark committed Sep 23, 2019



Use VPC_SO_OVERRIDE to control whether we do streamout in binning or
draw pass.  Normally we want to do streamout in binning pass, except
when there is a single tile and binning passed is skipped.

Signed-off-by: Rob Clark <robdclark@chromium.org>
Reviewed-by: Eric Anholt <eric@anholt.net>
Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>

d8cbf1ad

freedreno/a6xx: fix binning pass vs. xfb · b9bf3745

Rob Clark authored Sep 20, 2019 and

Rob Clark committed Sep 23, 2019



We could bit doing streamout from binning pass.  In this case we want to
use the full VS which doesn't have (potentially streamed out) varyings
stripped out.

Signed-off-by: Rob Clark <robdclark@chromium.org>
Reviewed-by: Eric Anholt <eric@anholt.net>
Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>

b9bf3745

freedreno/a6xx: un-open-code PC_PRIMITIVE_CNTL_1.PSIZE · 331f89a9

Rob Clark authored Sep 19, 2019 and

Rob Clark committed Sep 23, 2019



Signed-off-by: Rob Clark <robdclark@chromium.org>
Reviewed-by: Eric Anholt <eric@anholt.net>
Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>

331f89a9

ac/nir: force unnormalized coordinates for RECT · 05d32850
Marek Olšák authored Sep 18, 2019
```
This fixes VAAPI.

Reviewed-by: Connor Abbott <cwabbott0@gmail.com>
```
05d32850
ac/nir: port Z compare value clamping from radeonsi · 500181b2
Marek Olšák authored Sep 18, 2019
```
This fixes some dEQP tests.

Reviewed-by: Connor Abbott <cwabbott0@gmail.com>
```
500181b2
tgsi_to_nir: fix 2-component system values like tess_level_inner_default · 09447ccc
Marek Olšák authored Sep 18, 2019
```
Reviewed-by: Connor Abbott <cwabbott0@gmail.com>
Reviewed-by: Eric Anholt <eric@anholt.net>
```
09447ccc

tgsi_to_nir: fix masked out image loads · 3906fce8

Marek Olšák authored Sep 18, 2019



This caused a failure in NIR validation.

Reviewed-by: Connor Abbott <cwabbott0@gmail.com>
Reviewed-by: Eric Anholt <eric@anholt.net>

3906fce8

nir: define 8-byte size and alignment for bindless variables · 780eeaf2
Marek Olšák authored Sep 18, 2019
```
Reviewed-by: Connor Abbott <cwabbott0@gmail.com>
```
780eeaf2
nir: don't add bindless variables to num_textures and num_images · f5c103ce
Marek Olšák authored Sep 18, 2019
```
It confuses radeonsi.

Reviewed-by: Connor Abbott <cwabbott0@gmail.com>
```
f5c103ce
amd: remove all PCI IDs supported by amdgpu · 150f6ffb
Marek Olšák authored Sep 18, 2019
```
Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
```
150f6ffb

loader: always map the "amdgpu" kernel driver name to radeonsi (v2) · 5a545e35

Sonny Jiang authored Sep 03, 2019



v2: cleanup

Signed-off-by: Sonny Jiang <sonny.jiang@amd.com>
Signed-off-by: Marek Olšák <marek.olsak@amd.com>
Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>

5a545e35

ac: stop using PCI IDs for chip identification · 94297142

Marek Olšák authored Sep 18, 2019



PCI IDs for amdgpu will be removed from Mesa.

Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>

94297142

ac/addrlib: fix chip identification for Vega10, Arcturus, Raven2, Renoir · 48742de6

Marek Olšák authored Sep 18, 2019



Cc: 19.2 <mesa-stable@lists.freedesktop.org>
Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>

48742de6

amd: add more PCI IDs for Navi14 · 65b69813
Marek Olšák authored Sep 23, 2019
```
trivial and urgent

Cc: 19.2 <mesa-stable@lists.freedesktop.org>
```
65b69813

meson: split compiler warnings one per line · c29c4101

Eric Engestrom authored Sep 23, 2019



Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>
Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>

c29c4101

nir/repair_ssa: Replace the unreachable check with the phi builder · d63162cf

Faith Ekstrand authored Sep 09, 2019

In a3268599, I attempted to fix nir_repair_ssa for unreachable
blocks. However, that commit missed the possibility that the use is in
a block which, itself, is unreachable. In this case, we can end up in
an infinite loop trying to replace a def with itself. Even though a
no-op replacement is a fine operation, it keeps extending the end of the
uses list as we're walking it. Instead of explicitly checking for the
group of conditions, just check if the phi builder gives us a different
def. That's guaranteed to be 100% reliable and, while it lacks symmetry
with the is_valid checks, should be more reliable.

Fixes: a3268599 "nir/repair_ssa: Repair dominance for unreachable..."
Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>

d63162cf

aco: only emit waitcnt on loop continues if we there was some load or export · 2c050b49
Daniel Schürmann authored Sep 19, 2019
```
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
```
2c050b49
nv50/ir/nir: comparison of integer expressions of different signedness warning · 70e39294
Karol Herbst authored Sep 20, 2019
```
Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Rhys Kidd <rhyskidd@gmail.com>
```
70e39294

nv50/ir: fix unnecessary parentheses warning · 61ccca12

Karol Herbst authored Sep 20, 2019



Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Rhys Kidd <rhyskidd@gmail.com>

61ccca12

lima: remove partial clear support from pipe->clear() · ab49a0e7

Erico Nunes authored Sep 19, 2019



pipe->clear() is not called for partial clears, which mesa emulates by
drawing a quad.
Furthermore, drivers should not use rasterizer state information for
scissor information (which was being used to handle the partial clears).
So, remove the partial clear support since it was not supposed to be
handled by pipe->clear() anyway.
This fixes issues with clearing after switching to different sized
framebuffers.

Signed-off-by: Erico Nunes <nunes.erico@gmail.com>
Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com>
Reviewed-by: Qiang Yu <yuq825@gmail.com>

ab49a0e7

dEQP-GLES2.functional.buffer.write.use.index_array.* are passing now. · 0c6ca0a6
Boris Brezillon authored Sep 18, 2019
```
Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com>
```
0c6ca0a6

panfrost: Fix indexed draws · 055497fa

Boris Brezillon authored Sep 18, 2019



->padded_count should be large enough to cover all vertices pointed by
the index array. Use the local vertex_count variable that contains the
updated vertex_count value for the indexed draw case.

Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>

055497fa

clover/nir: fix compilation with g++-5.5 and maybe earlier · 697eb8f9

Karol Herbst authored Sep 22, 2019 and

Karol Herbst committed Sep 23, 2019

fixes "sorry, unimplemented: non-trivial designated initializers not supported"

Fixes: deb04adf ("clover: add support for passing kernels as nir to the driver")
Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Eric Engestrom <eric@engestrom.ch>

697eb8f9

st/mesa: Bail on incomplete attachments in discard_framebuffer · ec81f19b

Kenneth Graunke authored Sep 20, 2019

Incomplete attachments don't have an associated pipe_surface, so
this would crash.

Fixes a WebGL conformance test that uses incomplete attachments:
https://www.khronos.org/registry/webgl/sdk/tests/conformance2/renderbuffers/invalidate-framebuffer.html?webglVersion=2&quiet=0&quick=1

Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=111756

Reviewed-By: Tapani Pälli <tapani.palli@intel.com>

ec81f19b

lima: implement BO cache · d2147787

Vasily Khoruzhick authored Sep 07, 2019



Allocating BOs is expensive, so we should avoid doing that by caching
freed BOs.

BO cache is modelled after one in v3d driver and works as follows:

- in lima_bo_create() check if we have matching BO in cache and return
  it if there's one, allocate new BO otherwise.
- in lima_bo_unreference() (renamed from lima_bo_free()): put BO in
  cache instead of freeing it and remove all stale BOs from cache

Reviewed-by: Qiang Yu <yuq825@gmail.com>
Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>

d2147787

lima: use 0 to poll if BO is busy in lima_bo_wait() · 9f897a2b

Vasily Khoruzhick authored Sep 07, 2019



os_time_get_absolute_timeout(0) returns current time, while kernel
driver expects 0 as value to poll BO status and return immediately.
Fix it by setting abs_timeout to 0 if timeout_ns is 0

Reviewed-by: Qiang Yu <yuq825@gmail.com>
Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>

9f897a2b

lima: move damage bound build to resource · 7f7ac210

Qiang Yu authored Aug 25, 2019



Reviewed-and-Tested-by: Vasily Khoruzhick <anarsoul@gmail.com>
Signed-off-by: Qiang Yu <yuq825@gmail.com>

7f7ac210

lima: don't use damage system when full damage · 4ed569ee

Qiang Yu authored Aug 25, 2019



Some time weston set full damage region. It is
more effient to use the cached pp stream instead
of dynamically create one.

Reviewed-and-Tested-by: Vasily Khoruzhick <anarsoul@gmail.com>
Signed-off-by: Qiang Yu <yuq825@gmail.com>

4ed569ee

lima: implement EGL_KHR_partial_update · afbaed90

Qiang Yu authored Jun 30, 2019



This extension set a damage region for each
buffer swap which can be used to reduce buffer
reload cost by only feed damage region's tile
buffer address for PP.

Reviewed-and-Tested-by: Vasily Khoruzhick <anarsoul@gmail.com>
Signed-off-by: Qiang Yu <yuq825@gmail.com>

afbaed90

Sep 22, 2019

lima: fix PLBU viewport configuration · 8278b236

Icenowy Zheng authored Sep 22, 2019



The PLBU expects the viewport's 4 borders' coordinates, however
currently we're feeding the coordinate of the left-bottom point and the
size to it, which leads to misrendering when the left-bottom point is
not (0,0).

Change the macros for the viewport PLBU command, and the data feed to
it. The code to calculate the 4 borders is ported from Panfrost.

Signed-off-by: Icenowy Zheng <icenowy@aosc.io>
Reviewed-by: Qiang Yu <yuq825@gmail.com>

8278b236

Sep 21, 2019

amd: Build aco only if radv is enabled · 40087ffc

Bas Nieuwenhuizen authored Sep 20, 2019

ACO depends on C++14, but radeonsi/radv with LLVM 8,9 do not. Let us
only require it for RADV, since that is the only user.

Fixes: a70a9987 "radv/aco: Setup alternate path in RADV to support the experimental ACO compiler"
Reviewed-by: Marek Olšák <marek.olsak@amd.com>

40087ffc

nvc0: expose spirv support · 7955fabc

Karol Herbst authored May 10, 2019 and

Karol Herbst committed Sep 21, 2019



required for OpenCL

v2: adjust to changes in previous commits
v3: properly convert to NIR in nvc0_cp_state_create

Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Pierre Moreau <pierre.morrow@free.fr> (v1)

7955fabc

clover: add support for passing kernels as nir to the driver · deb04adf

Karol Herbst authored Aug 06, 2019 and

Karol Herbst committed Sep 21, 2019



v2: minor formatting fixes
v3: call glsl_type_singleton_init_or_ref and glsl_type_singleton_decref
v4: capitalize and punctuate comments
    fix text_executable -> text_intermediate in TODO
    make glsl_type_singleton wrapper static
v5: rewrite how we run the nir passes
v6: fix unhandled case switch warning in st/mesa

Signed-off-by: Karol Herbst <kherbst@redhat.com>
Reviewed-by: Francisco Jerez <currojerez@riseup.net> (v4)

deb04adf

Admin message