Commits · mesa-11.0.0-rc3 · Julien Isorce / mesa

Sep 06, 2015

Update version to 11.0.0-rc3 · 271290f0
Emil Velikov authored 9 years ago
```
Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>
```
mesa-11.0.0-rc3

271290f0
nouveau: don't mark full range as used on unmap with explicit flush · 7bf27c23
Ilia Mirkin authored 9 years ago and Emil Velikov committed 9 years ago
```
Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Cc: mesa-stable@lists.freedesktop.org
(cherry picked from commit a7788317)
```
7bf27c23

nv50: avoid using inline vertex data submit when gl_VertexID is used · 7f80a238

Ilia Mirkin authored 9 years ago and

Emil Velikov committed 9 years ago


The hardware only generates vertexid when vertices come from a VBO. This
fixes:

  vertexid-drawelements
  vertexid-drawarrays

Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Cc: "11.0" <mesa-stable@lists.freedesktop.org>
(cherry picked from commit c830d193)

7f80a238

nv50: don't flush vertex arrays when index buffer changes · 3e1fde76

Ilia Mirkin authored 9 years ago and

Emil Velikov committed 9 years ago


The index buffer is fed in inline over a pushbuf. It's not related to
vertices or any caching that might be done on them.

Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Cc: mesa-stable@lists.freedesktop.org
(cherry picked from commit 4a025c6b)

3e1fde76

nv50: rebind bo to bufctx when invalidating idxbuf storage · 747e1b03

Ilia Mirkin authored 9 years ago and

Emil Velikov committed 9 years ago


There is nothing to be done on a dirty idxbuf, but the bo may have
changed, so we have to rebind it to the bufctx.

Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Cc: mesa-stable@lists.freedesktop.org
(cherry picked from commit 1f62d36a)

747e1b03

nv50: clear buffer status on all vertex bufs, not just the first one · b85ec1e3
Ilia Mirkin authored 9 years ago and Emil Velikov committed 9 years ago
```
Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Cc: mesa-stable@lists.freedesktop.org
(cherry picked from commit 114cc18b)
```
b85ec1e3

nv50: fix drawing from tfb, direct-to-pushbuf submits · acb822f1

Ilia Mirkin authored 10 years ago and

Emil Velikov committed 9 years ago


The stride was being set to 0, which is illegal (and also non-sensical).
Also we must wait for the buffer to become available for reading as
otherwise a wrong value may be prefetched. Since we must wait for the
buffer anyways, and it's mapped and in GART, we may as well avoid the
annoyance of the indirect pushbuf submit.

Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Cc: mesa-stable@lists.freedesktop.org
(cherry picked from commit 75e34d1d)

acb822f1

llvmpipe: convert double to long long instead of unsigned long long · ddf45949

Oded Gabbay authored 9 years ago and

Emil Velikov committed 9 years ago


round(val*dscale) produces a double result, as val and dscale are double.
However, LLVMConstInt receives unsigned long long, so there is an
implicit conversion from double to unsigned long long.
This is an undefined behavior. Therefore, we need to first explicitly
convert the round result to long long, and then let the compiler handle
conversion from that to unsigned long long.

This bug manifests itself in POWER, where all IMM values of -1 are being
converted to 0 implicitly, causing a wrong LLVM IR output.

Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>
CC: "10.6 11.0" <mesa-stable@lists.freedesktop.org>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
Reviewed-by: Roland Scheidegger <sroland@vmware.com>
(cherry picked from commit 4f2290d1)

ddf45949

nv30: Implement color resolve for msaa · fcdaa190

Hans de Goede authored 9 years ago and

Emil Velikov committed 9 years ago


Note this is not ideal. Since the sifm can only do source sizes upto
1024x1024 we end up using the blitter on nv4x, which is not that fast.

And on nv3x we end up using the cpu which is really slow.

Cc: "10.6 11.0" <mesa-stable@lists.freedesktop.org>
Signed-off-by: Hans de Goede <hdegoede@redhat.com>
Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>
(cherry picked from commit 3c6c4d4f)

fcdaa190

nv30: Fix creation of scanout buffers · 0abcd9c8

Hans de Goede authored 9 years ago and

Emil Velikov committed 9 years ago


Scanout buffers on nv30 must always be non-swizzled and have special
width alignment constraints.

These constrains have been taken from the xf86-video-nouveau
src/nv_accel_common.c: nouveau_allocate_surface() function.

nouveau_allocate_surface() applies these width constraints only when a
tiled attribute is set, which it sets for all surfaces allocated via
dri, and this "tiling" is not the same as swizzling, scanout surfaces
must be linear / have a uniform_pitch or only complete garbage is shown.

This commit fixes dri3 on nv30 showing a garbled display, with dri3 the
scanout buffers are allocated by mesa, rather then by the ddx, and the
wrong stride of these buffers was causing the garbled display.

Cc: "10.6 11.0" <mesa-stable@lists.freedesktop.org>
Signed-off-by: Hans de Goede <hdegoede@redhat.com>
Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>
(cherry picked from commit 3329703e)

0abcd9c8

vc4: Initialize pack field of qreg to 0 in qir_get_temp · 0b14d358

Boyan Ding authored 9 years ago and

Emil Velikov committed 9 years ago


This avoids generation of undefined packing in qir and qpu instructions,
fixing a lot of rendering errors.

Fixes 8b36d107 (vc4: Pack the unorm-packing bits into a src MUL
instruction when possible.)

Cc: mesa-stable@lists.freedesktop.org
Signed-off-by: Boyan Ding <boyan.j.ding@gmail.com>
Reviewed-by: Eric Anholt <eric@anholt.net>
Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>
(cherry picked from commit 48de40ce)

0b14d358

i965: Disallow PixelTransfer operations for tiled-memcpy TexImage/ReadPixels · a6710090

Chris Wilson authored 9 years ago and

Emil Velikov committed 9 years ago

The tiled memcpy fast paths perform a simple blit (with only a couple of
trivial pixel conversion routines) and do not accommodate PixelTransfer
operations. Therefore if any are set, fallback to the regular routines.
Note that PixelTransfer only applies to TexImage and ReadPixels, not to
GetTexImage.

Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
Cc: Jason Ekstrand <jason.ekstrand@intel.com>
Cc: Kenneth Graunke <kenneth@whitecape.org>
Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
Cc: mesa-stable@lists.freedesktop.org
(cherry picked from commit 099f5b3a)

a6710090

i965: Fix copy propagation type changes. · 0c98ba7a

Kenneth Graunke authored 9 years ago and

Emil Velikov committed 9 years ago


commit 472ef9a0 introduced code to
change the types of SEL and MOV instructions for moves that simply
"copy bits around".  It didn't account for type conversion moves,
however.  So it would happily turn this:

   mov(8) vgrf6:D, -vgrf5:D
   mov(8) vgrf7:F, vgrf6:UD

into this:

   mov(8) vgrf6:D, -vgrf5:D
   mov(8) vgrf7:D, -vgrf5:D

which erroneously drops the conversion to float.

Cc: "11.0 10.6" <mesa-stable@lists.freedesktop.org>
Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>
Reviewed-by: Matt Turner <mattst88@gmail.com>
(cherry picked from commit 2ace64fd)

0c98ba7a

winsys/radeon: remove exported buffers from the cache · eef8258a

Marek Olšák authored 9 years ago and

Emil Velikov committed 9 years ago


Cc: 11.0 <mesa-stable@lists.freedesktop.org>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
(cherry picked from commit efea7c3a)

eef8258a

winsys/amdgpu: remove exported buffers from the cache · 747cd2c2

Marek Olšák authored 9 years ago and

Emil Velikov committed 9 years ago


Cc: 11.0 <mesa-stable@lists.freedesktop.org>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
(cherry picked from commit 54964c77)

747cd2c2

gallium/pb_bufmgr_cache: add a way to remove buffers from the cache explicitly · ecdd69cd

Marek Olšák authored 9 years ago and

Emil Velikov committed 9 years ago


This must be done before exporting a buffer as dmabuf fds, because
we lose track of who is using it and can't trust the reference counter.

Cc: 11.0 <mesa-stable@lists.freedesktop.org>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
(cherry picked from commit 35d0f127)

ecdd69cd

glsl: Handle attribute aliasing in attribute storage limit check. · 74fa1069

Kenneth Graunke authored 9 years ago and

Emil Velikov committed 9 years ago


In various versions of OpenGL and GLSL, it's possible to declare
multiple VS input variables with aliasing attribute locations.

So, when computing the storage requirements for vertex attributes,
we can't simply add up the sizes.  Instead, we need to look at the
enabled slots.

This patch begins tracking which attributes are double types that
are larger than 128-bits (i.e. take up two vec4 slots).  We then
count normal attributes once, and count the double-size attributes
a second time.

Fixes deQP functional.attribute_location.bind_aliasing.max_cond_* tests
on i965, which regressed with commit ad208d97.

No Piglit changes on llvmpipe (which actually supports dvecs).

Cc: "10.6 11.0" <mesa-stable@lists.freedesktop.org>
Tested-by: Mark Janes <mark.a.janes@intel.com>
Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>
Reviewed-by: Dave Airlie <airlied@redhat.com>
Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
(cherry picked from commit c3294ca5)

74fa1069

mesa: Don't allow wrong type setters for matrix uniforms · 11534200

Ian Romanick authored 9 years ago and

Emil Velikov committed 9 years ago


Previously we would allow glUniformMatrix4fv on a dmat4 and
glUniformMatrix4dv on a mat4.  Both are illegal.  That later also
overwrites the storage for the mat4 and causes bad things to happen.

Should fix the (new) arb_gpu_shader_fp64-wrong-type-setter piglit test.

Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>
Reviewed-by: Timothy Arceri <t_arceri@yahoo.com.au>
Cc: Dave Airlie <airlied@redhat.com>
Cc: "10.6 11.0" <mesa-stable@lists.freedesktop.org>
(cherry picked from commit 7237c937)

11534200

mesa: Pass the type to _mesa_uniform_matrix as a glsl_base_type · 5704d473

Ian Romanick authored 9 years ago and

Emil Velikov committed 9 years ago

This matches _mesa_uniform, and it enables the bug fix in the next
patch.

v2: s/type/basicType/ in the assert in _mesa_uniform_matrix.

Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>
Reviewed-by: Timothy Arceri <t_arceri@yahoo.com.au> [v1]
Cc: Dave Airlie <airlied@redhat.com>
Cc: "10.6 11.0" <mesa-stable@lists.freedesktop.org>
(cherry picked from commit a6976f09)

5704d473

i965/fs: Handle MRF destinations in lower_integer_multiplication(). · eb2b88c4

Matt Turner authored 9 years ago and

Emil Velikov committed 9 years ago


The lowered code reads from the destination, which isn't possible from
message registers.

Fixes the following dEQP tests on SNB:

    dEQP-GLES3.functional.shaders.precision.int.highp_mul_fragment
    dEQP-GLES3.functional.shaders.precision.int.mediump_mul_fragment
    dEQP-GLES3.functional.shaders.precision.int.lowp_mul_fragment

Cc: "10.6 11.0" <mesa-stable@lists.freedesktop.org>
Tested-by: Mark Janes <mark.a.janes@intel.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>
(cherry picked from commit 9390cb84)

eb2b88c4

mesa/readpixels: check strides are equal before skipping conversion · 5c08afc8

Dave Airlie authored 9 years ago and

Emil Velikov committed 9 years ago


The CTS packed_pixels test checks that readpixels doesn't write
into the space between rows, however we fail that here unless
we check the format and stride match.

This fixes all the core mesa problems with CTS packed_pixels
tests.

Cc: "11.0" <mesa-stable@lists.freedesktop.org>
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
Signed-off-by: Dave Airlie <airlied@redhat.com>
(cherry picked from commit 32769ac0)

5c08afc8

texcompress_s3tc/fxt1: fix stride checks (v1.1) · 5fb758a4

Dave Airlie authored 9 years ago and

Emil Velikov committed 9 years ago


The fastpath currently checks the RowLength != width, but
if you have a RowLength of 7, and Alignment of 4, then
that shouldn't match.

align the rowlength to the pack alignment before comparing.

This fixes compressed cases in CTS packed_pixels_pixelstore
test when SKIP_PIXELS is enabled, which causes row length
to get set.

v1.1: add fxt1 fix (Iago)

Cc: "11.0" <mesa-stable@lists.freedesktop.org>
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
Signed-off-by: Dave Airlie <airlied@redhat.com>
(cherry picked from commit b4a70401)

5fb758a4

st/readpixels: fix accel path for skipimages. · bb378249

Dave Airlie authored 9 years ago and

Emil Velikov committed 9 years ago


We don't need to use the 3d image address here as that will
include SKIP_IMAGES, and we are only blitting a single
2D anyways, so just use the 2D path.

This fixes some memory overruns under CTS
 packed_pixels.packed_pixels_pixelstore when PACK_SKIP_IMAGES
is used.

Cc: "11.0" <mesa-stable@lists.freedesktop.org>
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
Signed-off-by: Dave Airlie <airlied@redhat.com>
(cherry picked from commit 6a3e1fb9)

bb378249

mesa/formats: 8-bit channel integer formats addition · 8fc2cbb0

Dave Airlie authored 9 years ago and

Emil Velikov committed 9 years ago


Add enough 8-bit channel formats to handle all the
different things CTS throws at us.

Cc: "11.0" <mesa-stable@lists.freedesktop.org>
Reviewed-by: Brian Paul <brianp@vmware.com>
Signed-off-by: Dave Airlie <airlied@redhat.com>
(cherry picked from commit c3c24207)

8fc2cbb0

mesa/formats: add some formats from GL3.3 · b497b88d

Dave Airlie authored 9 years ago and

Emil Velikov committed 9 years ago


GL3.3 added GL_ARB_texture_rgb10_a2ui, which specifies
a lot more things than just rgb10/a2ui.

While playing with ogl conform one of the tests must
attempted all valid formats for GL3.3 and hits the
unreachable here.

This adds the first chunk of formats that hit the
assert.

Cc: "11.0" <mesa-stable@lists.freedesktop.org>
Reviewed-by: Brian Paul <brianp@vmware.com>
Signed-off-by: Dave Airlie <airlied@redhat.com>
(cherry picked from commit 8185a023)

b497b88d

mesa: handle SwapBytes in compressed texture get code. · dcb220f2

Dave Airlie authored 9 years ago and

Emil Velikov committed 9 years ago


This case just wasn't handled, so add support for it.

Cc: "11.0" <mesa-stable@lists.freedesktop.org>
Reviewed-by: Brian Paul <brianp@vmware.com>
Signed-off-by: Dave Airlie <airlied@redhat.com>
(cherry picked from commit 5b6c7da4)

dcb220f2

mesa: fix SwapBytes handling in numerous places · d9534e47

Dave Airlie authored 9 years ago and

Emil Velikov committed 9 years ago


In a number of places the SwapBytes handling didn't handle cases with
GL_(UN)PACK_ALIGNMENT set and 7 byte width cases aligned to 8 bytes.

This adds a common routine to swap bytes a 2D image and uses this
code in:

texture storage
texture get
readpixels
swrast drawpixels.

[airlied: updated with Brian's nitpicks].

Cc: "11.0" <mesa-stable@lists.freedesktop.org>
Reviewed-by: Brian Paul <brianp@vmware.com>
Signed-off-by: Dave Airlie <airlied@redhat.com>
(cherry picked from commit 0ad3a475)

d9534e47

radeonsi: fix memory usage checking for big IBs · 63b4e6bf

Marek Olšák authored 9 years ago and

Emil Velikov committed 9 years ago


Cc: 11.0 <mesa-stable@lists.freedesktop.org>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Acked-by: Christian König <christian.koenig@amd.com>
(cherry picked from commit 05af645a)

63b4e6bf

radeonsi: set all 16 viewport Z bounds for GL 4.1 · a5dee227

Marek Olšák authored 9 years ago and

Emil Velikov committed 9 years ago


Cc: 11.0 <mesa-stable@lists.freedesktop.org>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Acked-by: Christian König <christian.koenig@amd.com>
(cherry picked from commit 08775a21)

a5dee227

radeonsi: fix a Unigine Heaven hang when drirc is missing · 1aea7812

Marek Olšák authored 9 years ago and

Emil Velikov committed 9 years ago


Cc: 10.6 11.0 <mesa-stable@lists.freedesktop.org>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Acked-by: Christian König <christian.koenig@amd.com>
(cherry picked from commit 9b510a96)

1aea7812

i965: Prevent coordinate overflow in intel_emit_linear_blit · f0180a37

Chris Wilson authored 9 years ago and

Emil Velikov committed 9 years ago


Fixes regression from
commit 8c17d538
Author: Kenneth Graunke <kenneth@whitecape.org>
Date:   Wed Apr 15 03:04:33 2015 -0700

    i965: Make intel_emit_linear_blit handle Gen8+ alignment restrictions.

which adjusted the coordinates to be relative to the nearest cacheline.
However, this then offsets the coordinates by up to 63 and this may then
cause them to overflow the BLT limits. For the well aligned large
transfer case, we can use 32bpp pixels and so reduce the coordinates by
4 (versus the current 8bpp pixels). We also have to be more careful
doing the last line just in case it may exceed the coordinate limit.

Reported-and-tested-by:  <kaillasse91@hotmail.fr>
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=90734


Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
Cc: Kenneth Graunke <kenneth@whitecape.org>
Cc: Ian Romanick <ian.d.romanick@intel.com>
Cc: Anuj Phogat <anuj.phogat@gmail.com>
Cc: mesa-stable@lists.freedesktop.org
Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>
(cherry picked from commit d38a5601)

f0180a37

r600g: fix calculation for gpr allocation · fe77d714

Dave Airlie authored 9 years ago and

Emil Velikov committed 9 years ago


I've been chasing a geom shader hang on rv635 since I wrote
r600 geom code, and finally I hacked some values from fglrx
in and I could run texelfetch without failures.

This is totally my fault as well, maths fail 101.

This makes geom shaders on r600 not fail heavily.

Cc: "10.6" "11.0" <mesa-stable@lists.freedesktop.org>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>
Signed-off-by: Dave Airlie <airlied@redhat.com>
(cherry picked from commit 0de53ccc)

fe77d714

r600/sb: update last_cf for finalize if. · fb119b22

Dave Airlie authored 9 years ago and

Emil Velikov committed 9 years ago


As Glenn did for finalize_loop we need to update_cf when we
add a POP at the end of a shader.

I think this fixes one of the earlier shader going off end
of memory problems we've stopped.

Reviewed-by: Glenn Kennard <glenn.kennard@gmail.com>
Cc: "10.6" "11.0" <mesa-stable@lists.freedesktop.org>
Signed-off-by: Dave Airlie <airlied@redhat.com>
(cherry picked from commit 3063913f)

fb119b22

Sep 01, 2015

egl: scons: fix the haiku build, do not build the dri2 backend · 50306a33
Alexander von Gluck authored 9 years ago and Emil Velikov committed 9 years ago
```
Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>
(cherry picked from commit 5abbd1ca)
Fixes: 78674631(egl: remove the non-haiku scons build)
```
50306a33

freedreno/a4xx: formats update · cf007af8

Rob Clark authored 9 years ago and

Emil Velikov committed 9 years ago


Fixes glamor, which wants to use R8 integer textures.

Signed-off-by: Rob Clark <robclark@freedesktop.org>
(cherry picked from commit 000e2253)

cf007af8

freedreno: update generated headers · 7d576419
Rob Clark authored 9 years ago and Emil Velikov committed 9 years ago
```
Signed-off-by: Rob Clark <robclark@freedesktop.org>
(cherry picked from commit afb6c24a)
```
7d576419

r600: move prim convert from geom shader to function. · 893caebf

Dave Airlie authored 9 years ago and

Emil Velikov committed 9 years ago


This should avoid C++ fail including this header.

Reviewed-by: Marek Olšák <marek.olsak@amd.com>
Signed-off-by: Dave Airlie <airlied@redhat.com>
(cherry picked from commit 03b7ec87)
Fixes: 69418831 (r600: port si_conv_prim_to_gs_out from radeonsi)
Nominated-by: Marek Olšák <marek.olsak@amd.com>

893caebf

Aug 31, 2015

Update version to 11.0.0-rc2 · 3f8d4421
Emil Velikov authored 9 years ago
```
Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>
```
mesa-11.0.0-rc2

3f8d4421
gallium/radeon: fix the ADDRESS_HI mask for EVENT_WRITE CIK packets · 579ca506
Marek Olšák authored 9 years ago and Emil Velikov committed 9 years ago
```
Cc: mesa-stable@lists.freedesktop.org
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
(cherry picked from commit 437cb1e3)
```
579ca506

freedreno/a3xx: add basic clip plane support · 94205d0a

Ilia Mirkin authored 9 years ago and

Emil Velikov committed 9 years ago


The hardware is capable of dealing with GL1-style user clip planes.
No clip vertex, no clip distances. Fixes a number of ucp tests, as well
as neverball.

Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Cc: "11.0" <mesa-stable@lists.freedesktop.org>
(cherry picked from commit 58e24b47)

94205d0a

Admin message