- 14 Nov, 2020 6 commits
-
Icecream95 authored
-
Icecream95 authored
-
Icecream95 authored
u_transfer_helper writes full blocks for RGTC and S3TC, so make sure the staging resource is big enough for this to avoid memory corruption.
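The sizing issue described above can be sketched as follows. This is a minimal illustration, not the driver's actual code: `align_up`, `struct extent` and `padded_staging_extent` are hypothetical names, and the 4x4 block size used in the usage note is the block size of RGTC and S3TC.

```c
/* Round v up to the next multiple of a (a must be non-zero). */
static unsigned align_up(unsigned v, unsigned a)
{
   return (v + a - 1) / a * a;
}

/* Hypothetical helper type: dimensions of a staging resource in texels. */
struct extent {
   unsigned width;
   unsigned height;
};

/*
 * If a writer always emits whole blocks, the staging resource has to cover
 * the requested region rounded up to the block dimensions, otherwise the
 * last partial block is written past the end of the allocation.
 */
static struct extent padded_staging_extent(unsigned width, unsigned height,
                                           unsigned block_w, unsigned block_h)
{
   struct extent e;
   e.width = align_up(width, block_w);
   e.height = align_up(height, block_h);
   return e;
}
```

For a 5x7 region and 4x4 blocks, `padded_staging_extent(5, 7, 4, 4)` yields an 8x8 extent, which is what a full-block write would touch.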
-
Icecream95 authored
-
Icecream95 authored
Passing separate boolean values made it hard to determine what each argument did in the u_transfer_helper_create call.
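As a rough illustration of why a flags argument reads better than a run of booleans, consider the sketch below. All names here (`example_helper_flags`, `EXAMPLE_*`, `example_has_flag`) are hypothetical and are not the identifiers used by u_transfer_helper.

```c
#include <stdbool.h>

/* Hypothetical option bits; each name documents itself at the call site. */
enum example_helper_flags {
   EXAMPLE_SEPARATE_DEPTH_STENCIL = 1u << 0,
   EXAMPLE_FAKE_COMPRESSED        = 1u << 1,
   EXAMPLE_MSAA_MAP               = 1u << 2,
};

/* Returns true if the given option bit is set in flags. */
static bool example_has_flag(unsigned flags, enum example_helper_flags flag)
{
   return (flags & (unsigned)flag) != 0;
}
```

A call such as `create(vtbl, EXAMPLE_SEPARATE_DEPTH_STENCIL | EXAMPLE_MSAA_MAP)` states its intent directly, whereas `create(vtbl, true, false, true)` forces the reader to look up the parameter order.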
-
Icecream95 authored
-
- 13 Nov, 2020 34 commits
-
Vinson Lee authored
Fix defect reported by Coverity Scan. Extra argument to printf format specifier (PRINTF_ARGS) extra_argument: This argument was not used by the format string: vmixer->max_layers. Fixes: 89b98632 ("vdpau: Add support for parameters") Signed-off-by:
Vinson Lee <vlee@freedesktop.org> Reviewed-by:
Leo Liu <leo.liu@amd.com> Part-of: <mesa/mesa!7200>
-
Vinson Lee authored
Fix defect reported by Coverity Scan. Uninitialized pointer field (UNINIT_CTOR) uninit_member: Non-static class member score is not initialized in this constructor nor in any functions that it calls. Signed-off-by:
Vinson Lee <vlee@freedesktop.org> Reviewed-by:
Karol Herbst <kherbst@redhat.com> Part-of: <mesa/mesa!7324>
-
Rob Clark authored
Fixes: 74140c2e ("freedreno/ir3: convert over to ralloc") Signed-off-by:
Rob Clark <robdclark@chromium.org> Part-of: <mesa/mesa!7612>
-
Rob Clark authored
With indirect load_uniform, we can only encode 10b of constant base offset. This pass detects problematic cases and peels out the high bits of the base offset. Signed-off-by:
Rob Clark <robdclark@chromium.org> Part-of: <mesa/mesa!7612>
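The idea of peeling the high bits off a base offset that must fit in 10 bits can be sketched as below. This is only an assumption about how such a split might look, not the ir3 pass itself: `struct split_base` and `split_uniform_base` are hypothetical names, and the peeled part is assumed to be added to the indirect offset by the caller.

```c
#include <stdint.h>

#define BASE_BITS 10u                        /* encodable constant base width */
#define BASE_MASK ((1u << BASE_BITS) - 1u)   /* 0x3ff */

/* Hypothetical result type: the two halves of a base offset. */
struct split_base {
   uint32_t encoded_base;  /* low bits, fits in the instruction encoding */
   uint32_t peeled;        /* high bits, to be folded into the indirect offset */
};

/* Split base so that encoded_base + peeled == base and encoded_base < 1024. */
static struct split_base split_uniform_base(uint32_t base)
{
   struct split_base s;
   s.encoded_base = base & BASE_MASK;
   s.peeled = base & ~BASE_MASK;
   return s;
}
```

For example, a base of 1500 splits into an encodable base of 476 and a peeled part of 1024, while a base of 100 is left untouched with nothing peeled.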
-
Nanley Chery authored
With the last commit, there are no more users of this code. Reviewed-by:
Jason Ekstrand <jason@jlekstrand.net> Part-of: <mesa/mesa!5388>
-
Nanley Chery authored
During a blorp_copy between two color surfaces, the source and destination formats are re-interpreted to UINT (if possible) to avoid losing bits. If either surface has CCS_E, then extra steps are taken to support fast-cleared blocks with this format re-interpretation. Each clear value is packed in the original format, then unpacked in the new UINT format. This is then placed into the surface state object for some platforms. There are a couple of problems here: 1. This is only being done for CCS_E, but MCS also supports fast-clears. 2. These steps aren't enough for fast-clears on gen11. On gen11, the clear color isn't part of the surface state object that BLORP creates. Instead, it's stored in a separate BO that the surface state object references. Since that BO doesn't get updated during blorp_copy, the incorrect/unconverted clear color is used for the copy operation. I didn't measure any performance gain from this code, so this patch simply disables the feature. Makes i965 pass the nv_copy_image-simple piglit test on gen11. Reviewed-by:
Jason Ekstrand <jason@jlekstrand.net> Part-of: <!5388>
-
Nanley Chery authored
During a blorp_copy between two color surfaces, the source and destination formats are re-interpreted to UINT (if possible) to avoid losing bits. If either surface has CCS_E, then extra steps are taken to support fast-cleared blocks with this format re-interpretation. Each clear value is packed in the original format, then unpacked in the new UINT format. This is then placed into the surface state object for some platforms. There are a couple of problems here: 1. This is only being done for CCS_E, but MCS also supports fast-clears. 2. These steps aren't enough for fast-clears on gen11+. On gen11+, the clear color isn't part of the surface state object that BLORP creates. Instead, it's stored in a separate BO that the surface state object references. Since that BO doesn't get updated during blorp_copy, the incorrect/unconverted clear color is used for the copy operation. I didn't measure any performance gain from this code, so this patch simply disables the feature. Makes iris pass the nv_copy_image-simple piglit test on gen11+. Reviewed-by:
Jason Ekstrand <jason@jlekstrand.net> Part-of: <mesa/mesa!5388>
-
Icecream95 authored
Fixes flickering in Neverwinter Nights. Closes: mesa/mesa#3789 Fixes: e6152091 ("panfrost: Use canonical characterization of tls_size") Reviewed-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <mesa/mesa!7613>
-
Alyssa Rosenzweig authored
Fixes glmark2 -bshadow, which uses a depth-only render target. Signed-off-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by:
Boris Brezillon <boris.brezillon@collabora.com> Tested-by:
Robin Murphy <robin.murphy@arm.com> Part-of: <mesa/mesa!7557>
-
Alyssa Rosenzweig authored
Always permit falling back to linear, now that linear Z/S is supported on SFBD. Signed-off-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by:
Boris Brezillon <boris.brezillon@collabora.com> Tested-by:
Robin Murphy <robin.murphy@arm.com> Part-of: <mesa/mesa!7557>
-
Alyssa Rosenzweig authored
Signed-off-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by:
Boris Brezillon <boris.brezillon@collabora.com> Tested-by:
Robin Murphy <robin.murphy@arm.com> Part-of: <mesa/mesa!7557>
-
Nanley Chery authored
Until recently, the depth value from glClearBufferfv wasn't clamped. Before then, this patch enabled the driver to fail the clearbuffer-depth piglit test with INTEL_DEBUG=nofc. This is because convert_depth_value relies on the assumption that the depth value is clamped. Reviewed-by:
Eric Anholt <eric@anholt.net> Part-of: <mesa/mesa!7410>
-
Nanley Chery authored
Reviewed-by:
Eric Anholt <eric@anholt.net> Part-of: <mesa/mesa!7410>
-
Nanley Chery authored
OpenGL 3.0 spec, section 4.2.3 "Clearing the Buffers": depth and stencil are the values to clear the depth and stencil buffers to, respectively. Clamping and type conversion for fixed-point depth buffers are performed in the same fashion as for ClearDepth. Enables iris to pass the clearbuffer-depth-stencil piglit test. Cc: mesa-stable Reviewed-by:
Eric Anholt <eric@anholt.net> Part-of: <mesa/mesa!7410>
-
Nanley Chery authored
OpenGL 3.0 spec, section 4.2.3 "Clearing the Buffers": If buffer is DEPTH, drawbuffer must be zero, and value points to the single depth value to clear the depth buffer to. Clamping and type conversion for fixed-point depth buffers are performed in the same fashion as for ClearDepth. Enables iris to pass the clearbuffer-depth piglit test. v2. Add spec citation. (Eric Anholt) v3. Don't clamp floating point formats. (Eric Anholt) Cc: mesa-stable Reviewed-by:
Marek Olšák <marek.olsak@amd.com> Reviewed-by:
Eric Anholt <eric@anholt.net> Part-of: <mesa/mesa!7410>
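The clamping rule cited above, together with the v3 note about floating-point formats, can be sketched as follows. This is a minimal illustration under the assumption that the caller already knows whether the depth buffer format is floating point; `clear_depth_value` is a hypothetical name, not a Mesa function.

```c
#include <stdbool.h>

/*
 * Return the depth value to clear with. Fixed-point depth formats get the
 * value clamped to [0, 1], as for ClearDepth; floating-point formats are
 * passed through unclamped.
 */
static float clear_depth_value(float depth, bool is_float_format)
{
   if (is_float_format)
      return depth;
   if (depth < 0.0f)
      return 0.0f;
   if (depth > 1.0f)
      return 1.0f;
   return depth;
}
```

With a fixed-point format, a requested clear value of 1.5 becomes 1.0 and -0.25 becomes 0.0; with a floating-point format both values are kept as given.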
-
Nanley Chery authored
Reviewed-by:
Marek Olšák <marek.olsak@amd.com> Reviewed-by:
Eric Anholt <eric@anholt.net> Part-of: <mesa/mesa!7410>
-
Rhys Perry authored
fossil-db (Navi):
Totals from 710 (0.51% of 138917) affected shaders:
SGPRs: 45007 -> 44791 (-0.48%)
VGPRs: 36116 -> 36284 (+0.47%); split: -0.03%, +0.50%
CodeSize: 3811540 -> 3795332 (-0.43%); split: -0.43%, +0.00%
MaxWaves: 8018 -> 8005 (-0.16%)
Instrs: 758383 -> 755084 (-0.44%); split: -0.44%, +0.01%
Cycles: 5786240 -> 5758848 (-0.47%); split: -0.48%, +0.00%
Signed-off-by:
Rhys Perry <pendingchaos02@gmail.com> Reviewed-by:
Daniel Schürmann <daniel@schuermann.dev> Part-of: <mesa/mesa!7600>
-
Rhys Perry authored
Signed-off-by:
Rhys Perry <pendingchaos02@gmail.com> Reviewed-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by:
Daniel Schürmann <daniel@schuermann.dev> Part-of: <mesa/mesa!7600>
-
Daniel Schürmann authored
Affected games are Detroit: Become Human and Doom Eternal.
Totals from 6262 (4.54% of 138013) affected shaders (RAVEN):
SGPRs: 678472 -> 678640 (+0.02%)
VGPRs: 498288 -> 498360 (+0.01%)
CodeSize: 67064196 -> 65926000 (-1.70%)
MaxWaves: 19390 -> 19382 (-0.04%)
Instrs: 13175372 -> 12932517 (-1.84%)
Cycles: 1444043256 -> 1443022576 (-0.07%); split: -0.08%, +0.01%
VMEM: 929560 -> 908726 (-2.24%); split: +0.39%, -2.63%
SMEM: 406207 -> 400062 (-1.51%); split: +0.46%, -1.97%
VClause: 215168 -> 215031 (-0.06%)
SClause: 443312 -> 442324 (-0.22%); split: -0.25%, +0.03%
Copies: 1350793 -> 1344326 (-0.48%); split: -0.52%, +0.04%
Branches: 506432 -> 506370 (-0.01%); split: -0.02%, +0.01%
PreSGPRs: 619652 -> 619619 (-0.01%)
PreVGPRs: 473212 -> 473168 (-0.01%)
Reviewed-by:
Rhys Perry <pendingchaos02@gmail.com> Part-of: <mesa/mesa!175>
-
Mike Blumenkrantz authored
for shader keys to work right, these need to actually update the shader module that's being used Reviewed-by:
Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <mesa/mesa!7193>
-
Mike Blumenkrantz authored
eliminate gl_SampleMask writes when necessary to mimic GL behavior Reviewed-by:
Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <mesa/mesa!7193>
-
Mike Blumenkrantz authored
we need to check for rebuild any time fb samples becomes 0 in order to remove any writes to gl_SampleMask Reviewed-by:
Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <mesa/mesa!7193>
-
Mike Blumenkrantz authored
this is going to get messy as we fill them out, so at least we can keep things split up a bit for organizational sake Reviewed-by:
Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <mesa/mesa!7193>
-
Mike Blumenkrantz authored
we want to be able to reuse this between programs with matching slot maps, and refcounting allows that without having to copy the table; also add some docs about all the different shader caching structs Reviewed-by:
Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <mesa/mesa!7193>
-
Mike Blumenkrantz authored
these are cached per-program for now since we have to ensure that the slot map always matches up between shader states Reviewed-by:
Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <mesa/mesa!7193>
-
Boris Brezillon authored
Signed-off-by:
Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <mesa/mesa!7530>
-
Boris Brezillon authored
The lane selection only cares about intra-32b swizzling. Add a modulo on the test to allow selecting lanes when the swizzle is above 4 (needed for MKVEC.v4i8 lane selection). Signed-off-by:
Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <mesa/mesa!7530>
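The modulo mentioned above can be illustrated with a small sketch. It assumes that a swizzle index counts components across consecutive 32-bit words and that the component size is 8, 16 or 32 bits; `lane_in_word` is a hypothetical helper, not the function changed by this commit.

```c
/*
 * Map a swizzle index to the lane inside its 32-bit word.
 * component_bits is assumed to be 8, 16 or 32.
 */
static unsigned lane_in_word(unsigned swizzle, unsigned component_bits)
{
   unsigned lanes_per_word = 32u / component_bits;
   return swizzle % lanes_per_word;
}
```

With 8-bit components there are four lanes per word, so swizzle 5 selects lane 1 of the second word; without the modulo, any swizzle of 4 or more would fall outside the valid lane range.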
-
Boris Brezillon authored
Ideally we should choose "abcd" instead of "xyzw" when accessing a vector that has more than 4 components, but bi_instruction does not provide this information, so let's keep things simple for now. Signed-off-by:
Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <mesa/mesa!7530>
-
Boris Brezillon authored
Signed-off-by:
Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <mesa/mesa!7530>
-
Boris Brezillon authored
Those are needed to implement derivatives. Signed-off-by:
Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <mesa/mesa!7530>
-
Boris Brezillon authored
The encoding is different between v6 and v7. Signed-off-by:
Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <mesa/mesa!7530>
-
Boris Brezillon authored
Instead of adding a BIR_INDEX_ per FAU index, let's group some of those together. Signed-off-by:
Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <mesa/mesa!7530>
-
Boris Brezillon authored
Some instructions differ between v6 and v7 and we'll need to know which architecture we're compiling for if we want to generate the right instructions. Signed-off-by:
Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <mesa/mesa!7530>
-
Boris Brezillon authored
We use opaque uint to encode formats everywhere else, so let's make things consistent and convert the only user to an opaque int too. Signed-off-by:
Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <mesa/mesa!7530>
-