- Oct 09, 2020
-
-
Timur Kristóf authored
They work the same way as in legacy GS, so we can reuse that. Signed-off-by:
Timur Kristóf <timur.kristof@gmail.com> Reviewed-by:
Rhys Perry <pendingchaos02@gmail.com> Part-of: <mesa/mesa!6964>
-
Timur Kristóf authored
We can reuse the existing ES output code. Signed-off-by:
Timur Kristóf <timur.kristof@gmail.com> Reviewed-by:
Rhys Perry <pendingchaos02@gmail.com> Part-of: <!6964>
-
Timur Kristóf authored
Make the NGG VS/TES code easier to follow, give better names to some functions and make ngg_nogs_early_prim_export a variable. Signed-off-by:
Timur Kristóf <timur.kristof@gmail.com> Reviewed-by:
Rhys Perry <pendingchaos02@gmail.com> Part-of: <!6964>
-
Timur Kristóf authored
Use lshl_or instead of lshl_add, which makes it more robust in handling -1 and -2 indices which will now just become null exports, which is what we want. Signed-off-by:
Timur Kristóf <timur.kristof@gmail.com> Reviewed-by:
Rhys Perry <pendingchaos02@gmail.com> Part-of: <!6964>
-
Timur Kristóf authored
Signed-off-by:
Timur Kristóf <timur.kristof@gmail.com> Reviewed-by:
Rhys Perry <pendingchaos02@gmail.com> Part-of: <!6964>
-
Timur Kristóf authored
Previously, this function inferred the vertex and primitive counts from the gs_tg_info shader argument, but in case of NGG GS, it will need to be calculated in runtime. Signed-off-by:
Timur Kristóf <timur.kristof@gmail.com> Reviewed-by:
Rhys Perry <pendingchaos02@gmail.com> Part-of: <mesa/mesa!6964>
-
Timur Kristóf authored
Signed-off-by:
Timur Kristóf <timur.kristof@gmail.com> Reviewed-by:
Rhys Perry <pendingchaos02@gmail.com> Part-of: <!6964>
-
Timur Kristóf authored
Signed-off-by:
Timur Kristóf <timur.kristof@gmail.com> Reviewed-by:
Rhys Perry <pendingchaos02@gmail.com> Part-of: <!6964>
-
Timur Kristóf authored
Signed-off-by:
Timur Kristóf <timur.kristof@gmail.com> Reviewed-by:
Rhys Perry <pendingchaos02@gmail.com> Part-of: <!6964>
-
Timur Kristóf authored
Signed-off-by:
Timur Kristóf <timur.kristof@gmail.com> Reviewed-by:
Rhys Perry <pendingchaos02@gmail.com> Part-of: <!6964>
-
Timur Kristóf authored
Signed-off-by:
Timur Kristóf <timur.kristof@gmail.com> Reviewed-by:
Rhys Perry <pendingchaos02@gmail.com> Part-of: <!6964>
-
Timur Kristóf authored
Signed-off-by:
Timur Kristóf <timur.kristof@gmail.com> Reviewed-by:
Rhys Perry <pendingchaos02@gmail.com> Part-of: <!6964>
-
Will be useful for NGG GS and probably testing. The helpers take care of divergence but not creating correct phis. Signed-off-by:
Rhys Perry <pendingchaos02@gmail.com> Reviewed-by:
Timur Kristóf <timur.kristof@gmail.com> Part-of: <!6964>
-
Timur Kristóf authored
This makes it easier for ACO to implement NGG GS: 1. No need to keep track of vertex and primitive counts. 2. No need to discard incomplete primitives. Signed-off-by:
Timur Kristóf <timur.kristof@gmail.com> Reviewed-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <!6964>
-
Timur Kristóf authored
Signed-off-by:
Timur Kristóf <timur.kristof@gmail.com> Reviewed-by:
Jason Ekstrand <jason@jlekstrand.net> Part-of: <!6964>
-
Timur Kristóf authored
Signed-off-by:
Timur Kristóf <timur.kristof@gmail.com> Reviewed-by:
Jason Ekstrand <jason@jlekstrand.net> Part-of: <!6964>
-
Timur Kristóf authored
After each end_primitive and at the end of the shader before emitting set_vertex_and_primitive_count, we check if the primitive that is being emitted has enough vertices or not, and we adjust the vertex and primitive counters accordingly. As a result, if the backend uses this option, the backend compiler will not have to worry about discarding the unneeded vertices and primitives. Signed-off-by:
Timur Kristóf <timur.kristof@gmail.com> Reviewed-by:
Jason Ekstrand <jason@jlekstrand.net> Part-of: <!6964>
-
Timur Kristóf authored
Add an option to nir_lower_gs_intrinsics so that it can also track the number of emitted vertices per primitive, not just the total vertex count. Signed-off-by:
Timur Kristóf <timur.kristof@gmail.com> Reviewed-by:
Jason Ekstrand <jason@jlekstrand.net> Part-of: <!6964>
-
Timur Kristóf authored
Add an option to nir_lower_gs_intrinsics which tells it to track the number of emitted primitives, not just vertices. Additionally, also make it per-stream. Also rename the set_vertex_count intrinsic to set_vertex_and_primitive_count. Signed-off-by:
Timur Kristóf <timur.kristof@gmail.com> Reviewed-by:
Jason Ekstrand <jason@jlekstrand.net> Part-of: <mesa/mesa!6964>
-
Val Packett authored
CLOCK_MONOTONIC_FAST is a similar clock from FreeBSD. Acked-by:
Eric Engestrom <eric.engestrom@intel.com> Reviewed-by:
Matt Turner <mattst88@gmail.com> Part-of: <mesa/mesa!6995>
-
Emmanuel Vadot authored
Based on the OpenBSD variant. The only difference between those two system is the sysctl mib. Signed-off-by:
Emmanuel Vadot <manu@FreeBSD.org> Reviewed-by:
Matt Turner <mattst88@gmail.com> Part-of: <!6995>
-
Samuel Pitoiset authored
No changes, they are already similar. Signed-off-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by:
Timur Kristóf <timur.kristof@gmail.com> Part-of: <!6938>
-
Samuel Pitoiset authored
To match ACO. Totals from 268 (0.20% of 136420) affected shaders: CodeSize: 1214060 -> 1214096 (+0.00%); split: -0.05%, +0.06% Signed-off-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by:
Timur Kristóf <timur.kristof@gmail.com> Part-of: <!6938>
-
Andreas Baierl authored
Reviewed-by:
Vasily Khoruzhick <anarsoul@gmail.com> Signed-off-by:
Andreas Baierl <ichgeh@imkreisrum.de> Part-of: <!7044>
-
Samuel Pitoiset authored
Signed-off-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by:
Rhys Perry <pendingchaos02@gmail.com> Part-of: <!6979>
-
Dave Airlie authored
Suggested on irc seems like a good plan. Reviewed-by:
Pierre Moreau <dev@pmoreau.org> Reviewed-by:
Daniel Stone <daniels@collabora.com> Part-of: <!7051>
-
- Oct 08, 2020
-
-
Timothy Arceri authored
Reviewed-by:
Marek Olšák <marek.olsak@amd.com> Part-of: <!7059>
-
Vinson Lee authored
mpTrackMemAccessFuncTy is not used anywhere. Fix defect reported by Coverity Scan. Uninitialized pointer field (UNINIT_CTOR) uninit_member: Non-static class member mpTrackMemAccessFuncTy is not initialized in this constructor nor in any functions that it calls. Suggested-by:
Jan Zielinski <jan.zielinski@intel.com> Signed-off-by:
Vinson Lee <vlee@freedesktop.org> Reviewed-by:
Jan Zielinski <jan.zielinski@intel.com> Part-of: <!6930>
-
Vinson Lee authored
GCC and Clang support --std and -std options but Intel C++ Compiler only supports -std. icpc: command line warning #10159: invalid argument for option '--std' Fixes: 8a05d6ff ("driconf: Make the driver's declarations be structs instead of XML.") Signed-off-by:
Vinson Lee <vlee@freedesktop.org> Reviewed-by:
Eric Anholt <eric@anholt.net> Part-of: <!7020>
-
Emma Anholt authored
This is lifted straight from anv, which seems like a reasonable way to go. Part-of: <!7015>
-
Emma Anholt authored
And document where to find information on qcom gralloc's private handle layout. I chose not to #include the gralloc_priv because it seems that there's not much we need yet, and I'm hoping we can avoid the build-time dependency on the specific platform. Part-of: <!7015>
-
Emma Anholt authored
Part-of: <!7015>
-
Emma Anholt authored
Need to land KGSL in CI! Fixes: 8163c818 ("turnip: implement timestamp fences/semaphores for kgsl backend") Part-of: <!7015>
-
Nanley Chery authored
This reverts commit bcfec61d. The previous patch fixed the underlying issue that the above commit was actually working around. It turns out that the previously observed performance regression was due to invalid aux-map entries for multi-layer HiZ+CCS buffers. Reviewed-by:
Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <!7046>
-
Nanley Chery authored
Fixes rendering corruption in the shadowmappingcascade Sascha Willems Vulkan demo. To see the corruption, I adjusted the demo options as follows: 1. Enable "Display depth map" 2. Set "Split lambda" to 0.100 3. Make "Cascade" non-zero. Fixes: 80ffbe91 ("anv: Add support for HiZ+CCS") Reviewed-by:
Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <mesa/mesa!7046>
-
Emma Anholt authored
Shaders may not use a particular region of a UBO in a given shader (think UBOs shared between stages, or between shaders), and by just always extending the existing range for a given UBO, we'd waste bandwidth uploading it, and also waste our precious const space in storing the unused data. Instead, only upload exactly the ranges we can use, and merge ranges when they're neighbors. We may end up with more upload packets, but the bandwidth savings is surely going to be worth it (and if find we want a distance threshold for merging with nearby uploads, that would be easy to add). total instructions in shared programs: 9266114 -> 9255092 (-0.12%) total full in shared programs: 343162 -> 341709 (-0.42%) total constlen in shared programs: 1454368 -> 1275236 (-12.32%) total cat6 in shared programs: 93073 -> 82589 (-11.26%) total (ss) in shared programs: 212402 -> 206404 (-2.82%) total (sy) in shared programs: 122905 -> 114007 (-7.24%) Part-of: <!7036>
-
Emma Anholt authored
No more start > end for signaling that the slot isn't used, no more funny setup of num_enabled. Part-of: <!7036>
-
Emma Anholt authored
We've seen the new GS flake once or twice a week for a long time. Part-of: <!7070>
-
Mike Blumenkrantz authored
rehashing a populated hash table is very expensive, so for the case where the maximum/likely table size is already known, this function allows for pre-sizing the table to avoid ever needing a rehash Acked-by:
Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by:
Eric Anholt <eric@anholt.net> Part-of: <!7037>
-
Faith Ekstrand authored
It assumes the parent is a vector or scalar so we need to fail if it isn't. Fixes: 9190f82d "nir/opt_deref: Add an optimization for bitcasts" Reviewed-by:
Karol Herbst <kherbst@redhat.com> Part-of: <mesa/mesa!7064>
-