- 14 Apr, 2019 2 commits
-
-
Vasily Khoruzhick authored
Hardware supports writing back Z/S buffers and sampling from them, so add support for that. Signed-off-by:
Vasily Khoruzhick <anarsoul@gmail.com> Reviewed-by:
Qiang Yu <yuq825@gmail.com> Tested-by:
Icenowy Zheng <icenowy@aosc.io>
-
Vasily Khoruzhick authored
Looks like it's somehow used by subsequent PP job, so we have to preserve its contents until PP job is done. Signed-off-by:
Vasily Khoruzhick <anarsoul@gmail.com> Reviewed-by:
Qiang Yu <yuq825@gmail.com> Tested-by:
Icenowy Zheng <icenowy@aosc.io>
-
- 13 Apr, 2019 3 commits
-
-
Christian Gmeiner authored
Port TGSI TRUNC lowering to nir Signed-off-by:
Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by:
Jason Ekstrand <jason@jlekstrand.net>
-
maurossi authored
Adding \ prior to " in llvm version string fixes the following building errors: external/mesa/src/gallium/drivers/r600/r600_pipe_common.c:1290:14: error: expected ')' ", LLVM " MESA_LLVM_VERSION_STRING ^ <command line>:8:34: note: expanded from here ^ external/mesa/src/gallium/drivers/r600/r600_pipe_common.c:1287:10: note: to match this '(' snprintf(rscreen->renderer_string, sizeof(rscreen->renderer_string), ^ 1 error generated. Fixes: 05b114e5 ("simplify LLVM version string printing") Signed-off-by:
Mauro Rossi <issor.oruam@gmail.com> Reviewed-by:
Eric Engestrom <eric@engestrom.ch>
-
Lionel Landwerlin authored
In 628c9ca9 I forgot to apply the same -4Gb of the high address of the high heap VMA. This was previously computed in the HIGH_HEAP_MAX_ADDRESS. Many thanks to James for pointing this out. Signed-off-by:
Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reported-by:
Xiong, James <james.xiong@intel.com> Fixes: 628c9ca9 ("anv: store heap address bounds when initializing physical device") Reviewed-by:
Jason Ekstrand <jason@jlekstrand.net>
-
- 12 Apr, 2019 35 commits
-
-
Eric Anholt authored
We can use the same register spilling infrastructure for our loads/stores of indirect access of temp variables, instead of doing an if ladder. Cuts 50% of instructions and max-temps from 2 KSP shaders in shader-db. Also causes several other KSP shaders with large bodies and large loop counts to not be force-unrolled. The change was originally motivated by NOLTIS slightly modifying register pressure in piglit temp mat4 array read/write tests, triggering register allocation failures.
-
Jason Ekstrand authored
This commit adds new nir_load/store_scratch opcodes which read and write a virtual scratch space. It's up to the back-end to figure out what to do with it and where to put the actual scratch data. v2: Drop const_index comments (by anholt) Reviewed-by:
Eric Anholt <eric@anholt.net>
-
Eric Anholt authored
We were missing a * 4 even if the particular hardware matched our assumption.
-
Eric Anholt authored
-
Eric Anholt authored
This code is so touchy, trying to emit the minimum amount of address math. Some day we'll move it all to NIR, I hope.
-
Eric Anholt authored
While waiting for the CSD UABI to get reviewed, I keep having to rebase the CS patch. Just land the compiler side for now to keep it from diverging. For now this covers just GLES 3.1 compute shaders, not CL kernels.
-
Eric Anholt authored
We're using ARB_debug_output for the main shader-db, but I had this env var left around from the shader-db-2 support (vc4 apitrace-based). Keep the env var around since it's nice sometimes to get the stats on a shader you're optimizing without having to do a shader-db run, but drop the old formatting that's not useful and keeps tricking me when I go to add another measurement to the shader-db output.
-
Eric Anholt authored
This gives us finer-grained feedback on how we're doing on register pressure than "did we trigger a new shader to spill or drop thread count?"
-
Eric Anholt authored
-
Eric Anholt authored
A shader invocation always executes 16 channels together, so we often end up multiplying things by this magic 16 number. Give it a name.
-
Eric Anholt authored
I was thinking about a refactor, and needed to read this first. Reviewed-by:
Jason Ekstrand <jason@jlekstrand.net>
-
Eric Anholt authored
Please don't make me read a const_index[] expression ever again. Reviewed-by:
Jason Ekstrand <jason@jlekstrand.net>
-
Eric Anholt authored
The constant_index slots are named right there in the intrinsic definition, and the comment is just a chance to get out of sync. Noticed while reviewing the lower_to_scratch changes that copy-and-pasted wrong comments, and load_ubo and load_per_vertex_output had incorrect comments currently. Reviewed-by:
Jason Ekstrand <jason@jlekstrand.net>
-
Sagar Ghuge authored
We will never hit a condition where we have src1 and src2 as immediate operands. Signed-off-by:
Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by:
Matt Turner <mattst88@gmail.com> Reviewed-by:
Jason Ekstrand <jason@jlekstrand.net>
-
Kenneth Graunke authored
gl_nir_lower_samplers_as_deref splits structure uniform variables, creating new variables for individual fields. As part of that, it calculates a new location. It then never set this on the new variables. Thanks to Michael Fiano for finding this bug. Fixes crashes on i965 with Piglit's new tests/spec/glsl-1.10/execution/samplers/uniform-struct test, which was reduced from the failing case in Michael's app. Fixes: f003859f nir: Make gl_nir_lower_samplers use gl_nir_lower_samplers_as_deref Reviewed-by:
Timothy Arceri <tarceri@itsqueeze.com>
-
kszaq authored
32-bit needs mmap64 for 64-bit offsets. We get 64-bit offsets from kernel. Signed-off-by:
Mateusz Krzak <kszaquitto@gmail.com> Reviewed-by:
Alyssa Rosenzweig <alyssa@rosenzweig.io>
-
kszaq authored
Required for 64-bit kernel to interpret the pointer from 32-bit userspace. Signed-off-by:
Mateusz Krzak <kszaquitto@gmail.com> Reviewed-by:
Alyssa Rosenzweig <alyssa@rosenzweig.io>
-
Jason Ekstrand authored
We were always programming it with the Broadwell convention which is too large by a factor of two on Haswell and just plain wrong on IVB and BYT. Reviewed-by:
Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable@lists.freedesktop.org
-
Eric Engestrom authored
Suggested-by:
Karol Herbst <karolherbst@gmail.com> Signed-off-by:
Eric Engestrom <eric.engestrom@intel.com>
-
Marek Olšák authored
Reviewed-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com>
-
Marek Olšák authored
Reviewed-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com>
-
Marek Olšák authored
Reviewed-by:
Tapani Pälli <tapani.palli@intel.com>
-
Marek Olšák authored
for Viewperf 13 Acked-by:
Timothy Arceri <tarceri@itsqueeze.com>
-
Samuel Pitoiset authored
Sources are always casted to integers. Signed-off-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by:
Marek Olšák <marek.olsak@amd.com>
-
Samuel Pitoiset authored
ac_build_image_opcode() casts if necessary and buffer images are casted too. Signed-off-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by:
Marek Olšák <marek.olsak@amd.com>
-
Samuel Pitoiset authored
It's already casted if necessary in ac_build_image_opcode(). Signed-off-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by:
Marek Olšák <marek.olsak@amd.com>
-
Samuel Pitoiset authored
Trivial. Signed-off-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by:
Marek Olšák <marek.olsak@amd.com>
-
Samuel Pitoiset authored
Signed-off-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by:
Marek Olšák <marek.olsak@amd.com>
-
Samuel Pitoiset authored
Signed-off-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by:
Marek Olšák <marek.olsak@amd.com>
-
Samuel Pitoiset authored
This changes the predicate from LessThan to Equal. Signed-off-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by:
Marek Olšák <marek.olsak@amd.com>
-
Samuel Pitoiset authored
This changes the predicate from LessThan to Equal. Signed-off-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by:
Marek Olšák <marek.olsak@amd.com>
-
Samuel Pitoiset authored
This changes the predicate from LessThan to Equal. Signed-off-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by:
Marek Olšák <marek.olsak@amd.com>
-
Samuel Pitoiset authored
Signed-off-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by:
Marek Olšák <marek.olsak@amd.com>
-
Karol Herbst authored
Fixes: 035759b6 ("nir/i965/freedreno/vc4: add a bindless bool to type size functions") Signed-off-by:
Karol Herbst <kherbst@redhat.com> Tested-by:
Icenowy Zheng <icenowy@aosc.io> Reviewed-by:
Eric Engestrom <eric.engestrom@intel.com>
-
Karol Herbst authored
Signed-off-by:
Karol Herbst <kherbst@redhat.com>
-