- 29 Jun, 2020 15 commits
-
-
Eric Anholt authored
We weren't initializing the VCM bits in the !gs path, but v33 doesn't have GS so we can just mark it unreachable. Reviewed-by:
Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <!2952>
-
Eric Anholt authored
Dave wants to drop the !TEXCOORD path from NIR, and it's easy enough to do. Untested. Reviewed-by:
Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <!2952>
-
Eric Anholt authored
Dave wants to drop the !TEXCOORD path from NIR, and it's easy enough to do. Untested. Reviewed-by:
Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <!2952>
-
Eric Anholt authored
Many drivers will need to do the same thing here, so consolidate it. Reviewed-by:
Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <!2952>
-
Jonathan Marek authored
Passes at least dEQP-VK.dynamic_state.rs_state.depth_bias_clamp Signed-off-by:
Jonathan Marek <jonathan@marek.ca> Part-of: <!5678>
-
Jonathan Marek authored
Passes dEQP-VK.rasterization.primitive_size.points.point_size_* Signed-off-by:
Jonathan Marek <jonathan@marek.ca> Part-of: <!5678>
-
Jonathan Marek authored
This is not completely tested, but matches the max array pitch allowed by A6XX_TEX_CONST_9_FLAG_BUFFER_ARRAY_PITCH. Note this still doesn't allow all image sizes, but it allows 16384x16384 cpp=4 images to work. Signed-off-by:
Jonathan Marek <jonathan@marek.ca> Part-of: <!5678>
-
Satyajit Sahu authored
VP9 allows frame to use another resolution frame as reference frames so updating the resolution for decoder when there is a resolution change. Signed-off-by:
Satyajit Sahu <satyajit.sahu@amd.com> Reviewed-by:
Leo Liu <leo.liu@amd.com> Part-of: <!5646>
-
Iago Toral authored
Reviewed-by:
Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by:
Jose Maria Casanova Crespo <jmcasanova@igalia.com> Fixes: 97566efe ("v3d: Rematerialize MOVs of uniforms instead of spilling them.") Part-of: <!5664>
-
Samuel Pitoiset authored
SPI_SHADER_COL_FORMAT allocates export memory and CB_SHADER_MASK map them to higher MRTs if necessary. The hardware allows to remap MRTs to avoid holes somehow. For example, if we have a scenario where MRT0 is unused and only MRT1 and MRT2 are used, SPI_SHADER_COL_FORMAT is 0x77 and CB_SHADER_MASK/CB_TARGET_MASK are 0x770 (this assumes SPI_SHADER_UINT16_ABGR is set). This allows us to remove one workaround that was added for fixing GPU hangs with DXVK. I think this is because SPI_SHADER_COL_FORMAT expects contiguous MRTs to be allocated. Signed-off-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by:
Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <!5434>
-
Samuel Pitoiset authored
When colorWriteMask is 0 we can assume that this color attachment is unused. Signed-off-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by:
Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <!5434>
-
Samuel Pitoiset authored
To prepare for MRTs compaction. Signed-off-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by:
Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <!5434>
-
Samuel Pitoiset authored
Signed-off-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by:
Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <!5434>
-
Samuel Pitoiset authored
It gives +1-2 FPS with Doom Eternal on Pitcairn. Signed-off-by:
Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by:
Daniel Schürmann <daniel@schuermann.dev> Part-of: <!5659>
-
Boris Brezillon authored
nir_load_store_vectorize_test.ssbo_load_adjacent_32_32_64_64 expectations need to be fixed accordingly. Signed-off-by:
Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by:
Jason Ekstrand <jason@jlekstrand.net> Part-of: <!5589>
-
- 28 Jun, 2020 7 commits
-
-
Dave Airlie authored
The code was expecting the lower 32-bits of the 64-bit to be what it wanted, don't be implicit, pull the value from the union. This should fix rendering on big endian systems since NIR was introduced. Fixes: 44a6b010 ("gallivm: add nir->llvm translation (v2)") Reviewed-by:
Timothy Arceri <tarceri@itsqueeze.com> Part-of: <!5677>
-
Jonathan Marek authored
resinfo always writes 3 components, which was not being taken into account Fixes these tests: dEQP-VK.renderpass.suballocation.attachment_sparse_filling.input_attachment_3 dEQP-VK.renderpass.suballocation.attachment_sparse_filling.input_attachment_7 Signed-off-by:
Jonathan Marek <jonathan@marek.ca> Part-of: <!5674>
-
Erik Faye-Lund authored
Ref-link have two benefits over generic links: 1. They produce the right result for non-HTML outputs 2. They get validated at build-time So let's use them for internal references instead. Reviewed-by:
Eric Engestrom <eric@engestrom.ch> Part-of: <!5671>
-
Erik Faye-Lund authored
It seems last time I tried to fix these, I missed a few spots. So let's try to get things right this time. Fixes: 429ff054 ("docs/relnotes: update internal references") Reviewed-by:
Eric Engestrom <eric@engestrom.ch> Part-of: <!5671>
-
Erik Faye-Lund authored
These were accidentally dropped when cleaning up the TOC, making links to them dead. Because we used plain links, sphinx didn't inform us that these became dead. Let's restore them. Fixes: 14f2a81b ("docs: drop open-coded toc for articles") Reviewed-by:
Eric Engestrom <eric@engestrom.ch> Part-of: <!5671>
-
Erik Faye-Lund authored
The section referenced here was removed a while ago, but it was always empty anyway. Let's just remove it instead of trying to fix it up. Reviewed-by:
Eric Engestrom <eric@engestrom.ch> Part-of: <!5671>
-
Bas Nieuwenhuizen authored
We fixed the compile error a while ago. Fixes: cc10b34e "util/disk_cache: Fix disk_cache_get_function_timestamp with disabled cache." Reviewed-by:
Drew Davenport <ddavenport@chromium.org> Reviewed-by:
Kristian H. Kristensen <hoegsberg@google.com> Part-of: <!5649>
-
- 27 Jun, 2020 2 commits
-
-
Vinson Lee authored
Fix warning reported by Coverity Scan. Double unlock (LOCK) double_unlock: mtx_unlock unlocks rb_pipe->call_mutex while it is unlocked. Fixes: 07838ff9 ("rbug: Use the call mutex") Closes: #3023Signed-off-by:
Vinson Lee <vlee@freedesktop.org> Reviewed-by:
Jakob Bornecrantz <jakob@collabora.com> Part-of: <!5196>
-
Alejandro Piñeiro authored
So it could be used by both the OpenGL and the Vulkan driver. In addition to the move, some small changes were needed to be made on the API. For example, the simulator was receiving v3d_screen on initialization, and that code setted v3d_screen->sim_file. Now it returns the new sim_file created. Reviewed-by:
Eric Anholt <eric@anholt.net> Part-of: <!5666>
-
- 26 Jun, 2020 16 commits
-
-
Kristian H. Kristensen authored
tu_ImportFenceFdKHR is used by tu_AcquireImageANDROID, which may or may not work, but let's at least keep things compiling until somebody has time to tie up the loose ends on the Android side. Part-of: <!5670>
-
Kenneth Graunke authored
Chris Wilson noted that u_default_texture_subdata's transfer path sometimes results in wasteful double copies. This patch is based on an earlier path he wrote, but updated now that we have staging blits for busy or compressed textures. Consider the case of idle, non-CCS-compressed, tiled images: The transfer-based CPU path has to return a "linear" mapping, so upon map, it mallocs a temporary buffer. u_default_texture_subdata then copies the client memory to this malloc'd buffer, and transfer unmap performs a tiled_memcpy to copy it back into the texture. By writing a direct texture_subdata() implementation, we're able to directly do a tiled_memcpy from the client memory into the destination texture, resulting in only one copy. For linear buffers, there is no advantage to doing things directly, so we simply fall back to u_default_texture_subdata()'s transfer path to avoid replicating those cases. We still may want to use GPU staging buffers for busy destinations (to avoid stalls) or CCS-compressed images (to compress the data), at which point we also fall back to the existing path. We thought to try and use a tiled temporary, but this didn't appear to help. Improves performance in x11perf -shmput500 by 1.96x on my Icelake. Closes: #2500Reviewed-by:
Tapani Pälli <tapani.palli@intel.com> Part-of: <!3818>
-
Eric Anholt authored
The device lost support closely matches the anv code for the same. Part-of: <!2769>
-
Eric Anholt authored
drmCommandWriteRead gives us a -errno, and we only checked for -1 (-EPERM, incidentally). All the callers wanted 0 for errors, which they were getting by the fact that req.value was 0-initialized in our stack allocation (though this only works as long as the kernel doesn't return an error after setting req.value to something), and -EPERM not really being an answer we would expect from an ioctl at this stage in the driver. Part-of: <!2769>
-
Eric Anholt authored
Part-of: <!2769>
-
Bas Nieuwenhuizen authored
There is only one queue for now, so for non-shared semaphores, the implementation is basically a no-op. For shared semaphores, this always uses syncobjs. This depends on syncobj support in the msm kernel driver. Part-of: <!2769>
-
Eric Anholt authored
We need this to test the new VK feature we're about to land. Part-of: <!2769>
-
Daniel Schürmann authored
While we don't allow partial subdword copies, we still need to be able to split 64bit registers Reviewed-by:
Rhys Perry <pendingchaos02@gmail.com> Part-of: <!5663>
-
Lepton Wu authored
We need to update dynamic entries related code after updating asm stubs. Fixes: 45206d76 ("mapi: Adapted libglvnd x86 tsd changes") Signed-off-by:
Lepton Wu <lepton@chromium.org> Part-of: <!5598>
-
Eric Anholt authored
I think I've seen about 3 of this error total so far, but waiting 60 minutes for the scripts to give up wastes marge time. Part-of: <!5651>
-
Rob Clark authored
In cases where every variant is a shader-cache-hit, we never need the post-finalize round of nir opt/lowering passes. So defer this until the first shader-cache-miss to avoid doing pointless work. Signed-off-by:
Rob Clark <robdclark@chromium.org> Part-of: <!5372>
-
Rob Clark authored
Adds a shader disk-cache for ir3 shader variants. Note that builds with `-Dshader-cache=false` have no-op stubs with `disk_cache_create()` that returns NULL. Binning pass variants are serialized together with their draw-pass counterparts, due to shared const-state. Signed-off-by:
Rob Clark <robdclark@chromium.org> Part-of: <!5372>
-
Rob Clark authored
For shader-cache, we are going to want to serialize them together. Which is awkward if the two related variants are not compiled together. This also decouples allocation and compile, which will simplify adding shader-cache (which still needs to allocate, but can skip compile). Signed-off-by:
Rob Clark <robdclark@chromium.org> Part-of: <!5372>
-
Rob Clark authored
Currently we always do sysmem if there is tess. And for GS, the binning pass VS ends up identical to the draw pass VS, so no point in compiling it twice. (For GS what we should do someday is generate a binning pass GS, and possibly if we can do cross-stage linking opts, an optimized binning pass VS, but the required outputs would somehow have to end up in the shader variant key.) Signed-off-by:
Rob Clark <robdclark@chromium.org> Part-of: <!5372>
-
Rob Clark authored
Just to group together the parts that will get serialized when we have shader disk-cache. Signed-off-by:
Rob Clark <robdclark@chromium.org> Part-of: <!5372>
-
Rob Clark authored
Use ir3_compiler_destroy() rather than open-coding ralloc_free(). This will give us a place to add more compiler related cleanup code in the following patches. Signed-off-by:
Rob Clark <robdclark@chromium.org> Part-of: <!5372>
-