- 18 Nov, 2020 1 commit
-
-
Marek Olšák authored
This removes some overhead from tc_draw_vbo and increases the maximum number of draws per batch from 153 to 192 in u_threaded_context. Reviewed-by:
Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <!7441>
-
- 10 Nov, 2020 1 commit
-
-
Mike Blumenkrantz authored
this is another case where the hardcoded value was specific to radeon drivers Reviewed-by:
Adam Jackson <ajax@redhat.com> Reviewed-by:
Marek Olšák <marek.olsak@amd.com> Part-of: <!7475>
-
- 05 Nov, 2020 1 commit
-
-
Mike Blumenkrantz authored
this coincidentally worked because radeonsi has a hardcoded value of 64, but other drivers do not use this value and then things are subtly broken Reviewed-by:
Adam Jackson <ajax@redhat.com> Reviewed-by:
Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <!7452>
-
- 31 Oct, 2020 2 commits
-
-
Marek Olšák authored
Reviewed-by:
Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <!7056>
-
Marek Olšák authored
Reviewed-by:
Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <!7056>
-
- 30 Oct, 2020 1 commit
-
-
Marek Olšák authored
This queries the CPU cache topology correctly. Acked-by:
Jose Fonseca <jfonseca@vmware.com> Reviewed-by:
Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <!7054>
-
- 07 Oct, 2020 1 commit
-
-
Marek Olšák authored
Reviewed-by:
Timothy Arceri <tarceri@itsqueeze.com> Part-of: <!6955>
-
- 06 Oct, 2020 4 commits
-
-
Marek Olšák authored
discovered by valgrind Fixes: fd6a5e11Reviewed-by:
Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <!6952>
-
Marek Olšák authored
This can reduce overhead depending on whether st/mesa uses the ASYNC flag and the app flushes often. Reviewed-by:
Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <!6952>
-
Marek Olšák authored
In sysprof, the CPU time spend in tc_set_constant_buffer drops from 7% to 1.4% in a release build. It doesn't make sense to me. Reviewed-by:
Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <!6952>
-
Marek Olšák authored
Reviewed-by:
Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <!7031>
-
- 22 Sep, 2020 1 commit
-
-
Marek Olšák authored
Acked-by:
Eric Anholt <eric@anholt.net> Part-of: <!5749>
-
- 15 May, 2020 1 commit
-
-
Marek Olšák authored
This was the original intention, but it wasn't fully implemented. Fixes: 7f22e0fd Closes: #2953 Tested by: John Galt <johngalt@fake.mail> Acked-by:
Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <!5030>
-
- 13 May, 2020 1 commit
-
-
Marek Olšák authored
Acked-by:
Eric Anholt <eric@anholt.net> Acked-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Acked-by:
Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <!4902>
-
- 29 Apr, 2020 1 commit
-
-
Mike Blumenkrantz authored
this adds a new pipe cap that drivers can support which enables passing buffer clears with scissor test enabled through to be handled by the driver instead of having mesa draw a quad also adjust all existing clear() hooks to have the new parameter Reviewed-by:
Marek Olšák <marek.olsak@amd.com> Reviewed-by:
Vasily Khoruzhick <anarsoul@gmail.com> Reviewed-by:
Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by:
Kenneth Graunke <kenneth@whitecape.org> Part-of: <!4310>
-
- 16 Apr, 2020 1 commit
-
-
Pierre-Eric Pelloux-Prayer authored
tc_transfer_map maps buffers directly, but the unmap operation is executed in the driver thread. When an application does a lot of map/unmap operations, without flushing, this increase the RAM used (and eventually get the app killed by the oom-killer). This commit allows tc to keep track of how many bytes were mapped during the current batch. When this estimation becomes higher than a threshold, we flush the batch. See: #2735Reviewed-by:
Marek Olšák <marek.olsak@amd.com> Part-of: <!4508>
-
- 31 Mar, 2020 1 commit
-
-
Marek Olšák authored
This is thread-safe and we want it to be done immediately for good L3 cache usage. Acked-by:
Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <!4369>
-
- 17 Mar, 2020 1 commit
-
-
Marek Olšák authored
This was missing for the READ case. This improves glBegin/End performance. (vbo maps with WRITE | READ | UNSYCHRONIZED) Acked-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by:
Mathias Fröhlich <Mathias.Froehlich@web.de> Part-of: <!4153>
-
- 10 Mar, 2020 2 commits
-
-
Karol Herbst authored
This reverts commit e1ffb72a
-
Karol Herbst authored
needed by CL Signed-off-by:
Karol Herbst <kherbst@redhat.com> Reviewed-by:
Marek Olšák <marek.olsak@amd.com> Reviewed-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Marge Bot <!4072> Part-of: <!4072>
-
- 28 Feb, 2020 2 commits
-
-
Marek Olšák authored
Reviewed-by:
Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <!3990>
-
Marek Olšák authored
Reviewed-by:
Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <!3990>
-
- 14 Nov, 2019 1 commit
-
-
Eric Anholt authored
To make PIPE_FORMATs usable from non-gallium parts of Mesa, I want to move their helpers out of gallium. Since u_format used util_copy_rect(), I moved that in there, too. I've put it in a separate directory in util/ because it's a big chunk of related code, and it's not clear to me whether we might want it as a separate library from libmesa_util at some point. Closes: #1905Acked-by:
Marek Olšák <marek.olsak@amd.com> Reviewed-by:
Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
-
- 28 Oct, 2019 3 commits
-
-
Timothy Arceri authored
Just use the inlined function directly. The macro was replaced with the function in ebe304fa. Reviewed-by:
Eric Engestrom <eric@engestrom.ch>
-
Timothy Arceri authored
Just use the inlined function directly. The macro was replaced with the function in ebe304fa. Reviewed-by:
Eric Engestrom <eric@engestrom.ch>
-
Timothy Arceri authored
Just use the inlined function directly. The macro was replaced with the function in ebe304fa. Reviewed-by:
Eric Engestrom <eric@engestrom.ch>
-
- 08 Oct, 2019 1 commit
-
-
Marek Olšák authored
u_upload_mgr sets it, so that util_range_add can skip the lock. The time spent in tc_transfer_flush_region decreases from 0.8% to 0.2% in torcs on radeonsi. Reviewed-by:
Kenneth Graunke <kenneth@whitecape.org>
-
- 19 Sep, 2019 1 commit
-
-
Timothy Arceri authored
When both UTIL_QUEUE_INIT_RESIZE_IF_FULL and UTIL_QUEUE_INIT_USE_MINIMUM_PRIORITY are set, we can get into a situation where the queue never executes and grows to a huge size due to all other threads being busy. This is the case with the shader cache when attempting to compile a huge number of shaders up front. If all threads are busy compiling shaders the cache queues memory use can climb into the many GBs very fast. The use of these two flags with the shader cache is intended to allow shaders compiled at runtime to be compiled as fast as possible. To avoid huge memory use but still allow the queue to perform optimally in the run time compilation case, we now add the ability to track memory consumed by the jobs in the queue and limit it to a hardcoded 256MB which should be more than enough. Reviewed-by:
Marek Olšák <marek.olsak@amd.com>
-
- 31 Jul, 2019 2 commits
-
-
Eric Engestrom authored
Suggested-by:
Jason Ekstrand <jason@jlekstrand.net> Signed-off-by:
Eric Engestrom <eric.engestrom@intel.com> Reviewed-by:
Matt Turner <mattst88@gmail.com>
-
Eric Engestrom authored
MAYBE_UNUSED is going away, so let's replace legitimate uses of it with UNUSED, which the former aliased to so far anyway. Signed-off-by:
Eric Engestrom <eric.engestrom@intel.com> Reviewed-by:
Matt Turner <mattst88@gmail.com>
-
- 23 Jul, 2019 1 commit
-
-
Ilia Mirkin authored
This is a relatively minimal change to adjust all the gallium interfaces to use bool instead of boolean. I tried to avoid making unrelated changes inside of drivers to flip boolean -> bool to reduce the risk of regressions (the compiler will much more easily allow "dirty" values inside a char-based boolean than a C99 _Bool). This has been build-tested on amd64 with: Gallium drivers: nouveau r300 r600 radeonsi freedreno swrast etnaviv v3d vc4 i915 svga virgl swr panfrost iris lima kmsro Gallium st: mesa xa xvmc xvmc vdpau va Signed-off-by:
Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by:
Marek Olšák <marek.olsak@amd.com> Acked-by:
Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
-
- 15 Jul, 2019 1 commit
-
-
Marek Olšák authored
This is needed to fix an issue with OpenGL when a buffer is mapped and BufferSubData is called. In this case, we can't invalidate the buffer range.
-
- 04 Apr, 2019 1 commit
-
-
Marek Olšák authored
to indicate write usage per buffer. This is just a hint (it will be used by radeonsi). Reviewed-by:
Timothy Arceri <tarceri@itsqueeze.com>
-
- 21 Feb, 2019 1 commit
-
-
Marek Olšák authored
radeonsi will require this. It's a no-op for drivers supporting persistent mappings. Reviewed-by:
Nicolai Hähnle <nicolai.haehnle@amd.com> Tested-by:
Dieter Nützel <Dieter@nuetzel-hh.de>
-
- 11 Feb, 2019 1 commit
-
-
Marek Olšák authored
Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Reviewed-by:
Nicolai Hähnle <nicolai.haehnle@amd.com>
-
- 07 Sep, 2018 1 commit
-
-
Marek Olšák authored
v2: - use set_context_param - set set_context_param even if the driver doesn't implement it Reviewed-by:
Brian Paul <brianp@vmware.com>
-
- 31 Jul, 2018 1 commit
-
-
Marek Olšák authored
Tested-by:
Dieter Nützel <Dieter@nuetzel-hh.de>
-
- 05 Jul, 2018 1 commit
-
-
Marek Olšák authored
There is a 15-character limit for thread names shared by the queue name and process name. Shorten the thread name to make space for the process name. Reviewed-by:
Timothy Arceri <tarceri@itsqueeze.com>
-
- 30 Jan, 2018 2 commits
-
-
Andres Rodriguez authored
v2: s/semaphore/fence Signed-off-by:
Andres Rodriguez <andresx7@gmail.com> Reviewed-by:
Marek Olšák <marek.olsak@amd.com>
-
Andres Rodriguez authored
An fd can potentially have different types of objects backing it. Specifying the type helps us make sure we treat the FD correctly. This is in preparation to allow importing syncobj fence FDs in addition to native sync FDs. Signed-off-by:
Andres Rodriguez <andresx7@gmail.com> Reviewed-by:
Marek Olšák <marek.olsak@amd.com>
-