Commits · ppir-new-regalloc · Erico Nunes / mesa

Jan 15, 2020

lima/ppir: implement full liveness analysis for regalloc · 9bf210ba

Erico Nunes authored 5 years ago


The existing liveness analysis in ppir still ultimately relies on a
single continuous live_in and live_out range per register and was
observed to be the bottleneck for register allocation on complicated
examples with several control flow blocks.
The use of live_in and live_out ranges was fine before ppir got control
flow, but now it ends up creating unnecessary interferences as live_in
and live_out ranges may span across entire blocks after blocks get
placed sequentially.

This new liveness analysis implementation generates a set of live
variables at each program point; before and after each instruction and
beginning and end of each block.
This is a global analysis and propagates the sets of live registers
across blocks independently of their sequence.
The resulting sets optimally represent all variables that cannot share a
register at each program point, so can be directly translated as
interferences to the register allocator.

Special care has to be taken with non-ssa registers. In order to
properly define their live range, their alive components also need to be
tracked. Therefore ppir can't use simple bitsets to keep track of live
registers.

The algorithm uses an auxiliary set data structure to keep track of the
live registers. The initial implementation used only trivial arrays,
however regalloc execution time was then prohibitive (>1minute on
Cortex-A53) on extreme benchmarks with hundreds of instructions,
hundreds of registers and several spilling iterations, mostly due to the
n^2 complexity to generate the interferences from the live sets. Since
the live registers set are only a very sparse subset of all registers at
each instruction, iterating only over this subset allows it to run very
fast again (a couple of seconds for the same benchmark).

Signed-off-by: Erico Nunes <nunes.erico@gmail.com>
Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com>
Tested-by: Marge Bot <mesa/mesa!3358>
Part-of: <mesa/mesa!3358>

9bf210ba

lima/ppir: remove orphan load node after cloning · 7e2765fd

Erico Nunes authored 5 years ago


There are some cases in shades using control flow where the varying load
is cloned to every block, and then the original node is left orphan.
This is not harmful for program execution, but it complicates analysis
for register allocation as there is now a case of writing to a register
that is never read.
While ppir doesn't have a dead code elimination pass for its own
optimizations and it is not hard to detect when we cloned the last load,
let's remove it early.

Signed-off-by: Erico Nunes <nunes.erico@gmail.com>
Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com>
Part-of: <mesa/mesa!3358>

7e2765fd

iris: Print warning and return *out = NULL when fd to syncobj fails · a3a73d11
Kristian Høgsberg authored 5 years ago
```
Signed-off-by: Kristian H. Kristensen <hoegsberg@google.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
```
a3a73d11

iris: Advertise PIPE_CAP_NATIVE_FENCE_FD · 1ac13869

Kristian Høgsberg authored 5 years ago


Enables EGL_ANDROID_native_fence_sync.

Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>

1ac13869

iris: Fix export of fences that have already completed. · e9f9a944

Kenneth Graunke authored 5 years ago and

Kristian H. Kristensen committed 5 years ago

After flushing batches, iris_fence_flush() asks the kernel whether
each batch's last_syncpt has already signalled or not. (The idea is
that either the compute or render batch may not have actually had any
work queued up, so last_syncpt there might have been signalled a long
time ago.) If it's already completed, we don't bother to record it.

A strange corner is the case of repeated flushes. For example, we
might flush for some reason, and hit a glFlush(), and hit SwapBuffers.
It's possible for all the batches to have been flushed previously, -and-
for them to have actually completed. In this case, we'll see that there
are no syncobj's to wait on, and record fence->count == 0.

This works fine internally - fence_finish can see count == 0 and realize
that it doesn't need to wait, for example. But when working with native
FDs, we may be asked to export a fence with count == 0. So we need an
actual synchronization primitive we can hand off. Because all of the
relevant batches had been signalled when creating the fence, we want the
new dummy fence to be signalled as well.

So we just make a signalled syncobj and export it.

Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>

e9f9a944

android: Fix whitespace issue · 6b9fce5d

Robert Foss authored 5 years ago


Signed-off-by: Robert Foss <robert.foss@collabora.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>

6b9fce5d

panfrost: Prefix schedule_program to prevent collision · 62adb652

Robert Foss authored 5 years ago


Currently the schedule_program implementation being used is picked
at compile time, which on the Android platform means that the
bifrost compiler & scheduler is used for all targets, including
midgard based hardware.

This commit disambiguates between the two schedule_program functions.

Signed-off-by: Robert Foss <robert.foss@collabora.com>
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>

62adb652

radeonsi: merge si_compile_llvm and si_llvm_compile functions · c4daf2b4

Marek Olšák authored 5 years ago


Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>
Tested-by: Marge Bot <mesa/mesa!3399>
Part-of: <mesa/mesa!3399>

c4daf2b4

radeonsi: remove useless #includes · 68586bdd
Marek Olšák authored 5 years ago
```
Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>
Part-of: <mesa/mesa!3399>
```
68586bdd
radeonsi: move code for shader resources into si_shader_llvm_resources.c · 30b14ba6
Marek Olšák authored 5 years ago
```
Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>
Part-of: <mesa/mesa!3399>
```
30b14ba6
radeonsi: move geometry shader code into si_shader_llvm_gs.c · da2c12af
Marek Olšák authored 5 years ago
```
Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>
Part-of: <mesa/mesa!3399>
```
da2c12af
radeonsi: remove llvm_type_is_64bit · 57bd73e2
Marek Olšák authored 5 years ago
```
Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>
Part-of: <mesa/mesa!3399>
```
57bd73e2
radeonsi: move tessellation shader code into si_shader_llvm_tess.c · 194449a4
Marek Olšák authored 5 years ago
```
Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>
Part-of: <mesa/mesa!3399>
```
194449a4
radeonsi: move si_insert_input_* functions · d7c86b10
Marek Olšák authored 5 years ago
```
Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>
Part-of: <mesa/mesa!3399>
```
d7c86b10

radeonsi: work around an LLVM crash when using llvm.amdgcn.icmp.i64.i1 · 8ff8e68e

Marek Olšák authored 5 years ago


Cc: 19.2 19.3 <mesa-stable@lists.freedesktop.org>
Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Tested-by: Marge Bot <mesa/mesa!3338>
Part-of: <mesa/mesa!3338>

8ff8e68e

radeonsi: fix si_build_wrapper_function for compute-based primitive culling · af3fbb41

Marek Olšák authored 5 years ago


Fixes: 3b143369 "ac/nir, radv, radeonsi: Switch to using ac_shader_args"

Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Part-of: <mesa/mesa!3338>

af3fbb41

radeonsi/gfx10: separate code for determining the number of vertices for NGG · 6d4993c9
Marek Olšák authored 5 years ago
```
Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
```
6d4993c9
radeonsi/gfx10: separate code for getting edgeflags from the gs_invocation_id VGPR · 7a25521f
Marek Olšák authored 5 years ago
```
Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
```
7a25521f
radeonsi: move VS_STATE.LS_OUT_PATCH_SIZE a few bits higher to make space there · cf65c6f0
Marek Olšák authored 5 years ago
```
Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
```
cf65c6f0
radeonsi: make si_insert_input_* functions non-static · 34ef0c50
Marek Olšák authored 5 years ago
```
Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
```
34ef0c50
ac/cull: don't read Position.Z if it's not needed for culling · eeb4a11c
Marek Olšák authored 5 years ago
```
It could be NULL.

Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
```
eeb4a11c
radeonsi: separate code computing info for small primitive culling · 8070402a
Marek Olšák authored 5 years ago
```
Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
```
8070402a

intel/compiler: Fix illegal mutation in get_nir_image_intrinsic_image · 0a1c4707

Kenneth Graunke authored 5 years ago


get_nir_image_intrinsic_image() was incorrectly mutating the value held
by the register which holds the intrinsic's first source (image index).

If this happened to be the register for an SSA def which is also used
elsewhere in the program, this meant that we would clobber that value
in subsequent uses.

Note that this only affects i965, because neither anv nor iris use the
binding table start sections, so nothing is ever added here.

Fixes KHR-GL46.compute_shader.resources-max on i965 with Eric Anholt's
MR !3240 applied.  That MR reorders SSBOs and ABOs, so that test uses
image 0 and SSBO 0, causing this code to brilliantly add binding table
index 45 to both the image (correct) and the SSBO (bzzt, wrong!).

Fixes: 09f1de97 ("anv,i965: Lower away image derefs in the driver")
Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Reviewed-by: Eric Anholt <eric@anholt.net>
Tested-by: Marge Bot <mesa/mesa!3404>
Part-of: <mesa/mesa!3404>

0a1c4707

gitlab-ci: fix missing caselist.css/xsl · b706a157

Rob Clark authored 5 years ago


My best guess is that this was broken by d62dd8b0

Signed-off-by: Rob Clark <robdclark@chromium.org>
Tested-by: Marge Bot <mesa/mesa!3413>
Part-of: <mesa/mesa!3413>

b706a157

relnotes: Add Vulkan 1.2 · af6c2f41
Faith Ekstrand authored 5 years ago

af6c2f41

radv: enable Vulkan 1.2 · 7f5462e3

Samuel Pitoiset authored 5 years ago and

Faith Ekstrand committed 5 years ago


This bumps the Vulkan version to 1.2.128.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>

7f5462e3

radv: implement Vulkan 1.2 features and properties · 68d6bead
Samuel Pitoiset authored 5 years ago and Faith Ekstrand committed 5 years ago
```
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
```
68d6bead
radv: implement Vulkan 1.1 features and properties · b3033198
Samuel Pitoiset authored 5 years ago and Faith Ekstrand committed 5 years ago
```
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
```
b3033198

radv: update VK_KHR_timeline_semaphore for Vulkan 1.2 · a09ab768

Samuel Pitoiset authored 5 years ago and

Faith Ekstrand committed 5 years ago


Promoted to Vulkan 1.2 with the KHR suffix omitted.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>

a09ab768

radv: update VK_KHR_uniform_buffer_standard_layout for Vulkan 1.2 · fab0aa91

Samuel Pitoiset authored 5 years ago and

Faith Ekstrand committed 5 years ago


Promoted to Vulkan 1.2 with the KHR suffix omitted.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>

fab0aa91

radv: update VK_KHR_shader_subgroup_extended_types for Vulkan 1.2 · 3ff8d124

Samuel Pitoiset authored 5 years ago and

Faith Ekstrand committed 5 years ago


Promoted to Vulkan 1.2 with the KHR suffix omitted.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>

3ff8d124

radv: update VK_KHR_shader_float_controls for Vulkan 1.2 · af25c8d5

Samuel Pitoiset authored 5 years ago and

Faith Ekstrand committed 5 years ago


Promoted to Vulkan 1.2 with the KHR suffix omitted.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>

af25c8d5

radv: update VK_KHR_shader_float16_int8 for Vulkan 1.2 · 5335bb6c

Samuel Pitoiset authored 5 years ago and

Faith Ekstrand committed 5 years ago


Promoted to Vulkan 1.2 with the KHR suffix omitted.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>

5335bb6c

radv: update VK_KHR_shader_atomic_int64 for Vulkan 1.2 · a73d01b1

Samuel Pitoiset authored 5 years ago and

Faith Ekstrand committed 5 years ago


Promoted to Vulkan 1.2 with the KHR suffix omitted.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>

a73d01b1

radv: update VK_KHR_imageless_framebuffer for Vulkan 1.2 · 83d1773a

Samuel Pitoiset authored 5 years ago and

Faith Ekstrand committed 5 years ago


Promoted to Vulkan 1.2 with the KHR suffix omitted.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>

83d1773a

radv: update VK_KHR_image_format_list for Vulkan 1.2 · b3bdb4e6

Samuel Pitoiset authored 5 years ago and

Faith Ekstrand committed 5 years ago


Promoted to Vulkan 1.2 with the KHR suffix omitted.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>

b3bdb4e6

radv: update VK_KHR_driver_properties for Vulkan 1.2 · a8022994

Samuel Pitoiset authored 5 years ago and

Faith Ekstrand committed 5 years ago


Promoted to Vulkan 1.2 with the KHR suffix omitted.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>

a8022994

radv: update VK_KHR_draw_indirect_count for Vulkan 1.2 · af883bf3

Samuel Pitoiset authored 5 years ago and

Faith Ekstrand committed 5 years ago


Promoted to Vulkan 1.2 with the KHR suffix omitted.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>

af883bf3

radv: update VK_KHR_depth_stencil_resolve for Vulkan 1.2 · b537be43

Samuel Pitoiset authored 5 years ago and

Faith Ekstrand committed 5 years ago


Promoted to Vulkan 1.2 with the KHR suffix omitted.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>

b537be43

radv: update VK_KHR_create_renderpass2 for Vulkan 1.2 · 5993f13b

Samuel Pitoiset authored 5 years ago and

Faith Ekstrand committed 5 years ago


Promoted to Vulkan 1.2 with the KHR suffix omitted.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>

5993f13b

Admin message

Admin message