Skip to content
GitLab
  • Menu
Projects Groups Snippets
  • /
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in / Register
  • mesa mesa
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 2,772
    • Issues 2,772
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 929
    • Merge requests 929
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Releases
  • Packages & Registries
    • Packages & Registries
    • Container Registry
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Repository
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • Mesa
  • mesamesa
  • Merge requests
  • !14277

mesa/st: Turn on OptimizeForAOS on non-scalar NIR backends.

  • Review changes

  • Download
  • Email patches
  • Plain diff
Open Emma Anholt requested to merge anholt/mesa:optimize-for-aos into main Dec 20, 2021
  • Overview 6
  • Commits 5
  • Pipelines 8
  • Changes 4

Based on !14200 (merged), alternative to !14247 (closed). I think we should land this instead.

    On i965 vec4 hardware (most of crocus), this lets the VS matrix multiplies
    happen in parallel as independent DP4s to each dest channel, rather than a
    serialized set of MADs with approximately the same instruction count.
    Should be a perf regression fix from the crocus transition (from the
    original commit, "Improves performance in Lightsmark by 1.01131% +/-
    0.162069% (n = 10) on a Haswell GT2 system.").

    
    i915g:
    total instructions in shared programs: 396828 -> 396831 (<.01%)
    instructions in affected programs: 159 -> 162 (1.89%)
    
    r300:
    total instructions in shared programs: 1226783 -> 1228308 (0.12%)
    instructions in affected programs: 61920 -> 63445 (2.46%)
    total temps in shared programs: 195902 -> 195850 (-0.03%)
    temps in affected programs: 2393 -> 2341 (-2.17%)
    
    hsw:
    total instructions in shared programs: 8163635 -> 8154150 (-0.12%)
    instructions in affected programs: 174076 -> 164591 (-5.45%)
Edited Dec 20, 2021 by Emma Anholt
Assignee
Assign to
Reviewer
Request review from
Time tracking
Source branch: optimize-for-aos