Skip to content

aco: optimizations around int16 multiply-add

Daniel Schürmann requested to merge daniel-schuermann/mesa:aco_mad_u16 into main

Most changes are actually due to some refactoring I made on the selection of mul_u16 for 32bit multiplications.

 PERCENTAGE DELTAS                Shaders   VGPRs    CodeSize   Instrs   Latency  InvThroughput  VClause   SClause    Copies   Branches  PreSGPRs  PreVGPRs 
 assassins_creed_origins          2104      -0.01%    -0.01%    +0.00%    +0.00%      -0.00%        .         .       +0.01%      .         .         .     
 battlefront2                     4568        .       -0.00%    -0.00%    +0.02%      -0.00%      -0.01%      .       -0.03%      .         .         .     
 britannia                        465         .         .         .         .           .           .         .         .         .         .         .     
 control_msaa4x                   752       -0.03%      .       -0.00%    -0.00%      -0.00%      -0.01%      .         .         .         .         .     
 control_no_msaa                  752       -0.03%      .       -0.00%    -0.00%      -0.00%      -0.01%      .         .         .         .         .     
 cyberpunk_2077                   331         .         .         .         .           .           .         .         .         .         .         .     
 dark_souls_1_remastered          4249        .         .         .         .           .           .         .         .         .         .         .     
 dark_souls_2_sotfs               6876        .         .         .         .           .           .         .         .         .         .         .     
 dark_souls_3                     8912        .       +0.00%    +0.00%    +0.00%      +0.00%        .         .       +0.03%      .         .         .     
 death_stranding                  1467        .       -0.01%    -0.12%    -0.02%      -0.10%      +0.04%      .         .         .         .         .     
 detroit_become_human             19959       .         .         .         .           .           .         .         .         .         .         .     
 deus_ex_md                       8063        .       -0.00%    -0.00%    -0.00%      -0.00%        .         .         .         .         .         .     
 dirt4                            4030        .       +0.00%    +0.00%      .         -0.00%        .         .       +0.03%      .         .         .     
 doom_2016                        2963        .         .         .         .           .           .         .         .         .         .         .     
 doom_eternal                     1211        .         .       -0.00%    -0.00%      -0.00%        .         .         .         .         .         .     
 dota2                            1015        .         .         .         .           .           .         .         .         .         .         .     
 dow3                             280         .         .         .         .           .           .         .         .         .         .         .     
 evilwithin_demo                  740         .         .         .         .           .           .         .         .         .         .         .     
 f12017                           5503        .       -0.00%    -0.11%    -0.01%      -0.03%        .         .         .         .         .         .     
 gtav                             2987        .         .         .         .           .           .         .         .         .         .         .     
 hitman                           2785        .       -0.01%    +0.00%      .           .         +0.01%      .       +0.00%    +0.02%      .         .     
 madmax                           948         .         .         .         .           .           .         .         .         .         .         .     
 PERCENTAGE DELTAS                Shaders   VGPRs    CodeSize   Instrs   Latency  InvThroughput  VClause   SClause    Copies   Branches  PreSGPRs  PreVGPRs 
 monster_hunter_world_d3d11       3122        .       -0.00%    -0.00%    -0.00%      -0.00%        .         .       +0.01%      .         .       -0.00%  
 monster_hunter_world_d3d12       450         .       -0.00%    -0.00%    -0.00%      -0.00%        .         .       +0.04%      .         .       -0.01%  
 nier                             5492        .         .         .         .           .           .         .         .         .         .         .     
 parallel-rdp/small_subgroup      246         .       -0.92%    -0.67%    -0.32%      -0.86%        .         .       -0.08%      .       -0.01%    -1.64%  
 parallel-rdp/small_uber_subgroup 55          .       -0.21%    -0.45%    -2.01%      -2.03%        .         .       -0.80%      .         .         .     
 parallel-rdp/subgroup            327         .       -0.01%    -0.00%    -0.01%      -0.02%        .         .       -0.04%      .       -0.01%      .     
 parallel-rdp/uber_subgroup       55          .       -0.07%    -0.00%    -0.42%      -0.42%        .         .       +0.12%    +0.04%      .         .     
 rage2                            6876      +0.00%    -0.00%    -0.00%    -0.00%      -0.00%        .       -0.00%      .         .         .         .     
 rdr2                             3888        .       +0.00%    -0.00%    +0.01%      +0.05%        .         .       +0.00%      .       -0.01%      .     
 redout                           2509        .       +0.00%    -0.00%    +0.00%      -0.00%        .       +0.01%    +0.01%      .         .         .     
 rottr                            8120        .       +0.00%    +0.00%    +0.00%      +0.00%        .         .       +0.09%      .         .         .     
 sottr_demo                       5705        .         .         .       +0.00%        .           .         .         .         .         .         .     
 strange_brigade                  1417        .       -0.00%    -0.00%    -0.00%        .           .         .         .         .         .         .     
 talos                            794         .         .         .         .           .           .         .         .         .         .         .     
 thewitness                       3150        .         .         .         .           .           .         .         .         .         .         .     
 threekingdoms                    844         .         .         .         .           .           .         .         .         .         .         .     
 tomb_raider                      16230       .         .       -0.00%      .         -0.00%        .         .         .         .         .         .     
 warhammer2                       1803        .         .       -0.00%    -0.00%      -0.00%        .         .         .         .         .         .     
 wolfenstein2_fp16                1012        .       +0.00%    -0.00%    +0.00%      -0.00%        .         .       +0.00%      .         .         .     
 worldofwarships                  5590        .         .         .         .           .           .         .         .         .         .         .     
 youngblood                       843         .         .       -0.00%    -0.00%      -0.00%        .         .         .         .         .         .     
 youngblood_fp16                  682         .       +0.00%    -0.00%    +0.00%      -0.00%        .         .       +0.00%      .         .         .     
 ------------------------------------------------------------------------------------------------------------------------------------------------------------
 All affected                     2861      -0.01%    -0.11%    -0.29%    -0.56%      -0.14%      +0.01%    +0.01%    -0.04%    +0.01%    -0.02%    -0.07%  
 ------------------------------------------------------------------------------------------------------------------------------------------------------------
 Total                            150170    -0.00%    -0.00%    -0.01%    -0.01%      -0.01%      +0.00%    +0.00%    -0.00%    +0.00%    -0.00%    -0.00%  

Merge request reports