iova faults on IB1 cmdstream

added bug label

mentioned in merge request mesa/mesa!12489 (merged)

Note that mesa/mesa!12489 (merged) makes crashdec decode the entire RB when used with the --verbose flag, which makes it easier to see the sequence of cmds since the previous resume.

Not sure if it is relevant, but based on the size of the two IBs this looks like a no-op submit (ie. userspace needed a fence-fd or similar), which probably explains why this crash seems to go unnoticed, other than showing up in the crash telemetry logging.

changed the description

I seem to have gotten lucky and managed to repro the IB1 fault with my kernel-side patchset applied, which adds additional info (including the PTEs) to iova fault devcores:

kernel: 5.15.0-rc3+
module: msm
time: 1633017102.630726774
comm: deqp-egl:sq0
cmdline: ./deqp-egl --deqp-surface-width=256 --deqp-surface-height=256 --deqp-surface-type=pbuffer --deqp-gl-config-name=rgba8888d24s8ms0 --deqp-visibility=hidden --deqp-caselist-file=/home/robclark/src/deqp/build/modules/egl/new-run/c6.r2.caselist.txt --deqp-log-filename=>
revision: 618 (6.1.8.0)
Got gpu_id=618
fault-info:
  - far: 0000000100114000
  - ttbr0: 00000000077a3000
  - contextidr: 00000000
  - fsr: 40000402
  - fsynr0: 00000000
  - fsynr1: 07000000
  - cbfrsynra: 00000000
  - iova: 0000000100114000
  - dir: READ
  - type: TRANSLATION
  - source: CP
pgtable-fault-info:
  - ttbr0: 0000000262a23000
  - asid: 0
  - ptes: 008000010b54a003 008000010c289003 008000010b634003 000000010bddcfc7
rbbm-status: 0x00000000
ringbuffer:
  - id: 0
    iova: 0x0001000000001000
    last-fence: 1332623
    retired-fence: 1332622
    rptr: 104
    wptr: 138
    size: 32768
bos:
  - iova: 0x0000000100114000
    size: 4096

ib1-fault.devcore
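For anyone reading the `ptes` line in the dump, here is a rough sketch of how the four entries could be related to the faulting iova. It assumes a 4 KiB granule and a 4-level walk with 9 index bits per level (not stated in the dump, but consistent with four PTEs being listed), and that the entries are ordered from the top level down to the leaf. `walk_indices` and `descriptor_kind` are hypothetical helper names, not part of crashdec or the kernel.

```python
# Assumptions (not confirmed by the dump): 4 KiB granule, 4-level walk,
# 9 index bits per level, ptes listed from level 0 down to level 3.
PAGE_SHIFT = 12
BITS_PER_LEVEL = 9
LEVELS = 4


def walk_indices(iova):
    """Return the table index used at each level of the walk for iova."""
    mask = (1 << BITS_PER_LEVEL) - 1
    indices = []
    for level in range(LEVELS):
        shift = PAGE_SHIFT + BITS_PER_LEVEL * (LEVELS - 1 - level)
        indices.append((iova >> shift) & mask)
    return indices


def descriptor_kind(pte, level):
    """Classify a descriptor from its two low bits (valid bit and type bit)."""
    if not pte & 0x1:
        return "invalid"
    is_type3 = bool(pte & 0x2)
    if level == LEVELS - 1:
        # At the last level only type 0b11 (page) is a valid encoding.
        return "page" if is_type3 else "invalid"
    return "table" if is_type3 else "block"


iova = 0x0000000100114000
ptes = [0x008000010B54A003, 0x008000010C289003,
        0x008000010B634003, 0x000000010BDDCFC7]

for level, (idx, pte) in enumerate(zip(walk_indices(iova), ptes)):
    print(f"level {level}: index {idx:3d}  pte {pte:#018x}  {descriptor_kind(pte, level)}")
```

Under those assumptions the first three entries decode as table descriptors and the last one as a page descriptor, ie. the snapshot shows a complete walk down to a valid leaf.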

According to my decoding of the leaf PTE:

  0x000000010bddcfc7:
      b2:    ARM_LPAE_MAIR_ATTR_IDX_CACHE
      b6:    ARM_LPAE_PTE_AP_UNPRIV
      b7:    ARM_LPAE_PTE_AP_RDONLY
      b8..9: ARM_LPAE_PTE_SH_IS
      b10:   ARM_LPAE_PTE_AF
      b11:   ARM_LPAE_PTE_nG

which matches what I'd expect, ie. IOMMU_READ | IOMMU_CACHE.
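The bit breakdown above can be reproduced with a small script. This is only a sketch: the constants are modeled on the names in the kernel's `drivers/iommu/io-pgtable-arm.c` as I remember them, so treat the exact values as assumptions, and `decode_leaf_pte` is a hypothetical helper.

```python
# Constants modeled on drivers/iommu/io-pgtable-arm.c (values are assumptions
# written from memory, not copied from the kernel source).
ARM_LPAE_PTE_ATTRINDX_SHIFT = 2
ARM_LPAE_MAIR_ATTR_IDX_CACHE = 1
ARM_LPAE_PTE_AP_UNPRIV = 1 << 6
ARM_LPAE_PTE_AP_RDONLY = 2 << 6
ARM_LPAE_PTE_SH_IS = 3 << 8
ARM_LPAE_PTE_AF = 1 << 10
ARM_LPAE_PTE_nG = 1 << 11
OUTPUT_ADDR_MASK = 0x0000FFFFFFFFF000  # bits 47:12, assuming a 4 KiB granule


def decode_leaf_pte(pte):
    """Return (flag names, output address) for a stage-1 leaf descriptor."""
    flags = []
    attr_idx = (pte >> ARM_LPAE_PTE_ATTRINDX_SHIFT) & 0x7
    if attr_idx == ARM_LPAE_MAIR_ATTR_IDX_CACHE:
        flags.append("ARM_LPAE_MAIR_ATTR_IDX_CACHE")
    if pte & ARM_LPAE_PTE_AP_UNPRIV:
        flags.append("ARM_LPAE_PTE_AP_UNPRIV")
    if pte & ARM_LPAE_PTE_AP_RDONLY:
        flags.append("ARM_LPAE_PTE_AP_RDONLY")
    if (pte & ARM_LPAE_PTE_SH_IS) == ARM_LPAE_PTE_SH_IS:
        flags.append("ARM_LPAE_PTE_SH_IS")
    if pte & ARM_LPAE_PTE_AF:
        flags.append("ARM_LPAE_PTE_AF")
    if pte & ARM_LPAE_PTE_nG:
        flags.append("ARM_LPAE_PTE_nG")
    return flags, pte & OUTPUT_ADDR_MASK


flags, out_addr = decode_leaf_pte(0x000000010BDDCFC7)
for name in flags:
    print(name)
print(f"output address: {out_addr:#x}")
```

AP_RDONLY plus the cacheable attribute index is what a read-only, cached mapping (IOMMU_READ | IOMMU_CACHE) should produce, so nothing in the leaf descriptor itself looks off.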

Look at fsynr0 though - that says it faulted at level 0, so the leaf PTE would be neither here nor there. What would really be interesting would be to know what was in the TLBs at that point, but that's not something we can do from the kernel. Note also that a fault at the topmost level may imply the whole TTBR being disabled, but I'm guessing this sequence probably isn't wiggling TCR.PD0.
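To make the fsr/fsynr0 reading concrete, here is a minimal decoding sketch. The bit positions are taken from my recollection of the SMMUv2 context bank register layout and the `ARM_SMMU_FSR_*` / `ARM_SMMU_FSYNR0_WNR` defines in the kernel's arm-smmu driver, so treat them as assumptions; `decode_fsr` and `decode_fsynr0` are hypothetical helpers.

```python
# Bit positions are assumptions based on the SMMUv2 CBn_FSR / CBn_FSYNR0
# layout as I remember it; check against the spec before relying on them.
FSR_BITS = {
    1: "TF",       # translation fault
    2: "AFF",      # access flag fault
    3: "PF",       # permission fault
    4: "EF",       # external fault
    5: "TLBMCF",   # TLB match conflict fault
    6: "TLBLKF",   # TLB lock fault
    7: "ASF",      # address size fault
    8: "UUT",      # unsupported upstream transaction
    30: "SS",      # transaction stalled
    31: "MULTI",   # multiple faults recorded
}


def decode_fsr(fsr):
    """Return (set flag names in bit order, FORMAT field from bits 10:9)."""
    names = [name for bit, name in sorted(FSR_BITS.items()) if fsr & (1 << bit)]
    return names, (fsr >> 9) & 0x3


def decode_fsynr0(fsynr0):
    """Return the faulting walk level (PLVL, bits 1:0) and the write flag (bit 4)."""
    return {"plvl": fsynr0 & 0x3, "wnr": bool(fsynr0 & (1 << 4))}


names, fmt = decode_fsr(0x40000402)
print("fsr flags:", names, "format:", fmt)
print("fsynr0:", decode_fsynr0(0x00000000))
```

With the values from the dump this gives a stalled translation fault on a read, reported at level 0, which is why the contents of the leaf PTE do not explain the fault.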

fwiw, I'm beginning to suspect "IB1 fault" is really just a generic symptom for "something is wrong w/ smmu tables" rather than being a single bug.. which kinda makes sense, it is the first userspace bo that will be dereferenced. So if something were wrong, this is where it would show up.

I did manage to catch one devcore dump with https://patchwork.freedesktop.org/series/94968/ applied, and in that case the TTBR0 read back from hw didn't match what we expected. Possibly fixed by https://patchwork.freedesktop.org/patch/456636/?series=95297&rev=1

But other devcores without the PTE/etc snapshot look like TTBR0 matched what we expected, ie. what we read back from the SMMU registers matched the CP_SMMU_TABLE_UPDATE packet in the ringbuffer prior to the faulting CP_INDIRECT_BUFFER.

(And yeah, being able to snapshot the TLBs would be pretty useful.. I can ask qcom if they have some implementation specific way to dump that.)

Certainly MMU-500 has some debug registers to read out the internal cache structures but they're Secure-only, and you really want to do it via external debug with the system halted since it's rather long-winded. Do you know how CP_SMMU_TABLE_UPDATE handles TLB/walk cache invalidation? To me this smells most like a translation or prefetch racing against the table switch.

(BTW I will take a proper look at the patches soon - I have a few other things to catch up on first and this is me skiving off that)

Yes, CP_SMMU_TABLE_UPDATE does a full TLB invalidation.. if it didn't we'd have a lot more problems ;-)

It does a kind of interesting dance: first a "dry run" of writing TTBR0, then the TLB inv (IIRC TLBIASID), then polling, but using a different base address so those reads/writes point at scratch registers. Then it disables access to memory, changes the base address to point at the real SMMU cb0 registers, and repeats the sequence before re-enabling memory access. Presumably this is to ensure the whole sequence of instructions is in SQE's instruction cache. @cwabbott0 r/e'd this part of the SQE fw so we'd understand how it worked.

Haven't seen this issue for a while, likely the fix was 1d054c9b

closed

mentioned in commit c3b0f72e

mentioned in commit ee15c8bf
