Skip to content
Snippets Groups Projects
  • David Hildenbrand's avatar
    d7c0e68d
    mm/ksm: convert break_ksm() to use walk_page_range_vma() · d7c0e68d
    David Hildenbrand authored
    FOLL_MIGRATION exists only for the purpose of break_ksm(), and actually,
    there is not even the need to wait for the migration to finish, we only
    want to know if we're dealing with a KSM page.
    
    Using follow_page() just to identify a KSM page overcomplicates GUP code. 
    Let's use walk_page_range_vma() instead, because we don't actually care
    about the page itself, we only need to know a single property -- no need
    to even grab a reference.
    
    So, get rid of follow_page() usage such that we can get rid of
    FOLL_MIGRATION now and eventually be able to get rid of follow_page() in
    the future.
    
    In my setup (AMD Ryzen 9 3900X), running the KSM selftest to test unmerge
    performance on 2 GiB (taskset 0x8 ./ksm_tests -D -s 2048), this results in
    a performance degradation of ~2% (old: ~5010 MiB/s, new: ~4900 MiB/s).  I
    don't think we particularly care for now.
    
    Interestingly, the benchmark reduction is due to the single callback. 
    Adding a second callback (e.g., pud_entry()) reduces the benchmark by
    another 100-200 MiB/s.
    
    Link: https://lkml.kernel.org/r/20221021101141.84170-9-david@redhat.com
    
    
    Signed-off-by: default avatarDavid Hildenbrand <david@redhat.com>
    Cc: Andrea Arcangeli <aarcange@redhat.com>
    Cc: Hugh Dickins <hughd@google.com>
    Cc: Jason Gunthorpe <jgg@nvidia.com>
    Cc: John Hubbard <jhubbard@nvidia.com>
    Cc: Matthew Wilcox (Oracle) <willy@infradead.org>
    Cc: Peter Xu <peterx@redhat.com>
    Cc: Shuah Khan <shuah@kernel.org>
    Cc: Vlastimil Babka <vbabka@suse.cz>
    Signed-off-by: default avatarAndrew Morton <akpm@linux-foundation.org>
    d7c0e68d
    History
    mm/ksm: convert break_ksm() to use walk_page_range_vma()
    David Hildenbrand authored
    FOLL_MIGRATION exists only for the purpose of break_ksm(), and actually,
    there is not even the need to wait for the migration to finish, we only
    want to know if we're dealing with a KSM page.
    
    Using follow_page() just to identify a KSM page overcomplicates GUP code. 
    Let's use walk_page_range_vma() instead, because we don't actually care
    about the page itself, we only need to know a single property -- no need
    to even grab a reference.
    
    So, get rid of follow_page() usage such that we can get rid of
    FOLL_MIGRATION now and eventually be able to get rid of follow_page() in
    the future.
    
    In my setup (AMD Ryzen 9 3900X), running the KSM selftest to test unmerge
    performance on 2 GiB (taskset 0x8 ./ksm_tests -D -s 2048), this results in
    a performance degradation of ~2% (old: ~5010 MiB/s, new: ~4900 MiB/s).  I
    don't think we particularly care for now.
    
    Interestingly, the benchmark reduction is due to the single callback. 
    Adding a second callback (e.g., pud_entry()) reduces the benchmark by
    another 100-200 MiB/s.
    
    Link: https://lkml.kernel.org/r/20221021101141.84170-9-david@redhat.com
    
    
    Signed-off-by: default avatarDavid Hildenbrand <david@redhat.com>
    Cc: Andrea Arcangeli <aarcange@redhat.com>
    Cc: Hugh Dickins <hughd@google.com>
    Cc: Jason Gunthorpe <jgg@nvidia.com>
    Cc: John Hubbard <jhubbard@nvidia.com>
    Cc: Matthew Wilcox (Oracle) <willy@infradead.org>
    Cc: Peter Xu <peterx@redhat.com>
    Cc: Shuah Khan <shuah@kernel.org>
    Cc: Vlastimil Babka <vbabka@suse.cz>
    Signed-off-by: default avatarAndrew Morton <akpm@linux-foundation.org>