Skip to content
  • Eric Dumazet's avatar
    net: prevent dst uses after free · 222d7dbd
    Eric Dumazet authored
    In linux-4.13, Wei worked hard to convert dst to a traditional
    refcounted model, removing GC.
    
    We now want to make sure a dst refcount can not transition from 0 back
    to 1.
    
    The problem here is that input path attached a not refcounted dst to an
    skb. Then later, because packet is forwarded and hits skb_dst_force()
    before exiting RCU section, we might try to take a refcount on one dst
    that is about to be freed, if another cpu saw 1 -> 0 transition in
    dst_release() and queued the dst for freeing after one RCU grace period.
    
    Lets unify skb_dst_force() and skb_dst_force_safe(), since we should
    always perform the complete check against dst refcount, and not assume
    it is not zero.
    
    Bugzilla : https://bugzilla.kernel.org/show_bug.cgi?id=197005
    
    [  989.919496]  skb_dst_force+0x32/0x34
    [  989.919498]  __dev_queue_xmit+0x1ad/0x482
    [  989.919501]  ? eth_header+0x28/0xc6
    [  989.919502]  dev_queue_xmit+0xb/0xd
    [  989.919504]  neigh_connected_output+0x9b/0xb4
    [  989.919507]  ip_finish_output2+0x234/0x294
    [  989.919509]  ? ipt_do_table+0x369/0x388
    [  989.919510]  ip_finish_output+0x12c/0x13f
    [  989.919512]  ip_output+0x53/0x87
    [  989.919513]  ip_forward_finish+0x53/0x5a
    [  989.919515]  ip_forward+0x2cb/0x3e6
    [  989.919516]  ? pskb_trim_rcsum.part.9+0x4b/0x4b
    [  989.919518]  ip_rcv_finish+0x2e2/0x321
    [  989.919519]  ip_rcv+0x26f/0x2eb
    [  989.919522]  ? vlan_do_receive+0x4f/0x289
    [  989.919523]  __netif_receive_skb_core+0x467/0x50b
    [  989.919526]  ? tcp_gro_receive+0x239/0x239
    [  989.919529]  ? inet_gro_receive+0x226/0x238
    [  989.919530]  __netif_receive_skb+0x4d/0x5f
    [  989.919532]  netif_receive_skb_internal+0x5c/0xaf
    [  989.919533]  napi_gro_receive+0x45/0x81
    [  989.919536]  ixgbe_poll+0xc8a/0xf09
    [  989.919539]  ? kmem_cache_free_bulk+0x1b6/0x1f7
    [  989.919540]  net_rx_action+0xf4/0x266
    [  989.919543]  __do_softirq+0xa8/0x19d
    [  989.919545]  irq_exit+0x5d/0x6b
    [  989.919546]  do_IRQ+0x9c/0xb5
    [  989.919548]  common_interrupt+0x93/0x93
    [  989.919548]  </IRQ>
    
    Similarly dst_clone() can use dst_hold() helper to have additional
    debugging, as a follow up to commit 44ebe791 ("net: add debug
    atomic_inc_not_zero() in dst_hold()")
    
    In net-next we will convert dst atomic_t to refcount_t for peace of
    mind.
    
    Fixes: a4c2fd7f
    
     ("net: remove DST_NOCACHE flag")
    Signed-off-by: default avatarEric Dumazet <edumazet@google.com>
    Cc: Wei Wang <weiwan@google.com>
    Reported-by: default avatarPaweł Staszewski <pstaszewski@itcare.pl>
    Bisected-by: default avatarPaweł Staszewski <pstaszewski@itcare.pl>
    Acked-by: default avatarWei Wang <weiwan@google.com>
    Acked-by: default avatarMartin KaFai Lau <kafai@fb.com>
    Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
    222d7dbd