syzbot


possible deadlock in rcu_gp_fqs_loop (2)

Status: upstream: reported on 2025/10/25 22:22
Reported-by: syzbot+03cb9448675893965b49@syzkaller.appspotmail.com
First crash: 57d, last: 13d
Similar bugs (1)
Kernel Title Rank 🛈 Repro Cause bisect Fix bisect Count Last Reported Patched Status
linux-5.15 possible deadlock in rcu_gp_fqs_loop 4 1 340d 340d 0/3 auto-obsoleted due to no activity on 2025/04/25 08:59

Sample crash report:
======================================================
WARNING: possible circular locking dependency detected
syzkaller #0 Not tainted
------------------------------------------------------
rcu_preempt/15 is trying to acquire lock:
ffff8880b903a358 (&rq->__lock){-.-.}-{2:2}, at: raw_spin_rq_lock_nested+0x26/0x140 kernel/sched/core.c:475

but task is already holding lock:
ffffffff8c120b98 (rcu_node_0){-.-.}-{2:2}, at: force_qs_rnp kernel/rcu/tree.c:2646 [inline]
ffffffff8c120b98 (rcu_node_0){-.-.}-{2:2}, at: rcu_gp_fqs kernel/rcu/tree.c:-1 [inline]
ffffffff8c120b98 (rcu_node_0){-.-.}-{2:2}, at: rcu_gp_fqs_loop+0x768/0x11b0 kernel/rcu/tree.c:1986

which lock already depends on the new lock.


the existing dependency chain (in reverse order) is:

-> #3 (rcu_node_0){-.-.}-{2:2}:
       __raw_spin_lock include/linux/spinlock_api_smp.h:142 [inline]
       _raw_spin_lock+0x2a/0x40 kernel/locking/spinlock.c:154
       check_cb_ovld kernel/rcu/tree.c:2974 [inline]
       __call_rcu kernel/rcu/tree.c:3025 [inline]
       call_rcu+0x312/0x930 kernel/rcu/tree.c:3091
       queue_rcu_work+0x81/0x90 kernel/workqueue.c:1788
       kfree_rcu_monitor+0x32a/0x730 kernel/rcu/tree.c:3418
       process_one_work+0x863/0x1000 kernel/workqueue.c:2310
       worker_thread+0xaa8/0x12a0 kernel/workqueue.c:2457
       kthread+0x436/0x520 kernel/kthread.c:334
       ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:287

-> #2 (krc.lock){..-.}-{2:2}:
       __raw_spin_lock include/linux/spinlock_api_smp.h:142 [inline]
       _raw_spin_lock+0x2a/0x40 kernel/locking/spinlock.c:154
       krc_this_cpu_lock kernel/rcu/tree.c:3203 [inline]
       add_ptr_to_bulk_krc_lock kernel/rcu/tree.c:3510 [inline]
       kvfree_call_rcu+0x186/0x7c0 kernel/rcu/tree.c:3601
       trie_update_elem+0x86e/0xc50 kernel/bpf/lpm_trie.c:396
       bpf_map_update_value+0x57d/0x650 kernel/bpf/syscall.c:223
       generic_map_update_batch+0x525/0x7c0 kernel/bpf/syscall.c:1430
       bpf_map_do_batch+0x466/0x600 kernel/bpf/syscall.c:-1
       __sys_bpf+0x601/0x670 kernel/bpf/syscall.c:-1
       __do_sys_bpf kernel/bpf/syscall.c:4761 [inline]
       __se_sys_bpf kernel/bpf/syscall.c:4759 [inline]
       __x64_sys_bpf+0x78/0x90 kernel/bpf/syscall.c:4759
       do_syscall_x64 arch/x86/entry/common.c:50 [inline]
       do_syscall_64+0x4c/0xa0 arch/x86/entry/common.c:80
       entry_SYSCALL_64_after_hwframe+0x66/0xd0

-> #1 (&trie->lock){..-.}-{2:2}:
       __raw_spin_lock_irqsave include/linux/spinlock_api_smp.h:110 [inline]
       _raw_spin_lock_irqsave+0xa4/0xf0 kernel/locking/spinlock.c:162
       trie_delete_elem+0x90/0x710 kernel/bpf/lpm_trie.c:467
       0xffffffffa0006f05
       bpf_dispatcher_nop_func include/linux/bpf.h:888 [inline]
       __bpf_prog_run include/linux/filter.h:628 [inline]
       bpf_prog_run include/linux/filter.h:635 [inline]
       __bpf_trace_run kernel/trace/bpf_trace.c:1878 [inline]
       bpf_trace_run2+0x15b/0x2d0 kernel/trace/bpf_trace.c:1915
       trace_tlb_flush+0xe6/0x110 include/trace/events/tlb.h:38
       switch_mm_irqs_off+0x6e3/0x9a0 arch/x86/mm/tlb.c:-1
       context_switch kernel/sched/core.c:5035 [inline]
       __schedule+0x1024/0x4390 kernel/sched/core.c:6395
       preempt_schedule_common+0x82/0xd0 kernel/sched/core.c:6571
       preempt_schedule+0xa7/0xb0 kernel/sched/core.c:6596
       preempt_schedule_thunk+0x16/0x18 arch/x86/entry/thunk_64.S:34
       unwind_next_frame+0x12ac/0x1d90 arch/x86/kernel/unwind_orc.c:616
       arch_stack_walk+0x10c/0x140 arch/x86/kernel/stacktrace.c:25
       stack_trace_save+0x98/0xe0 kernel/stacktrace.c:122
       kasan_save_stack mm/kasan/common.c:38 [inline]
       kasan_set_track mm/kasan/common.c:46 [inline]
       set_alloc_info mm/kasan/common.c:434 [inline]
       __kasan_slab_alloc+0x9c/0xd0 mm/kasan/common.c:467
       kasan_slab_alloc include/linux/kasan.h:254 [inline]
       slab_post_alloc_hook+0x4c/0x380 mm/slab.h:519
       slab_alloc_node mm/slub.c:3225 [inline]
       slab_alloc mm/slub.c:3233 [inline]
       kmem_cache_alloc+0x100/0x290 mm/slub.c:3238
       kmem_cache_zalloc include/linux/slab.h:728 [inline]
       __alloc_file+0x25/0x240 fs/file_table.c:132
       alloc_empty_file+0x90/0x180 fs/file_table.c:181
       alloc_file+0x5b/0x4f0 fs/file_table.c:223
       alloc_file_pseudo+0x17a/0x1f0 fs/file_table.c:263
       __anon_inode_getfile fs/anon_inodes.c:109 [inline]
       anon_inode_getfile+0xc1/0x1a0 fs/anon_inodes.c:147
       __do_sys_perf_event_open kernel/events/core.c:12613 [inline]
       __se_sys_perf_event_open+0xbd2/0x1b80 kernel/events/core.c:12387
       do_syscall_x64 arch/x86/entry/common.c:50 [inline]
       do_syscall_64+0x4c/0xa0 arch/x86/entry/common.c:80
       entry_SYSCALL_64_after_hwframe+0x66/0xd0

-> #0 (&rq->__lock){-.-.}-{2:2}:
       check_prev_add kernel/locking/lockdep.c:3053 [inline]
       check_prevs_add kernel/locking/lockdep.c:3172 [inline]
       validate_chain kernel/locking/lockdep.c:3788 [inline]
       __lock_acquire+0x2c33/0x7c60 kernel/locking/lockdep.c:5012
       lock_acquire+0x197/0x3f0 kernel/locking/lockdep.c:5623
       _raw_spin_lock_nested+0x2e/0x40 kernel/locking/spinlock.c:368
       raw_spin_rq_lock_nested+0x26/0x140 kernel/sched/core.c:475
       raw_spin_rq_lock kernel/sched/sched.h:1326 [inline]
       _raw_spin_rq_lock_irqsave kernel/sched/sched.h:1345 [inline]
       resched_cpu+0xd4/0x240 kernel/sched/core.c:994
       rcu_implicit_dynticks_qs+0x438/0xc30 kernel/rcu/tree.c:1329
       force_qs_rnp kernel/rcu/tree.c:2664 [inline]
       rcu_gp_fqs kernel/rcu/tree.c:-1 [inline]
       rcu_gp_fqs_loop+0x972/0x11b0 kernel/rcu/tree.c:1986
       rcu_gp_kthread+0x98/0x350 kernel/rcu/tree.c:2145
       kthread+0x436/0x520 kernel/kthread.c:334
       ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:287

other info that might help us debug this:

Chain exists of:
  &rq->__lock --> krc.lock --> rcu_node_0

 Possible unsafe locking scenario:

       CPU0                    CPU1
       ----                    ----
  lock(rcu_node_0);
                               lock(krc.lock);
                               lock(rcu_node_0);
  lock(&rq->__lock);

 *** DEADLOCK ***

1 lock held by rcu_preempt/15:
 #0: ffffffff8c120b98 (rcu_node_0){-.-.}-{2:2}, at: force_qs_rnp kernel/rcu/tree.c:2646 [inline]
 #0: ffffffff8c120b98 (rcu_node_0){-.-.}-{2:2}, at: rcu_gp_fqs kernel/rcu/tree.c:-1 [inline]
 #0: ffffffff8c120b98 (rcu_node_0){-.-.}-{2:2}, at: rcu_gp_fqs_loop+0x768/0x11b0 kernel/rcu/tree.c:1986

stack backtrace:
CPU: 1 PID: 15 Comm: rcu_preempt Not tainted syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 10/25/2025
Call Trace:
 <TASK>
 dump_stack_lvl+0x168/0x230 lib/dump_stack.c:106
 check_noncircular+0x274/0x310 kernel/locking/lockdep.c:2133
 check_prev_add kernel/locking/lockdep.c:3053 [inline]
 check_prevs_add kernel/locking/lockdep.c:3172 [inline]
 validate_chain kernel/locking/lockdep.c:3788 [inline]
 __lock_acquire+0x2c33/0x7c60 kernel/locking/lockdep.c:5012
 lock_acquire+0x197/0x3f0 kernel/locking/lockdep.c:5623
 _raw_spin_lock_nested+0x2e/0x40 kernel/locking/spinlock.c:368
 raw_spin_rq_lock_nested+0x26/0x140 kernel/sched/core.c:475
 raw_spin_rq_lock kernel/sched/sched.h:1326 [inline]
 _raw_spin_rq_lock_irqsave kernel/sched/sched.h:1345 [inline]
 resched_cpu+0xd4/0x240 kernel/sched/core.c:994
 rcu_implicit_dynticks_qs+0x438/0xc30 kernel/rcu/tree.c:1329
 force_qs_rnp kernel/rcu/tree.c:2664 [inline]
 rcu_gp_fqs kernel/rcu/tree.c:-1 [inline]
 rcu_gp_fqs_loop+0x972/0x11b0 kernel/rcu/tree.c:1986
 rcu_gp_kthread+0x98/0x350 kernel/rcu/tree.c:2145
 kthread+0x436/0x520 kernel/kthread.c:334
 ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:287
 </TASK>

Crashes (4):
Time Kernel Commit Syzkaller Config Log Report Syz repro C repro VM info Assets (help?) Manager Title
2025/12/08 12:46 linux-5.15.y 68efe5a6c16a d6526ea3 .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan-perf possible deadlock in rcu_gp_fqs_loop
2025/11/17 01:25 linux-5.15.y cc5ec8769306 f7988ea4 .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan-perf possible deadlock in rcu_gp_fqs_loop
2025/11/15 15:56 linux-5.15.y cc5ec8769306 f7988ea4 .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan-perf possible deadlock in rcu_gp_fqs_loop
2025/10/25 22:22 linux-5.15.y ac56c046adf4 c0460fcd .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan-perf possible deadlock in rcu_gp_fqs_loop
* Struck through repros no longer work on HEAD.