soft lockup - may be related to wireguard (backported)
Jason A. Donenfeld
Jason at zx2c4.com
Tue May 5 00:55:02 CEST 2020
On 5/4/20 4:47 AM, Serge Belyshev wrote:
> Hi! I can reproduce similar RCU stall with a different kernel under
> specific conditions on a specific box:
>
> [ 54.437636] rcu: INFO: rcu_sched self-detected stall on CPU
> [ 54.438838] rcu: 0-...!: (2101 ticks this GP) idle=ea6/1/0x4000000000000002 softirq=604/604 fqs=0
> [ 54.440052] (t=2101 jiffies g=69 q=89)
> [ 54.441273] rcu: rcu_sched kthread starved for 2101 jiffies! g69 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402 ->cpu=0
> [ 54.442547] rcu: RCU grace-period kthread stack dump:
> [ 54.443812] rcu_sched I 0 10 2 0x80004000
> [ 54.443814] Call Trace:
> [ 54.445087] ? __schedule+0x540/0xa80
> [ 54.446356] schedule+0x45/0xb0
> [ 54.447612] schedule_timeout+0x144/0x280
> [ 54.448859] ? __next_timer_interrupt+0xc0/0xc0
> [ 54.450099] rcu_gp_kthread+0x3f0/0x840
> [ 54.451329] kthread+0xe6/0x120
> [ 54.452557] ? rcu_gp_slow.part.0+0x30/0x30
> [ 54.453761] ? __kthread_create_on_node+0x150/0x150
> [ 54.454943] ret_from_fork+0x1f/0x30
> [ 54.456095] NMI backtrace for cpu 0
> [ 54.457221] CPU: 0 PID: 2910 Comm: md5sum Not tainted 5.6.0-00001-g6e142c237f00 #1309
> [ 54.458355] Hardware name: Gigabyte Technology Co., Ltd. GA-MA790FX-DQ6/GA-MA790FX-DQ6, BIOS F7g 07/19/2010
> [ 54.459484] Call Trace:
> [ 54.460576] <IRQ>
> [ 54.461672] dump_stack+0x50/0x70
> [ 54.462772] nmi_cpu_backtrace.cold+0x14/0x53
> [ 54.463871] ? lapic_can_unplug_cpu.cold+0x3e/0x3e
> [ 54.464955] nmi_trigger_cpumask_backtrace+0x7c/0x89
> [ 54.466026] rcu_dump_cpu_stacks+0x7b/0xa9
> [ 54.467088] rcu_sched_clock_irq.cold+0x153/0x38a
> [ 54.468146] update_process_times+0x1f/0x50
> [ 54.469204] tick_sched_timer+0x33/0x70
> [ 54.470262] ? tick_sched_do_timer+0x50/0x50
> [ 54.471321] __hrtimer_run_queues+0xe2/0x180
> [ 54.472378] hrtimer_interrupt+0x109/0x240
> [ 54.473423] smp_apic_timer_interrupt+0x48/0x80
> [ 54.474461] apic_timer_interrupt+0xf/0x20
> [ 54.475486] </IRQ>
> [ 54.476495] RIP: 0033:0x556cbd33bf19
> [ 54.477506] Code: ce 44 8b 4b 10 c1 c9 0f 01 d1 44 89 4c 24 c8 21 ce 31 c6 01 fe 41 8d bc 01 af 0f 7c f5 89 d0 44 8b 4b 3c c1 ce 0a 31 c8 01 ce <21> f0 31 d0 01 f8 41 8d bc 12 2a c6 87 47 89 ca 41 89 ea c1 c0 07
> [ 54.479694] RSP: 002b:00007ffc30913ce8 EFLAGS: 00000283 ORIG_RAX: ffffffffffffff13
> [ 54.480813] RAX: 00000000980270bd RBX: 0000556cbe81e4e0 RCX: 00000000c35c3b1a
> [ 54.481943] RDX: 000000005b5e4ba7 RSI: 00000000ae8ee5ae RDI: 0000000009b5de85
> [ 54.483075] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
> [ 54.484201] R10: 0000000000000000 R11: 00000000b16eb4f8 R12: 0000000000000000
> [ 54.485317] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000023604445
>
I don't see anything wireguard-related in this stacktrace. Can you try
sending one that has something wireguard-related in it? Or is more complete?
More information about the WireGuard
mailing list