nohz_full: Add rcu_dyntick data for scalable detection of all-idle state

This commit adds fields to the rcu_dynticks structure that are used to detect idle CPUs. The new fields differ from the existing ones in one respect: the existing fields consider a CPU executing in user mode to be idle, whereas the new fields consider a CPU executing in user mode to be busy. The handling of the new fields is otherwise quite similar to that of the existing fields. This commit also adds the initialization required for these fields.

So, why is usermode execution treated differently, with RCU considering it a quiescent state equivalent to idle, while the new full-system idle state detection considers usermode execution to be non-idle?

It turns out that although usermode execution is one of RCU's quiescent states, it is not a full-system idle state. This is because the purpose of the full-system idle state is not RCU, but rather determining when accurate timekeeping can safely be disabled. Whenever accurate timekeeping is required in a CONFIG_NO_HZ_FULL kernel, at least one CPU must keep the scheduling-clock tick going. If even one CPU is executing in user mode, accurate timekeeping is required, particularly for architectures where gettimeofday() and friends do not enter the kernel. Only when all CPUs are really and truly idle can accurate timekeeping be disabled, allowing all CPUs to turn off the scheduling-clock interrupt, thus greatly improving energy efficiency.

This naturally raises the question "Why is this code in RCU rather than in timekeeping?", and the answer is that RCU has the data and infrastructure to efficiently make this determination.

Signed-off-by: Paul E. McKenney <paulmck@linux.vnet.ibm.com>
Acked-by: Frederic Weisbecker <fweisbec@gmail.com>
Cc: Steven Rostedt <rostedt@goodmis.org>
Reviewed-by: Josh Triplett <josh@joshtriplett.org>
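The distinction drawn in the commit message can be illustrated with a small userspace sketch. It is not kernel code: the enum, the function names, and the linear scan over CPUs are all hypothetical and chosen only for illustration. In particular, the scan is a deliberately naive stand-in and not the scalable detection this patch series is building toward.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical per-CPU execution contexts; not kernel definitions. */
enum cpu_ctx { CTX_IDLE, CTX_USER, CTX_KERNEL };

/* RCU's view: the idle loop and usermode execution are both quiescent. */
static bool rcu_sees_idle(enum cpu_ctx ctx)
{
	return ctx == CTX_IDLE || ctx == CTX_USER;
}

/* Full-system-idle view: only the idle loop counts; usermode is busy. */
static bool sysidle_sees_idle(enum cpu_ctx ctx)
{
	return ctx == CTX_IDLE;
}

/*
 * Naive check: timekeeping could be relaxed only if every CPU is idle
 * in the full-system-idle sense. A single usermode CPU is enough to
 * make this return false.
 */
static bool all_cpus_sysidle(const enum cpu_ctx *ctx, size_t ncpus)
{
	for (size_t i = 0; i < ncpus; i++)
		if (!sysidle_sees_idle(ctx[i]))
			return false;
	return true;
}
```

With one CPU in the idle loop and another in user mode, rcu_sees_idle() is true for both CPUs, yet all_cpus_sysidle() is false, which is the case where at least one CPU must keep the scheduling-clock tick running.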
author: Paul E. McKenney <paulmck@linux.vnet.ibm.com> 2013-06-21 21:34:33 +0200
committer: Paul E. McKenney <paulmck@linux.vnet.ibm.com> 2013-08-19 03:58:31 +0200
commit: 2333210b26cf7aaf48d71343029afb860103d9f9 (patch)
tree: d3395575543b67a7ca6a91158d65c3ee760500bf /kernel/rcutree.c
parent: nohz_full: Add Kconfig parameter for scalable detection of all-idle state (diff)
download: linux-2333210b26cf7aaf48d71343029afb860103d9f9.tar.xz
linux-2333210b26cf7aaf48d71343029afb860103d9f9.zip
1 files changed, 5 insertions, 0 deletions
diff --git a/kernel/rcutree.c b/kernel/rcutree.c
index 8807019138c6..4f27b85d8c86 100644
--- a/kernel/rcutree.c
+++ b/kernel/rcutree.c
@@ -224,6 +224,10 @@ EXPORT_SYMBOL_GPL(rcu_note_context_switch);
 DEFINE_PER_CPU(struct rcu_dynticks, rcu_dynticks) = {
 	.dynticks_nesting = DYNTICK_TASK_EXIT_IDLE,
 	.dynticks = ATOMIC_INIT(1),
+#ifdef CONFIG_NO_HZ_FULL_SYSIDLE
+	.dynticks_idle_nesting = DYNTICK_TASK_NEST_VALUE,
+	.dynticks_idle = ATOMIC_INIT(1),
+#endif /* #ifdef CONFIG_NO_HZ_FULL_SYSIDLE */
 };
 
 static long blimit = 10;	/* Maximum callbacks per rcu_do_batch. */
@@ -2904,6 +2908,7 @@ rcu_init_percpu_data(int cpu, struct rcu_state *rsp, int preemptible)
 	rdp->blimit = blimit;
 	init_callback_list(rdp);  /* Re-enable callbacks on this CPU. */
 	rdp->dynticks->dynticks_nesting = DYNTICK_TASK_EXIT_IDLE;
+	rcu_sysidle_init_percpu_data(rdp->dynticks);
 	atomic_set(&rdp->dynticks->dynticks,
 		   (atomic_read(&rdp->dynticks->dynticks) & ~0x1) + 1);
 	raw_spin_unlock(&rnp->lock);		/* irqs remain disabled. */
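As a side note on the context lines of the second hunk, the expression `(atomic_read(...) & ~0x1) + 1` clears the low bit of the counter and then adds one, so the stored value is always odd. The sketch below isolates that arithmetic in a hypothetical helper, force_odd(), which is not a kernel function. It assumes RCU's convention that an odd dynticks value denotes a non-idle CPU; the same convention would explain why both counters in the first hunk start at ATOMIC_INIT(1).

```c
#include <stdint.h>

/*
 * Hypothetical helper mirroring the expression in the hunk above:
 * clear bit 0, then add 1, so the result is odd for any input.
 * An even input moves up to the next odd value; an odd input is unchanged.
 */
static int32_t force_odd(int32_t v)
{
	return (v & ~0x1) + 1;
}
```

For example, force_odd(4) and force_odd(5) both yield 5.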