source: rtems-docs/c-user/symmetric_multiprocessing_services.rst @ 794eb1b

Last change on this file since 794eb1b was 794eb1b, checked in by Chris Johns <chrisj@…>, on 09/14/17 at 00:32:28

c-user: Clarify SMP's rtems_get_processor_count.

.. comment SPDX-License-Identifier: CC-BY-SA-4.0

.. COMMENT: COPYRIGHT (c) 2014.
.. COMMENT: On-Line Applications Research Corporation (OAR).
.. COMMENT: Copyright (c) 2017 embedded brains GmbH.
.. COMMENT: All rights reserved.

Symmetric Multiprocessing (SMP)
*******************************

Introduction
============

The Symmetric Multiprocessing (SMP) support of RTEMS 4.12 is available on

- ARMv7-A,

- PowerPC, and

- SPARC.

.. warning::

   The SMP support must be explicitly enabled via the ``--enable-smp``
   configure command line option for the :term:`BSP` build.

RTEMS is supposed to be a real-time operating system.  What does this mean in
the context of SMP?  The RTEMS interpretation of real-time on SMP is the
support for :ref:`ClusteredScheduling` with priority based schedulers and
adequate locking protocols.  One aim is to enable a schedulability analysis
under the sporadic task model :cite:`Brandenburg:2011:SL`
:cite:`Burns:2013:MrsP`.

The directives provided by the SMP support are:

- rtems_get_processor_count_ - Get processor count

- rtems_get_current_processor_ - Get current processor index

Background
==========

Application Configuration
-------------------------

By default, the maximum processor count is set to one in the application
configuration.  To enable SMP, the application configuration option
:ref:`CONFIGURE_MAXIMUM_PROCESSORS <CONFIGURE_MAXIMUM_PROCESSORS>` must be
defined to a value greater than one.  It is recommended to use the smallest
value suitable for the application in order to save memory.  Each processor
needs an idle thread and an interrupt stack, for example.

The default scheduler for SMP applications supports up to 32 processors and is
a global fixed priority scheduler, see also :ref:`Configuring Clustered
Schedulers`.
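A minimal application configuration for an SMP system could look as follows.  This is a sketch only: the processor and task counts are arbitrary example values, and an ``Init`` task is assumed to be defined elsewhere in the application.

```c
/*
 * Sketch of an SMP application configuration (example values).
 * An Init task declared elsewhere is assumed.
 */
#define CONFIGURE_APPLICATION_NEEDS_CLOCK_DRIVER
#define CONFIGURE_APPLICATION_NEEDS_CONSOLE_DRIVER

/* Enable SMP with up to four processors. */
#define CONFIGURE_MAXIMUM_PROCESSORS 4

#define CONFIGURE_MAXIMUM_TASKS 8

#define CONFIGURE_RTEMS_INIT_TASKS_TABLE
#define CONFIGURE_INIT

#include <rtems/confdefs.h>
```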

The following compile-time test can be used to check if the SMP support is
available or not.

.. code-block:: c

    #include <rtems.h>

    #ifdef RTEMS_SMP
    #warning "SMP support is enabled"
    #else
    #warning "SMP support is disabled"
    #endif

Examples
--------

For example applications see `testsuites/smptests
<https://git.rtems.org/rtems/tree/testsuites/smptests>`_.

Uniprocessor versus SMP Parallelism
-----------------------------------

Uniprocessor systems have long been used in embedded systems. In this hardware
model, there are some system execution characteristics which have long been
taken for granted:

- one task executes at a time

- hardware events result in interrupts

There is no true parallelism. Even when interrupts appear to occur at the same
time, they are processed in a largely serial fashion.  This is true even when
the interrupt service routines are allowed to nest.  From a tasking viewpoint,
it is the responsibility of the real-time operating system to simulate
parallelism by switching between tasks.  These task switches occur in response
to hardware interrupt events and explicit application events such as blocking
for a resource or delaying.

With symmetric multiprocessing, the presence of multiple processors allows for
true concurrency and provides for cost-effective performance
improvements. Uniprocessors tend to increase performance by increasing clock
speed and complexity. This tends to lead to hot, power-hungry microprocessors
which are poorly suited for many embedded applications.

The true concurrency is in sharp contrast to the single task and interrupt
model of uniprocessor systems. This results in a fundamental change to the
uniprocessor system characteristics listed above. Developers are faced with a
different set of characteristics which, in turn, break some existing
assumptions and result in new challenges. In an SMP system with N processors,
these are the new execution characteristics:

- N tasks execute in parallel

- hardware events result in interrupts

There is true parallelism with a task executing on each processor and the
possibility of interrupts occurring on each processor. Thus in contrast to
there being one task and one interrupt to consider on a uniprocessor, there are
N tasks and potentially N simultaneous interrupts to consider on an SMP system.

This increase in hardware complexity and presence of true parallelism results
in the application developer needing to be even more cautious about mutual
exclusion and shared data access than in a uniprocessor embedded system. Race
conditions that never or rarely happened when an application executed on a
uniprocessor system become much more likely due to multiple threads executing
in parallel. On a uniprocessor system, these race conditions would only happen
when a task switch occurred at just the wrong moment. Now there are N-1 other
tasks executing in parallel all the time and this results in many more
opportunities for small windows in critical sections to be hit.

Task Affinity
-------------
.. index:: task affinity
.. index:: thread affinity

RTEMS provides services to manipulate the affinity of a task. Affinity is used
to specify the subset of processors in an SMP system on which a particular task
can execute.

By default, tasks have an affinity which allows them to execute on any
available processor.

Task affinity is a possible feature to be supported by SMP-aware
schedulers. However, only a subset of the available schedulers support
affinity. Although the behavior is scheduler specific, if the scheduler does
not support affinity, it is likely to ignore all attempts to set affinity.

The scheduler with support for arbitrary processor affinities uses a proof of
concept implementation.  See https://devel.rtems.org/ticket/2510.
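A short sketch of how a task's affinity could be restricted, assuming the task's scheduler supports processor affinities.  The processor index 1 is an arbitrary example value.

```c
#include <assert.h>
#include <rtems.h>

/*
 * Sketch only: pin the executing task to processor 1.  This assumes the
 * scheduler of this task supports processor affinity; otherwise the
 * request may be ignored or rejected.
 */
void pin_self_to_processor_one( void )
{
  rtems_status_code sc;
  cpu_set_t         cpuset;

  CPU_ZERO( &cpuset );
  CPU_SET( 1, &cpuset );

  sc = rtems_task_set_affinity( RTEMS_SELF, sizeof( cpuset ), &cpuset );
  assert( sc == RTEMS_SUCCESSFUL );
}
```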

Task Migration
--------------
.. index:: task migration
.. index:: thread migration

With more than one processor in the system tasks can migrate from one processor
to another.  There are four reasons why tasks migrate in RTEMS.

- The scheduler changes explicitly via
  :ref:`rtems_task_set_scheduler() <rtems_task_set_scheduler>` or similar
  directives.

- The task processor affinity changes explicitly via
  :ref:`rtems_task_set_affinity() <rtems_task_set_affinity>` or similar
  directives.

- The task resumes execution after a blocking operation.  On a priority based
  scheduler it will evict the lowest priority task currently assigned to a
  processor in the processor set managed by the scheduler instance.

- The task moves temporarily to another scheduler instance due to locking
  protocols like the :ref:`MrsP` or the :ref:`OMIP`.

Task migration should be avoided so that the working set of a task can stay on
the most local cache level.

.. _ClusteredScheduling:

Clustered Scheduling
--------------------

The scheduler is responsible for assigning processors to some of the threads
which are ready to execute.  Trouble starts if more ready threads than
processors exist at the same time.  There are various rules for how the
processor assignment can be performed, attempting to fulfill additional
constraints or yield some overall system properties.  As a matter of fact it is
impossible to meet all requirements at the same time.  The way a scheduler
works distinguishes real-time operating systems from general purpose operating
systems.

We have clustered scheduling in case the set of processors of a system is
partitioned into non-empty pairwise-disjoint subsets of processors.  These
subsets are called clusters.  Clusters with a cardinality of one are
partitions.  Each cluster is owned by exactly one scheduler instance.  In case
the cluster size equals the processor count, it is called global scheduling.

Modern SMP systems have multi-layer caches.  An operating system which neglects
cache constraints in the scheduler will not yield good performance.  Real-time
operating systems usually provide priority (fixed or job-level) based
schedulers so that each of the highest priority threads is assigned to a
processor.  Priority based schedulers have difficulties in providing cache
locality for threads and may suffer from excessive thread migrations
:cite:`Brandenburg:2011:SL` :cite:`Compagnin:2014:RUN`.  Schedulers that use
local run queues and some sort of load-balancing to improve the cache
utilization may not fulfill global constraints :cite:`Gujarati:2013:LPP` and
are more difficult to implement than one would normally expect
:cite:`Lozi:2016:LSDWC`.

Clustered scheduling was implemented for RTEMS SMP to best use the cache
topology of a system and to keep the worst-case latencies under control.  The
low-level SMP locks use FIFO ordering.  So, the worst-case run-time of
operations increases with each processor involved.  The scheduler configuration
is quite flexible and done at link-time, see :ref:`Configuring Clustered
Schedulers`.  It is possible to re-assign processors to schedulers during
run-time via :ref:`rtems_scheduler_add_processor()
<rtems_scheduler_add_processor>` and :ref:`rtems_scheduler_remove_processor()
<rtems_scheduler_remove_processor>`.  The schedulers are implemented in an
object-oriented fashion.

The problem is to provide synchronization primitives for inter-cluster
synchronization (more than one cluster is involved in the synchronization
process).  In RTEMS the following means are currently available:

- events,

- message queues,

- mutexes using the :ref:`OMIP`,

- mutexes using the :ref:`MrsP`, and

- binary and counting semaphores.

The clustered scheduling approach enables separation of functions with
real-time requirements and functions that profit from fairness and high
throughput, provided the scheduler instances are fully decoupled and adequate
inter-cluster synchronization primitives are used.

To set the scheduler of a task see :ref:`rtems_scheduler_ident()
<rtems_scheduler_ident>` and :ref:`rtems_task_set_scheduler()
<rtems_task_set_scheduler>`.
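The two directives above can be combined roughly as follows.  This is a sketch: the scheduler name ``WRK0`` and the priority value are hypothetical and must match the application's own scheduler configuration, and the three-argument form of ``rtems_task_set_scheduler()`` is assumed.

```c
#include <assert.h>
#include <rtems.h>

/*
 * Sketch only: move a task to the scheduler instance named 'WRK0'
 * (a hypothetical name from the application configuration) and give it
 * priority 1 in that scheduler instance.
 */
void move_task_to_worker_scheduler( rtems_id task_id )
{
  rtems_status_code sc;
  rtems_id          scheduler_id;

  /* Look up the scheduler instance by its configured name. */
  sc = rtems_scheduler_ident(
    rtems_build_name( 'W', 'R', 'K', '0' ),
    &scheduler_id
  );
  assert( sc == RTEMS_SUCCESSFUL );

  /* Re-assign the task to this scheduler instance. */
  sc = rtems_task_set_scheduler( task_id, scheduler_id, 1 );
  assert( sc == RTEMS_SUCCESSFUL );
}
```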

OpenMP
------

OpenMP support for RTEMS is available via the GCC provided libgomp.  There is
libgomp support for RTEMS in the POSIX configuration of libgomp since GCC 4.9
(requires a Newlib snapshot after 2015-03-12).  In GCC 6.1 or later (requires a
Newlib snapshot after 2015-07-30 for <sys/lock.h> provided self-contained
synchronization objects) there is a specialized libgomp configuration for RTEMS
which offers significantly better performance compared to the POSIX
configuration of libgomp.  In addition, application configurable thread pools
for each scheduler instance are available in GCC 6.1 or later.

The run-time configuration of libgomp is done via environment variables
documented in the `libgomp manual <https://gcc.gnu.org/onlinedocs/libgomp/>`_.
The environment variables are evaluated in a constructor function which
executes in the context of the first initialization task before the actual
initialization task function is called (just like a global C++ constructor).
To set application specific values, a higher priority constructor function must
be used to set up the environment variables.

.. code-block:: c

    #include <stdlib.h>
    void __attribute__((constructor(1000))) config_libgomp( void )
    {
        setenv( "OMP_DISPLAY_ENV", "VERBOSE", 1 );
        setenv( "GOMP_SPINCOUNT", "30000", 1 );
        setenv( "GOMP_RTEMS_THREAD_POOLS", "1$2@SCHD", 1 );
    }

The environment variable ``GOMP_RTEMS_THREAD_POOLS`` is RTEMS-specific.  It
determines the thread pools for each scheduler instance.  The format for
``GOMP_RTEMS_THREAD_POOLS`` is a list of optional
``<thread-pool-count>[$<priority>]@<scheduler-name>`` configurations separated
by ``:`` where:

- ``<thread-pool-count>`` is the thread pool count for this scheduler instance.

- ``$<priority>`` is an optional priority for the worker threads of a thread
  pool according to ``pthread_setschedparam``.  In case a priority value is
  omitted, then a worker thread will inherit the priority of the OpenMP master
  thread that created it.  The priority of the worker thread is not changed by
  libgomp after creation, even if a new OpenMP master thread using the worker
  has a different priority.

- ``@<scheduler-name>`` is the scheduler instance name according to the RTEMS
  application configuration.

In case no thread pool configuration is specified for a scheduler instance,
then each OpenMP master thread of this scheduler instance will use its own
dynamically allocated thread pool.  To limit the worker thread count of the
thread pools, each OpenMP master thread must call ``omp_set_num_threads``.

Let us suppose we have three scheduler instances ``IO``, ``WRK0``, and ``WRK1``
with ``GOMP_RTEMS_THREAD_POOLS`` set to ``"1@WRK0:3$4@WRK1"``.  Then there are
no thread pool restrictions for scheduler instance ``IO``.  In the scheduler
instance ``WRK0`` there is one thread pool available.  Since no priority is
specified for this scheduler instance, the worker thread inherits the priority
of the OpenMP master thread that created it.  In the scheduler instance
``WRK1`` there are three thread pools available and their worker threads run at
priority four.

Application Issues
==================

Most operating system services provided by the uni-processor RTEMS are
available in SMP configurations as well.  However, applications designed for a
uni-processor environment may need some changes to correctly run in an SMP
configuration.

As discussed earlier, SMP systems have opportunities for true parallelism which
were not possible on uni-processor systems. Consequently, multiple techniques
that provided adequate critical sections on uni-processor systems are unsafe on
SMP systems. In this section, some of these unsafe techniques will be
discussed.

In general, applications must use proper operating system provided mutual
exclusion mechanisms to ensure correct behavior.

Task variables
--------------

Task variables are ordinary global variables with a dedicated value for each
thread.  During a context switch from the executing thread to the heir thread,
the value of each task variable is saved to the thread control block of the
executing thread and restored from the thread control block of the heir thread.
This is inherently broken if more than one executing thread exists.
Alternatives to task variables are POSIX keys and :ref:`TLS <TLS>`.  All use
cases of task variables in the RTEMS code base were replaced with alternatives.
The task variable API has been removed in RTEMS 4.12.

Highest Priority Thread Never Walks Alone
-----------------------------------------

On a uni-processor system, it is safe to assume that when the highest priority
task in an application executes, it will execute without being preempted until
it voluntarily blocks. Interrupts may occur while it is executing, but there
will be no context switch to another task unless the highest priority task
voluntarily initiates it.

Given the assumption that no other tasks will have their execution interleaved
with the highest priority task, it is possible for this task to be constructed
such that it does not need to acquire a mutex for protected access to shared
data.

In an SMP system, it cannot be assumed there will never be a single task
executing. It should be assumed that every processor is executing another
application task. Further, those tasks will be ones which would not have been
executed in a uni-processor configuration and should be assumed to have data
synchronization conflicts with what was formerly the highest priority task
which executed without conflict.

Disabling of Thread Preemption
------------------------------

A thread which disables preemption prevents a higher priority thread from
taking over its processor involuntarily.  In uni-processor configurations, this
can be used to ensure mutual exclusion at thread level.  In SMP configurations,
however, more than one executing thread may exist.  Thus, it is impossible to
ensure mutual exclusion using this mechanism.  In order to prevent
inappropriate behaviour of applications using preemption for this purpose, this
feature is disabled in SMP configurations and its use would cause run-time
errors.

Disabling of Interrupts
-----------------------

A low overhead means to ensure mutual exclusion in uni-processor
configurations is the disabling of interrupts around a critical section.  This
is commonly used in device driver code.  In SMP configurations, however,
disabling the interrupts on one processor has no effect on other processors.
So, this is insufficient to ensure system-wide mutual exclusion.  The macros

* :ref:`rtems_interrupt_disable() <rtems_interrupt_disable>`,

* :ref:`rtems_interrupt_enable() <rtems_interrupt_enable>`, and

* :ref:`rtems_interrupt_flash() <rtems_interrupt_flash>`

are disabled in SMP configurations and their use will cause compile-time
warnings and link-time errors.  In the unlikely case that interrupts must be
disabled on the current processor, the

* :ref:`rtems_interrupt_local_disable() <rtems_interrupt_local_disable>` and

* :ref:`rtems_interrupt_local_enable() <rtems_interrupt_local_enable>`

macros are now available in all configurations.

Since disabling of interrupts is insufficient to ensure system-wide mutual
exclusion on SMP, a new low-level synchronization primitive was added --
interrupt locks.  The interrupt locks are a simple API layer on top of the SMP
locks used for low-level synchronization in the operating system core.
Currently, they are implemented as a ticket lock.  In uni-processor
configurations, they degenerate to simple interrupt disable/enable sequences by
means of the C pre-processor.  It is disallowed to acquire a single interrupt
lock in a nested way.  This will result in an infinite loop with interrupts
disabled.  While converting legacy code to interrupt locks, care must be taken
to avoid this situation.

.. code-block:: c
    :linenos:

    #include <rtems.h>

    void legacy_code_with_interrupt_disable_enable( void )
    {
      rtems_interrupt_level level;

      rtems_interrupt_disable( level );
      /* Critical section */
      rtems_interrupt_enable( level );
    }

    RTEMS_INTERRUPT_LOCK_DEFINE( static, lock, "Name" )

    void smp_ready_code_with_interrupt_lock( void )
    {
      rtems_interrupt_lock_context lock_context;

      rtems_interrupt_lock_acquire( &lock, &lock_context );
      /* Critical section */
      rtems_interrupt_lock_release( &lock, &lock_context );
    }

An alternative to the RTEMS-specific interrupt locks are POSIX spinlocks.  The
:c:type:`pthread_spinlock_t` is defined as a self-contained object, i.e. the
user must provide the storage for this synchronization object.

.. code-block:: c
    :linenos:

    #include <assert.h>
    #include <pthread.h>

    pthread_spinlock_t lock;

    void smp_ready_code_with_posix_spinlock( void )
    {
      int error;

      error = pthread_spin_lock( &lock );
      assert( error == 0 );
      /* Critical section */
      error = pthread_spin_unlock( &lock );
      assert( error == 0 );
    }

In contrast to the POSIX spinlock implementations on Linux or FreeBSD, it is
not allowed to call blocking operating system services inside the critical
section.  A recursive lock attempt is a severe usage error resulting in an
infinite loop with interrupts disabled.  Nesting of different locks is allowed.
The user must ensure that no deadlock can occur.  As a non-portable feature the
locks are zero-initialized, i.e. statically initialized global locks reside in
the ``.bss`` section and there is no need to call :c:func:`pthread_spin_init`.

Interrupt Service Routines Execute in Parallel With Threads
-----------------------------------------------------------

On a machine with more than one processor, interrupt service routines (this
includes timer service routines installed via :ref:`rtems_timer_fire_after()
<rtems_timer_fire_after>`) and threads can execute in parallel.  Interrupt
service routines must take this into account and use proper locking mechanisms
to protect critical sections from interference by threads (interrupt locks or
POSIX spinlocks).  This likely requires code modifications in legacy device
drivers.

Timers Do Not Stop Immediately
------------------------------

Timer service routines run in the context of the clock interrupt.  On
uni-processor configurations, it is sufficient to disable interrupts and remove
a timer from the set of active timers to stop it.  In SMP configurations,
however, the timer service routine may already run and wait on an SMP lock
owned by the thread which is about to stop the timer.  This opens the door to
subtle synchronization issues.  During destruction of objects, special care
must be taken to ensure that timer service routines cannot access (partly or
fully) destroyed objects.

False Sharing of Cache Lines Due to Objects Table
-------------------------------------------------

The Classic API and most POSIX API objects are indirectly accessed via an
object identifier.  The user-level functions validate the object identifier and
map it to the actual object structure which resides in a global objects table
for each object class.  So, unrelated objects are packed together in a table.
This may result in false sharing of cache lines.  The effect of false sharing
of cache lines can be observed with the `TMFINE 1
<https://git.rtems.org/rtems/tree/testsuites/tmtests/tmfine01>`_ test program
on a suitable platform, e.g. QorIQ T4240.  High-performance SMP applications
need full control of the object storage :cite:`Drepper:2007:Memory`.
Therefore, self-contained synchronization objects are now available for RTEMS.

Directives
==========

This section details the symmetric multiprocessing services.  A subsection is
dedicated to each of these services and describes the calling sequence, related
constants, usage, and status codes.

.. raw:: latex

   \clearpage

.. _rtems_get_processor_count:

GET_PROCESSOR_COUNT - Get processor count
-----------------------------------------

CALLING SEQUENCE:
    .. code-block:: c

        uint32_t rtems_get_processor_count(void);

DIRECTIVE STATUS CODES:

    The count of processors in the system that can be run. The value returned
    is the highest numbered processor index of all processors available to the
    application (if a scheduler is assigned) plus one.

DESCRIPTION:
    In uni-processor configurations, a value of one will be returned.

    In SMP configurations, this returns the value of a global variable set
    during system initialization to indicate the count of utilized processors.
    The processor count depends on the physically or virtually available
    processors and application configuration.  The value will always be less
    than or equal to the maximum count of application configured processors.

NOTES:
    None.

.. raw:: latex

   \clearpage

.. _rtems_get_current_processor:

GET_CURRENT_PROCESSOR - Get current processor index
---------------------------------------------------

CALLING SEQUENCE:
    .. code-block:: c

        uint32_t rtems_get_current_processor(void);

DIRECTIVE STATUS CODES:
    The index of the current processor.

DESCRIPTION:
    In uni-processor configurations, a value of zero will be returned.

    In SMP configurations, an architecture specific method is used to obtain the
    index of the current processor in the system.  The set of processor indices
    is the range of integers starting with zero up to the processor count minus
    one.

    Outside of sections with disabled thread dispatching the current processor
    index may change after every instruction since the thread may migrate from
    one processor to another.  Sections with disabled interrupts are sections
    with thread dispatching disabled.

NOTES:
    None.
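The two directives can be combined as in the following sketch.  Note that the printed index is only a snapshot: unless thread dispatching is disabled, the task may migrate at any instruction.

```c
#include <inttypes.h>
#include <stdio.h>
#include <rtems.h>

/*
 * Sketch only: report which processor executes this code.  The current
 * processor index is merely a snapshot unless thread dispatching is
 * disabled.
 */
void print_current_processor( void )
{
  uint32_t processor_count = rtems_get_processor_count();
  uint32_t current = rtems_get_current_processor();

  printf(
    "executing on processor %" PRIu32 " of %" PRIu32 "\n",
    current,
    processor_count
  );
}
```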

Implementation Details
======================

This section covers some implementation details of the RTEMS SMP support.

Low-Level Synchronization
-------------------------

All low-level synchronization primitives are implemented using :term:`C11`
atomic operations, so no target-specific hand-written assembler code is
necessary.  Four synchronization primitives are currently available:

* ticket locks (mutual exclusion),

* :term:`MCS` locks (mutual exclusion),

* barriers, implemented as a sense barrier, and

* sequence locks :cite:`Boehm:2012:Seqlock`.

A vital requirement for low-level mutual exclusion is :term:`FIFO` fairness
since we are interested in a predictable system and not maximum throughput.
With this requirement, there are only a few options to resolve this problem.
For reasons of simplicity, the ticket lock algorithm was chosen to implement
the SMP locks.  However, the API is capable of supporting MCS locks, which may
be interesting in the future for systems with a processor count in the range of
32 or more, e.g. :term:`NUMA`, many-core systems.

The test program `SMPLOCK 1
<https://git.rtems.org/rtems/tree/testsuites/smptests/smplock01>`_ can be used
to gather performance and fairness data for several scenarios.  The SMP lock
performance and fairness measured on the QorIQ T4240 follows as an example.
This chip contains three L2 caches.  Each L2 cache is shared by eight
processors.

.. image:: ../images/c_user/smplock01perf-t4240.*
   :width: 400
   :align: center

.. image:: ../images/c_user/smplock01fair-t4240.*
   :width: 400
   :align: center

Internal Locking
----------------

In SMP configurations, the operating system uses non-recursive SMP locks for
low-level mutual exclusion.  The locking domains are roughly

* a particular data structure,
* the thread queue operations,
* the thread state changes, and
* the scheduler operations.

For good average-case performance it is vital that every high-level
synchronization object, e.g. mutex, has its own SMP lock.  In the average case,
only this SMP lock should be involved to carry out a specific operation, e.g.
obtain/release a mutex.  In general, the high-level synchronization objects
have a thread queue embedded and use its SMP lock.

In case a thread must block on a thread queue, then things get complicated.
The executing thread first acquires the SMP lock of the thread queue and then
figures out that it needs to block.  The procedure to block the thread on this
particular thread queue involves state changes of the thread itself and for
this, thread-specific SMP locks must be used.

In order to determine if a thread is blocked on a thread queue or not,
thread-specific SMP locks must be used.  A thread priority change must
propagate this to the thread queue (possibly recursively).  Care must be taken
not to have a lock order reversal between thread queue and thread-specific SMP
locks.

Each scheduler instance has its own SMP lock.  For the scheduler helping
protocol multiple scheduler instances may be in charge of a thread.  It is not
possible to acquire two scheduler instance SMP locks at the same time,
otherwise deadlocks would happen.  A thread-specific SMP lock is used to
synchronize the thread data shared by different scheduler instances.

The thread state SMP lock protects various things, e.g. the thread state, join
operations, signals, post-switch actions, the home scheduler instance, etc.

Profiling
---------

To identify the bottlenecks in the system, support for profiling of low-level
synchronization is optionally available.  The profiling support is a BSP build
time configuration option (``--enable-profiling``) and is implemented with an
acceptable overhead, even for production systems.  A low-overhead counter for
short time intervals must be provided by the hardware.

Profiling reports are generated in XML for most test programs of the RTEMS
testsuite (more than 500 test programs).  This gives a good sample set for
statistics.  For example the maximum thread dispatch disable time, the maximum
interrupt latency or lock contention can be determined.

.. code-block:: xml

   <ProfilingReport name="SMPMIGRATION 1">
     <PerCPUProfilingReport processorIndex="0">
       <MaxThreadDispatchDisabledTime unit="ns">36636</MaxThreadDispatchDisabledTime>
       <MeanThreadDispatchDisabledTime unit="ns">5065</MeanThreadDispatchDisabledTime>
       <TotalThreadDispatchDisabledTime unit="ns">3846635988
         </TotalThreadDispatchDisabledTime>
       <ThreadDispatchDisabledCount>759395</ThreadDispatchDisabledCount>
       <MaxInterruptDelay unit="ns">8772</MaxInterruptDelay>
       <MaxInterruptTime unit="ns">13668</MaxInterruptTime>
       <MeanInterruptTime unit="ns">6221</MeanInterruptTime>
       <TotalInterruptTime unit="ns">6757072</TotalInterruptTime>
       <InterruptCount>1086</InterruptCount>
     </PerCPUProfilingReport>
     <PerCPUProfilingReport processorIndex="1">
       <MaxThreadDispatchDisabledTime unit="ns">39408</MaxThreadDispatchDisabledTime>
       <MeanThreadDispatchDisabledTime unit="ns">5060</MeanThreadDispatchDisabledTime>
       <TotalThreadDispatchDisabledTime unit="ns">3842749508
         </TotalThreadDispatchDisabledTime>
       <ThreadDispatchDisabledCount>759391</ThreadDispatchDisabledCount>
       <MaxInterruptDelay unit="ns">8412</MaxInterruptDelay>
       <MaxInterruptTime unit="ns">15868</MaxInterruptTime>
       <MeanInterruptTime unit="ns">3525</MeanInterruptTime>
       <TotalInterruptTime unit="ns">3814476</TotalInterruptTime>
       <InterruptCount>1082</InterruptCount>
     </PerCPUProfilingReport>
     <!-- more reports omitted -->
     <SMPLockProfilingReport name="Scheduler">
       <MaxAcquireTime unit="ns">7092</MaxAcquireTime>
       <MaxSectionTime unit="ns">10984</MaxSectionTime>
       <MeanAcquireTime unit="ns">2320</MeanAcquireTime>
       <MeanSectionTime unit="ns">199</MeanSectionTime>
       <TotalAcquireTime unit="ns">3523939244</TotalAcquireTime>
       <TotalSectionTime unit="ns">302545596</TotalSectionTime>
       <UsageCount>1518758</UsageCount>
       <ContentionCount initialQueueLength="0">759399</ContentionCount>
       <ContentionCount initialQueueLength="1">759359</ContentionCount>
       <ContentionCount initialQueueLength="2">0</ContentionCount>
       <ContentionCount initialQueueLength="3">0</ContentionCount>
     </SMPLockProfilingReport>
   </ProfilingReport>
700
Scheduler Helping Protocol
--------------------------

The scheduler provides a helping protocol to support locking protocols like
the :ref:`OMIP` or the :ref:`MrsP`.  Each thread has a scheduler node for
each scheduler instance in the system; these nodes are located in its
:term:`TCB`.  A thread has exactly one home scheduler instance, which is set
during thread creation.  The home scheduler instance can be changed with
:ref:`rtems_task_set_scheduler() <rtems_task_set_scheduler>`.  Due to the
locking protocols, a thread may gain access to scheduler nodes of other
scheduler instances.  This allows the thread to temporarily migrate to
another scheduler instance in case of preemption.

The scheduler infrastructure is based on an object-oriented design.  The
scheduler operations for a thread are defined as virtual functions.  For the
scheduler helping protocol the following operations must be implemented by an
SMP-aware scheduler:

* ask a scheduler node for help,
* reconsider the help request of a scheduler node,
* withdraw a scheduler node.

All currently available SMP-aware schedulers use a framework which is
customized via inline functions.  This eases the implementation of scheduler
variants.  Up to now, only priority-based schedulers are implemented.

In case a thread is allowed to use more than one scheduler node, it will ask
these nodes for help

* in case it is preempted, or
* in case an unblock did not schedule the thread, or
* in case a yield was successful.

The actual ask for help scheduler operations are carried out as a side-effect
of the thread dispatch procedure.  Once a need for help is recognized, a help
request is registered in one of the processors related to the thread and a
thread dispatch is issued.  This indirection leads to a better decoupling of
scheduler instances.  Unrelated processors are not burdened with extra work
for threads which participate in resource sharing.  Each ask for help
operation indicates whether it could help or not.  The procedure stops after
the first successful ask for help.  Unsuccessful ask for help operations
register this need in the scheduler context.

After a thread dispatch, the reconsider help request operation is used to
clean up stale help registrations in the scheduler contexts.

The withdraw operation takes away scheduler nodes once the thread is no
longer allowed to use them, e.g. after it has released a mutex.  The
availability of scheduler nodes for a thread is controlled by the thread
queues.

Thread Dispatch Details
-----------------------

This section gives background information to developers interested in the
interrupt latencies introduced by thread dispatching.  A thread dispatch
consists of all work which must be done to stop the currently executing
thread on a processor and hand over this processor to an heir thread.

In SMP systems, scheduling decisions on one processor must be propagated to
other processors through inter-processor interrupts.  A thread dispatch which
must be carried out on another processor does not happen instantaneously.
Thus, several thread dispatch requests may be in flight at any time, and some
of them may be out of date before the corresponding processor has time to
deal with them.  The thread dispatch mechanism uses three per-processor
variables:

- the executing thread,

- the heir thread, and

- a boolean flag indicating if a thread dispatch is necessary or not.

Updates of the heir thread are done via a normal store operation.  The thread
dispatch necessary indicator of another processor is set as a side-effect of
an inter-processor interrupt.  So, this change notification works without the
use of locks.  The thread context is protected by a :term:`TTAS` lock
embedded in the context to ensure that it is used on at most one processor at
a time.  Normally, only thread-specific or per-processor locks are used
during a thread dispatch.  This implementation turned out to be quite
efficient and no lock contention was observed in the testsuite.  The
heavy-weight thread dispatch sequence is only entered in case the thread
dispatch indicator is set.

The context switch is performed with interrupts enabled.  During the
transition from the executing thread to the heir thread, neither the stack of
the executing thread nor the stack of the heir thread may be used during
interrupt processing.  For this purpose, a temporary per-processor stack is
set up, which may be used by the interrupt prologue before the stack is
switched to the interrupt stack.