Context Navigation

smp.t @ 29e6637e

4.115

Last change on this file since 29e6637e was 29e6637e, checked in by Sebastian Huber <sebastian.huber@…>, on 05/05/14 at 19:55:13

doc: Move SMP glossary to global glossary

Add some terms.

Property mode set to 100644

File size: 19.6 KB

Line
1	@c
2	@c COPYRIGHT (c) 2014.
3	@c On-Line Applications Research Corporation (OAR).
4	@c All rights reserved.
5	@c
6
7	@chapter Symmetric Multiprocessing Services
8
9	@section Introduction
10
11	This chapter describes the services related to Symmetric Multiprocessing
12	provided by RTEMS.
13
14	The application level services currently provided are:
15
16	@itemize @bullet
17	@item @code{rtems_get_processor_count} - Get processor count
18	@item @code{rtems_get_current_processor} - Get current processor index
19	@item @code{rtems_scheduler_ident} - Get ID of a scheduler
20	@item @code{rtems_scheduler_get_processor_set} - Get processor set of a scheduler
21	@item @code{rtems_task_get_scheduler} - Get scheduler of a task
22	@item @code{rtems_task_set_scheduler} - Set scheduler of a task
23	@item @code{rtems_task_get_affinity} - Get task processor affinity
24	@item @code{rtems_task_set_affinity} - Set task processor affinity
25	@end itemize
26
27	@c
28	@c
29	@c
30	@section Background
31
32	@subsection Uniprocessor versus SMP Parallelism
33
34	Uniprocessor systems have long been used in embedded systems. In this hardware
35	model, there are some system execution characteristics which have long been
36	taken for granted:
37
38	@itemize @bullet
39	@item one task executes at a time
40	@item hardware events result in interrupts
41	@end itemize
42
43	There is no true parallelism. Even when interrupts appear to occur
44	at the same time, they are processed in largely a serial fashion.
45	This is true even when the interupt service routines are allowed to
46	nest. From a tasking viewpoint, it is the responsibility of the real-time
47	operatimg system to simulate parallelism by switching between tasks.
48	These task switches occur in response to hardware interrupt events and explicit
49	application events such as blocking for a resource or delaying.
50
51	With symmetric multiprocessing, the presence of multiple processors
52	allows for true concurrency and provides for cost-effective performance
53	improvements. Uniprocessors tend to increase performance by increasing
54	clock speed and complexity. This tends to lead to hot, power hungry
55	microprocessors which are poorly suited for many embedded applications.
56
57	The true concurrency is in sharp contrast to the single task and
58	interrupt model of uniprocessor systems. This results in a fundamental
59	change to uniprocessor system characteristics listed above. Developers
60	are faced with a different set of characteristics which, in turn, break
61	some existing assumptions and result in new challenges. In an SMP system
62	with N processors, these are the new execution characteristics.
63
64	@itemize @bullet
65	@item N tasks execute in parallel
66	@item hardware events result in interrupts
67	@end itemize
68
69	There is true parallelism with a task executing on each processor and
70	the possibility of interrupts occurring on each processor. Thus in contrast
71	to their being one task and one interrupt to consider on a uniprocessor,
72	there are N tasks and potentially N simultaneous interrupts to consider
73	on an SMP system.
74
75	This increase in hardware complexity and presence of true parallelism
76	results in the application developer needing to be even more cautious
77	about mutual exclusion and shared data access than in a uniprocessor
78	embedded system. Race conditions that never or rarely happened when an
79	application executed on a uniprocessor system, become much more likely
80	due to multiple threads executing in parallel. On a uniprocessor system,
81	these race conditions would only happen when a task switch occurred at
82	just the wrong moment. Now there are N-1 tasks executing in parallel
83	all the time and this results in many more opportunities for small
84	windows in critical sections to be hit.
85
86	@subsection Task Affinity
87
88	@cindex task affinity
89	@cindex thread affinity
90
91	RTEMS provides services to manipulate the affinity of a task. Affinity
92	is used to specify the subset of processors in an SMP system on which
93	a particular task can execute.
94
95	By default, tasks have an affinity which allows them to execute on any
96	available processor.
97
98	Task affinity is a possible feature to be supported by SMP-aware
99	schedulers. However, only a subset of the available schedulers support
100	affinity. Although the behavior is scheduler specific, if the scheduler
101	does not support affinity, it is likely to ignore all attempts to set
102	affinity.
103
104	@subsection Task Migration
105
106	@cindex task migration
107	@cindex thread migration
108
109	With more than one processor in the system tasks can migrate from one processor
110	to another. There are three reasons why tasks migrate in RTEMS.
111
112	@itemize @bullet
113	@item The scheduler changes explicitly via @code{rtems_task_set_scheduler()} or
114	similar directives.
115	@item The task resumes execution after a blocking operation. On a priority
116	based scheduler it will evict the lowest priority task currently assigned to a
117	processor in the processor set managed by the scheduler instance.
118	@item The task moves temporarily to another scheduler instance due to locking
119	protocols like @cite{Migratory Priority Inheritance} or the
120	@cite{Multiprocessor Resource Sharing Protocol}.
121	@end itemize
122
123	Task migration should be avoided so that the working set of a task can stay on
124	the most local cache level.
125
126	The current implementation of task migration in RTEMS has some implications
127	with respect to the interrupt latency. It is crucial to preserve the system
128	invariant that a task can execute on at most one processor in the system at a
129	time. This is accomplished with a boolean indicator in the task context. The
130	processor architecture specific low-level task context switch code will mark
131	that a task context is no longer executing and waits that the heir context
132	stopped execution before it restores the heir context and resumes execution of
133	the heir task. So there is one point in time in which a processor is without a
134	task. This is essential to avoid cyclic dependencies in case multiple tasks
135	migrate at once. Otherwise some supervising entity is necessary to prevent
136	life-locks. Such a global supervisor would lead to scalability problems so
137	this approach is not used. Currently the thread dispatch is performed with
138	interrupts disabled. So in case the heir task is currently executing on
139	another processor then this prolongs the time of disabled interrupts since one
140	processor has to wait for another processor to make progress.
141
142	It is difficult to avoid this issue with the interrupt latency since interrupts
143	normally store the context of the interrupted task on its stack. In case a
144	task is marked as not executing we must not use its task stack to store such an
145	interrupt context. We cannot use the heir stack before it stopped execution on
146	another processor. So if we enable interrupts during this transition we have
147	to provide an alternative task independent stack for this time frame. This
148	issue needs further investigation.
149
150	@subsection Critical Section Techniques and SMP
151
152	As discussed earlier, SMP systems have opportunities for true parallelism
153	which was not possible on uniprocessor systems. Consequently, multiple
154	techniques that provided adequate critical sections on uniprocessor
155	systems are unsafe on SMP systems. In this section, some of these
156	unsafe techniques will be discussed.
157
158	In general, applications must use proper operating system provided mutual
159	exclusion mechanisms to ensure correct behavior. This primarily means
160	the use of binary semaphores or mutexes to implement critical sections.
161
162	@subsubsection Disable Interrupts
163
164	Again on a uniprocessor system, there is only a single processor which
165	logically executes a single task and takes interrupts. On an SMP system,
166	each processor may take an interrupt. When the application disables
167	interrupts, it generally does so by altering a processor register to
168	mask interrupts and later to re-enable them. On a uniprocessor system,
169	changing this in the single processor is sufficient. However, on an SMP
170	system, this register in @strong{ALL} processors must be changed. There
171	are no comparable capabilities in an SMP system to disable all interrupts
172	across all processors.
173
174	@subsubsection Highest Priority Task Assumption
175
176	On a uniprocessor system, it is safe to assume that when the highest
177	priority task in an application executes, it will execute without being
178	preempted until it voluntarily blocks. Interrupts may occur while it is
179	executing, but there will be no context switch to another task unless
180	the highest priority task voluntarily initiates it.
181
182	Given the assumption that no other tasks will have their execution
183	interleaved with the highest priority task, it is possible for this
184	task to be constructed such that it does not need to acquire a binary
185	semaphore or mutex for protected access to shared data.
186
187	In an SMP system, it cannot be assumed there will never be a single task
188	executing. It should be assumed that every processor is executing another
189	application task. Further, those tasks will be ones which would not have
190	been executed in a uniprocessor configuration and should be assumed to
191	have data synchronization conflicts with what was formerly the highest
192	priority task which executed without conflict.
193
194	@subsubsection Disable Preemption
195
196	On a uniprocessor system, disabling preemption in a task is very similar
197	to making the highest priority task assumption. While preemption is
198	disabled, no task context switches will occur unless the task initiates
199	them voluntarily. And, just as with the highest priority task assumption,
200	there are N-1 processors also running tasks. Thus the assumption that no
201	other tasks will run while the task has preemption disabled is violated.
202
203	@subsection Task Unique Data and SMP
204
205	Per task variables are a service commonly provided by real-time operating
206	systems for application use. They work by allowing the application
207	to specify a location in memory (typically a @code{void *}) which is
208	logically added to the context of a task. On each task switch, the
209	location in memory is stored and each task can have a unique value in
210	the same memory location. This memory location is directly accessed as a
211	variable in a program.
212
213	This works well in a uniprocessor environment because there is one task
214	executing and one memory location containing a task-specific value. But
215	it is fundamentally broken on an SMP system because there are always N
216	tasks executing. With only one location in memory, N-1 tasks will not
217	have the correct value.
218
219	This paradigm for providing task unique data values is fundamentally
220	broken on SMP systems.
221
222	@subsubsection Classic API Per Task Variables
223
224	The Classic API provides three directives to support per task variables. These are:
225
226	@itemize @bullet
227	@item @code{@value{DIRPREFIX}task_variable_add} - Associate per task variable
228	@item @code{@value{DIRPREFIX}task_variable_get} - Obtain value of a a per task variable
229	@item @code{@value{DIRPREFIX}task_variable_delete} - Remove per task variable
230	@end itemize
231
232	As task variables are unsafe for use on SMP systems, the use of these
233	services should be eliminated in all software that is to be used in
234	an SMP environment. It is recommended that the application developer
235	consider the use of POSIX Keys or Thread Local Storage (TLS). POSIX Keys
236	are not enabled in all RTEMS configurations.
237
238	@b{STATUS}: As of March 2014, some support services in the
239	@code{rtems/cpukit} use per task variables. When these uses are
240	eliminated, the per task variable directives will be disabled when
241	building RTEMS in SMP configuration.
242
243	@c
244	@c
245	@c
246	@section Operations
247
248	@subsection Setting Affinity to a Single Processor
249
250	On some embedded applications targeting SMP systems, it may be beneficial to
251	lock individual tasks to specific processors. In this way, one can designate a
252	processor for I/O tasks, another for computation, etc.. The following
253	illustrates the code sequence necessary to assign a task an affinity for
254	processor with index @code{processor_index}.
255
256	@example
257	@group
258	#include <rtems.h>
259	#include <assert.h>
260
261	void pin_to_processor(rtems_id task_id, int processor_index)
262	@{
263	rtems_status_code sc;
264	cpu_set_t cpuset;
265
266	CPU_ZERO(&cpuset);
267	CPU_SET(processor_index, &cpuset);
268
269	sc = rtems_task_set_affinity(task_id, sizeof(cpuset), &cpuset);
270	assert(sc == RTEMS_SUCCESSFUL);
271	@}
272	@end group
273	@end example
274
275	It is important to note that the @code{cpuset} is not validated until the
276	@code{@value{DIRPREFIX}task_set_affinity} call is made. At that point,
277	it is validated against the current system configuration.
278
279	@c
280	@c
281	@c
282	@section Directives
283
284	This section details the symmetric multiprocessing services. A subsection
285	is dedicated to each of these services and describes the calling sequence,
286	related constants, usage, and status codes.
287
288	@c
289	@c rtems_get_processor_count
290	@c
291	@page
292	@subsection GET_PROCESSOR_COUNT - Get processor count
293
294	@subheading CALLING SEQUENCE:
295
296	@ifset is-C
297	@example
298	uint32_t rtems_get_processor_count(void);
299	@end example
300	@end ifset
301
302	@ifset is-Ada
303	@end ifset
304
305	@subheading DIRECTIVE STATUS CODES:
306
307	The count of processors in the system.
308
309	@subheading DESCRIPTION:
310
311	On uni-processor configurations a value of one will be returned.
312
313	On SMP configurations this returns the value of a global variable set during
314	system initialization to indicate the count of utilized processors. The
315	processor count depends on the physically or virtually available processors and
316	application configuration. The value will always be less than or equal to the
317	maximum count of application configured processors.
318
319	@subheading NOTES:
320
321	None.
322
323	@c
324	@c rtems_get_current_processor
325	@c
326	@page
327	@subsection GET_CURRENT_PROCESSOR - Get current processor index
328
329	@subheading CALLING SEQUENCE:
330
331	@ifset is-C
332	@example
333	uint32_t rtems_get_current_processor(void);
334	@end example
335	@end ifset
336
337	@ifset is-Ada
338	@end ifset
339
340	@subheading DIRECTIVE STATUS CODES:
341
342	The index of the current processor.
343
344	@subheading DESCRIPTION:
345
346	On uni-processor configurations a value of zero will be returned.
347
348	On SMP configurations an architecture specific method is used to obtain the
349	index of the current processor in the system. The set of processor indices is
350	the range of integers starting with zero up to the processor count minus one.
351
352	Outside of sections with disabled thread dispatching the current processor
353	index may change after every instruction since the thread may migrate from one
354	processor to another. Sections with disabled interrupts are sections with
355	thread dispatching disabled.
356
357	@subheading NOTES:
358
359	None.
360
361	@c
362	@c rtems_scheduler_ident
363	@c
364	@page
365	@subsection SCHEDULER_IDENT - Get ID of a scheduler
366
367	@subheading CALLING SEQUENCE:
368
369	@ifset is-C
370	@example
371	rtems_status_code rtems_scheduler_ident(
372	rtems_name name,
373	rtems_id *id
374	);
375	@end example
376	@end ifset
377
378	@ifset is-Ada
379	@end ifset
380
381	@subheading DIRECTIVE STATUS CODES:
382
383	@code{@value{RPREFIX}SUCCESSFUL} - successful operation@*
384	@code{@value{RPREFIX}INVALID_ADDRESS} - @code{id} is NULL@*
385	@code{@value{RPREFIX}INVALID_NAME} - invalid scheduler name@*
386	@code{@value{RPREFIX}UNSATISFIED} - - a scheduler with this name exists, but
387	the processor set of this scheduler is empty
388
389	@subheading DESCRIPTION:
390
391	Identifies a scheduler by its name. The scheduler name is determined by the
392	scheduler configuration. @xref{Configuring a System Configuring
393	Clustered/Partitioned Schedulers}.
394
395	@subheading NOTES:
396
397	None.
398
399	@c
400	@c rtems_scheduler_get_processor_set
401	@c
402	@page
403	@subsection SCHEDULER_GET_PROCESSOR_SET - Get processor set of a scheduler
404
405	@subheading CALLING SEQUENCE:
406
407	@ifset is-C
408	@example
409	rtems_status_code rtems_scheduler_get_processor_set(
410	rtems_id scheduler_id,
411	size_t cpusetsize,
412	cpu_set_t *cpuset
413	);
414	@end example
415	@end ifset
416
417	@ifset is-Ada
418	@end ifset
419
420	@subheading DIRECTIVE STATUS CODES:
421
422	@code{@value{RPREFIX}SUCCESSFUL} - successful operation@*
423	@code{@value{RPREFIX}INVALID_ADDRESS} - @code{cpuset} is NULL@*
424	@code{@value{RPREFIX}INVALID_ID} - invalid scheduler id@*
425	@code{@value{RPREFIX}INVALID_NUMBER} - the affinity set buffer is too small for
426	set of processors owned by the scheduler
427
428	@subheading DESCRIPTION:
429
430	Returns the processor set owned by the scheduler in @code{cpuset}. A set bit
431	in the processor set means that this processor is owned by the scheduler and a
432	cleared bit means the opposite.
433
434	@subheading NOTES:
435
436	None.
437
438	@c
439	@c rtems_task_get_scheduler
440	@c
441	@page
442	@subsection TASK_GET_SCHEDULER - Get scheduler of a task
443
444	@subheading CALLING SEQUENCE:
445
446	@ifset is-C
447	@example
448	rtems_status_code rtems_task_get_scheduler(
449	rtems_id id,
450	rtems_id *scheduler_id
451	);
452	@end example
453	@end ifset
454
455	@ifset is-Ada
456	@end ifset
457
458	@subheading DIRECTIVE STATUS CODES:
459
460	@code{@value{RPREFIX}SUCCESSFUL} - successful operation@*
461	@code{@value{RPREFIX}INVALID_ADDRESS} - @code{scheduler_id} is NULL@*
462	@code{@value{RPREFIX}INVALID_ID} - invalid task id
463
464	@subheading DESCRIPTION:
465
466	Returns the scheduler identifier of a task in @code{scheduler_id}.
467
468	@subheading NOTES:
469
470	None.
471
472	@c
473	@c rtems_task_set_scheduler
474	@c
475	@page
476	@subsection TASK_SET_SCHEDULER - Set scheduler of a task
477
478	@subheading CALLING SEQUENCE:
479
480	@ifset is-C
481	@example
482	rtems_status_code rtems_task_set_scheduler(
483	rtems_id id,
484	rtems_id scheduler_id
485	);
486	@end example
487	@end ifset
488
489	@ifset is-Ada
490	@end ifset
491
492	@subheading DIRECTIVE STATUS CODES:
493
494	@code{@value{RPREFIX}SUCCESSFUL} - successful operation@*
495	@code{@value{RPREFIX}INVALID_ID} - invalid task or scheduler id@*
496	@code{@value{RPREFIX}INCORRECT_STATE} - the task is in the wrong state to
497	perform a scheduler change
498
499	@subheading DESCRIPTION:
500
501	Sets the scheduler of a task specified by @code{scheduler_id}. The scheduler
502	of a task is initialized to the scheduler of the task that created it.
503
504	@subheading NOTES:
505
506	None.
507
508	@subheading EXAMPLE:
509
510	@example
511	@group
512	#include <rtems.h>
513	#include <assert.h>
514
515	void task(rtems_task_argument arg);
516
517	void example(void)
518	@{
519	rtems_status_code sc;
520	rtems_id task_id;
521	rtems_id scheduler_id;
522	rtems_name scheduler_name;
523
524	scheduler_name = rtems_build_name('W', 'O', 'R', 'K');
525
526	sc = rtems_scheduler_ident(scheduler_name, &scheduler_id);
527	assert(sc == RTEMS_SUCCESSFUL);
528
529	sc = rtems_task_create(
530	rtems_build_name('T', 'A', 'S', 'K'),
531	1,
532	RTEMS_MINIMUM_STACK_SIZE,
533	RTEMS_DEFAULT_MODES,
534	RTEMS_DEFAULT_ATTRIBUTES,
535	&task_id
536	);
537	assert(sc == RTEMS_SUCCESSFUL);
538
539	sc = rtems_task_set_scheduler(task_id, scheduler_id);
540	assert(sc == RTEMS_SUCCESSFUL);
541
542	sc = rtems_task_start(task_id, task, 0);
543	assert(sc == RTEMS_SUCCESSFUL);
544	@}
545	@end group
546	@end example
547
548	@c
549	@c rtems_task_get_affinity
550	@c
551	@page
552	@subsection TASK_GET_AFFINITY - Get task processor affinity
553
554	@subheading CALLING SEQUENCE:
555
556	@ifset is-C
557	@example
558	rtems_status_code rtems_task_get_affinity(
559	rtems_id id,
560	size_t cpusetsize,
561	cpu_set_t *cpuset
562	);
563	@end example
564	@end ifset
565
566	@ifset is-Ada
567	@end ifset
568
569	@subheading DIRECTIVE STATUS CODES:
570
571	@code{@value{RPREFIX}SUCCESSFUL} - successful operation@*
572	@code{@value{RPREFIX}INVALID_ADDRESS} - @code{cpuset} is NULL@*
573	@code{@value{RPREFIX}INVALID_ID} - invalid task id@*
574	@code{@value{RPREFIX}INVALID_NUMBER} - the affinity set buffer is too small for
575	the current processor affinity set of the task
576
577	@subheading DESCRIPTION:
578
579	Returns the current processor affinity set of the task in @code{cpuset}. A set
580	bit in the affinity set means that the task can execute on this processor and a
581	cleared bit means the opposite.
582
583	@subheading NOTES:
584
585	None.
586
587	@c
588	@c rtems_task_set_affinity
589	@c
590	@page
591	@subsection TASK_SET_AFFINITY - Set task processor affinity
592
593	@subheading CALLING SEQUENCE:
594
595	@ifset is-C
596	@example
597	rtems_status_code rtems_task_set_affinity(
598	rtems_id id,
599	size_t cpusetsize,
600	const cpu_set_t *cpuset
601	);
602	@end example
603	@end ifset
604
605	@ifset is-Ada
606	@end ifset
607
608	@subheading DIRECTIVE STATUS CODES:
609
610	@code{@value{RPREFIX}SUCCESSFUL} - successful operation@*
611	@code{@value{RPREFIX}INVALID_ADDRESS} - @code{cpuset} is NULL@*
612	@code{@value{RPREFIX}INVALID_ID} - invalid task id@*
613	@code{@value{RPREFIX}INVALID_NUMBER} - invalid processor affinity set
614
615	@subheading DESCRIPTION:
616
617	Sets the processor affinity set for the task specified by @code{cpuset}. A set
618	bit in the affinity set means that the task can execute on this processor and a
619	cleared bit means the opposite.
620
621	@subheading NOTES:
622
623	None.

Note: See TracBrowser for help on using the repository browser.

Context Navigation

source: rtems/doc/user/smp.t @ 29e6637e

Download in other formats: