Context Navigation

source: rtems/doc/porting/taskcontext.t @ bb4b574

4.104.114.84.95

Last change on this file since bb4b574 was bb4b574, checked in by Joel Sherrill <joel.sherrill@…>, on 04/25/00 at 13:15:14
Merged changes from 4.5 branch and removed that branch.
Property mode set to `100644`
File size: 21.8 KB

Line
1	@c
2	@c COPYRIGHT (c) 1988-1999.
3	@c On-Line Applications Research Corporation (OAR).
4	@c All rights reserved.
5	@c
6	@c $Id$
7	@c
8
9	@chapter Task Context Management
10
11	@section Introduction
12
13	XXX
14
15	@section Task Stacks
16
17	XXX
18
19	@subsection Direction of Stack Growth
20
21	The CPU_STACK_GROWS_UP macro is set based upon the answer to the following
22	question: Does the stack grow up (toward higher addresses) or down (toward
23	lower addresses)? If the stack grows upward in memory, then this macro
24	should be set to TRUE. Otherwise, it should be set to FALSE to indicate
25	that the stack grows downward toward smaller addresses.
26
27	The following illustrates how the CPU_STACK_GROWS_UP macro is set:
28
29	@example
30	#define CPU_STACK_GROWS_UP TRUE
31	@end example
32
33	@subsection Minimum Task Stack Size
34
35	The CPU_STACK_MINIMUM_SIZE macro should be set to the minimum size of each
36	task stack. This size is specified as the number of bytes. This minimum
37	stack size should be large enough to run all RTEMS tests. The minimum
38	stack size is chosen such that a "reasonable" small application should not
39	have any problems. Choosing a minimum stack size that is too small will
40	result in the RTEMS tests "blowing" their stack and not executing
41	properly.
42
43	There are many reasons a task could require a stack size larger than the
44	minimum. For example, a task could have a very deep call path or declare
45	large data structures on the stack. Tasks which utilize C++ exceptions
46	tend to require larger stacks as do Ada tasks.
47
48	The following illustrates setting the minimum stack size to 4 kilobytes
49	per task.
50
51	@example
52	#define CPU_STACK_MINIMUM_SIZE (1024*4)
53	@end example
54
55	@subsection Stack Alignment Requirements
56
57	The CPU_STACK_ALIGNMENT macro is set to indicate the byte alignment
58	requirement for the stack. This alignment requirement may be stricter
59	than that for the data types alignment specified by CPU_ALIGNMENT. If the
60	CPU_ALIGNMENT is strict enough for the stack, then this should be set to
61	0.
62
63	The following illustrates how the CPU_STACK_ALIGNMENT macro should be set
64	when there are no special requirements:
65
66	@example
67	#define CPU_STACK_ALIGNMENT 0
68	@end example
69
70	NOTE: This must be a power of 2 either 0 or greater than CPU_ALIGNMENT. [XXX is this true?]
71
72	@section Task Context
73
74	Associated with each task is a context that distinguishes it from other
75	tasks in the system and logically gives it its own scratch pad area for
76	computations. In addition, when an interrupt occurs some processor
77	context information must be saved and restored. This is managed in RTEMS
78	as three items:
79
80	@itemize @bullet
81
82	@item Basic task level context (e.g. the Context_Control structure)
83
84	@item Floating point task context (e.g. Context_Control_fp structure)
85
86	@item Interrupt level context (e.g. the Context_Control_interrupt
87	structure)
88
89	@end itemize
90
91	The integer and floating point context structures and the routines that
92	manipulate them are discussed in detail in this section, while the
93	interrupt level context structure is discussed in the XXX.
94
95	Additionally, if the GNU debugger gdb is to be made aware of RTEMS tasks
96	for this CPU, then care should be used in designing the context area.
97
98	@example
99	typedef struct @{
100	unsigned32 special_interrupt_register;
101	@} CPU_Interrupt_frame;
102	@end example
103
104
105	@subsection Basic Context Data Structure
106
107	The Context_Control data structure contains the basic integer context of a
108	task. In addition, this context area contains stack and frame pointers,
109	processor status register(s), and any other registers that are normally
110	altered by compiler generated code. In addition, this context must
111	contain the processor interrupt level since the processor interrupt level
112	is maintained on a per-task basis. This is necessary to support the
113	interrupt level portion of the task mode as provided by the Classic RTEMS
114	API.
115
116	On some processors, it is cost-effective to save only the callee preserved
117	registers during a task context switch. This means that the ISR code
118	needs to save those registers which do not persist across function calls.
119	It is not mandatory to make this distinctions between the caller/callee
120	saves registers for the purpose of minimizing context saved during task
121	switch and on interrupts. If the cost of saving extra registers is
122	minimal, simplicity is the choice. Save the same context on interrupt
123	entry as for tasks in this case.
124
125	The Context_Control data structure should be defined such that the order
126	of elements results in the simplest, most efficient implementation of XXX.
127	A typical implementation starts with a definition such as the following:
128
129	@example
130	typedef struct @{
131	unsigned32 some_integer_register;
132	unsigned32 another_integer_register;
133	unsigned32 some_system_register;
134	@} Context_Control;
135	@end example
136
137	@subsection Initializing a Context
138
139	The _CPU_Context_Initialize routine initializes the context to a state
140	suitable for starting a task after a context restore operation.
141	Generally, this involves:
142
143	@itemize @bullet
144
145	@item setting a starting address,
146
147	@item preparing the stack,
148
149	@item preparing the stack and frame pointers,
150
151	@item setting the proper interrupt level in the context, and
152
153	@item initializing the floating point context
154
155	@end itemize
156
157	This routine generally does not set any unnecessary register in the
158	context. The state of the "general data" registers is undefined at task
159	start time. The _CPU_Context_initialize routine is prototyped as follows:
160
161	@example
162	void _CPU_Context_Initialize(
163	Context_Control *_the_context,
164	void *_stack_base,
165	unsigned32 _size,
166	unsigned32 _isr,
167	void *_entry_point,
168	unsigned32 _is_fp
169	);
170	@end example
171
172	The @code{is_fp} parameter is TRUE if the thread is to be a floating point
173	thread. This is typically only used on CPUs where the FPU may be easily
174	disabled by software such as on the SPARC where the PSR contains an enable
175	FPU bit. The use of an FPU enable bit allows RTEMS to ensure that a
176	non-floating point task is unable to access the FPU. This guarantees that
177	a deferred floating point context switch is safe.
178
179	The @code{_stack_base} parameter is the base address of the memory area
180	allocated for use as the task stack. It is critical to understand that
181	@code{_stack_base} may not be the starting stack pointer for this task.
182	On CPU families where the stack grows from high addresses to lower ones,
183	(i.e. @code{CPU_STACK_GROWS_UP} is FALSE) the starting stack point
184	will be near the end of the stack memory area or close to
185	@code{_stack_base} + @code{_size}. Even on CPU families where the stack
186	grows from low to higher addresses, there may be some required
187	outermost stack frame that must be put at the address @code{_stack_base}.
188
189	The @code{_size} parameter is the requested size in bytes of the stack for
190	this task. It is assumed that the memory area @code{_stack_base}
191	is of this size.
192
193	XXX explain other parameters and check prototype
194
195	@subsection Performing a Context Switch
196
197	The _CPU_Context_switch performs a normal non-FP context switch from the
198	context of the current executing thread to the context of the heir thread.
199
200	@example
201	void _CPU_Context_switch(
202	Context_Control *run,
203	Context_Control *heir
204	);
205	@end example
206
207	This routine begins by saving the current state of the
208	CPU (i.e. the context) in the context area at @code{run}.
209	Then the routine should load the CPU context pointed to
210	by @code{heir}. Loading the new context will cause a
211	branch to its task code, so the task that invoked
212	@code{_CPU_Context_switch} will not run for a while.
213	When, eventually, a context switch is made to load
214	context from @code{*run} again, this task will resume
215	and @code{_CPU_Context_switch} will return to its caller.
216
217	Care should be exercise when writing this routine. All
218	registers assumed to be preserved across subroutine calls
219	must be preserved. These registers may be saved in
220	the task's context area or on its stack. However, the
221	stack pointer and address to resume executing the task
222	at must be included in the context (normally the subroutine
223	return address to the caller of @code{_Thread_Dispatch}.
224	The decision of where to store the task's context is based
225	on numerous factors including the capabilities of
226	the CPU architecture itself and simplicity as well
227	as external considerations such as debuggers wishing
228	to examine a task's context. In this case, it is
229	often simpler to save all data in the context area.
230
231	Also there may be special considerations
232	when loading the stack pointers or interrupt level of the
233	incoming task. Independent of CPU specific considerations,
234	if some context is saved on the task stack, then the porter
235	must ensure that the stack pointer is adjusted @b{BEFORE}
236	to make room for this context information before the
237	information is written. Otherwise, an interrupt could
238	occur writing over the context data. The following is
239	an example of an @b{INCORRECT} sequence:
240
241	@example
242	save part of context beyond current top of stack
243	interrupt pushes context -- overwriting written context
244	interrupt returns
245	adjust stack pointer
246	@end example
247
248	@subsection Restoring a Context
249
250	The _CPU_Context_restore routine is generally used only to restart the
251	currently executing thread (i.e. self) in an efficient manner. In many
252	ports, it can simply be a label in _CPU_Context_switch. It may be
253	unnecessary to reload some registers.
254
255	@example
256	void _CPU_Context_restore(
257	Context_Control *new_context
258	);
259	@end example
260
261	@subsection Restarting the Currently Executing Task
262
263	The _CPU_Context_Restart_self is responsible for somehow restarting the
264	currently executing task. If you are lucky when porting RTEMS, then all
265	that is necessary is restoring the context. Otherwise, there will need to
266	be a routine that does something special in this case. Performing a
267	_CPU_Context_Restore on the currently executing task after reinitializing
268	that context should work on most ports. It will not work if restarting
269	self conflicts with the stack frame assumptions of restoring a context.
270
271	The following is an implementation of _CPU_Context_Restart_self that can
272	be used when no special handling is required for this case.
273
274	@example
275	#define _CPU_Context_Restart_self( _the_context ) \
276	_CPU_Context_restore( (_the_context) )
277	@end example
278
279	XXX find a port which does not do it this way and include it here
280
281	@section Floating Point Context
282
283	@subsection CPU_HAS_FPU Macro Definition
284
285	The CPU_HAS_FPU macro is set based on the answer to the question: Does the
286	CPU have hardware floating point? If the CPU has an FPU, then this should
287	be set to TRUE. Otherwise, it should be set to FALSE. The primary
288	implication of setting this macro to TRUE is that it indicates that tasks
289	may have floating point contexts. In the Classic API, this means that the
290	RTEMS_FLOATING_POINT task attribute specified as part of rtems_task_create
291	is supported on this CPU. If CPU_HAS_FPU is set to FALSE, then no tasks
292	or threads may be floating point and the RTEMS_FLOATING_POINT task
293	attribute is ignored. On an API such as POSIX where all threads
294	implicitly have a floating point context, then the setting of this macro
295	determines whether every POSIX thread has a floating point context.
296
297	The following example illustrates how the CPU_HARDWARE_FP (XXX macro name
298	is varying) macro is set based on the CPU family dependent macro.
299
300	@example
301	#if ( THIS_CPU_FAMILY_HAS_FPU == 1 ) /* where THIS_CPU_FAMILY */
302	/* might be M68K */
303	#define CPU_HARDWARE_FP TRUE
304	#else
305	#define CPU_HARDWARE_FP FALSE
306	#endif
307	@end example
308
309	The macro name THIS_CPU_FAMILY_HAS_FPU should be made CPU specific. It
310	indicates whether or not this CPU model has FP support. For example, the
311	definition of the i386ex and i386sx CPU models would set I386_HAS_FPU to
312	FALSE to indicate that these CPU models are i386's without an i387 and
313	wish to leave floating point support out of RTEMS when built for the
314	i386_nofp processor model. On a CPU with a built-in FPU like the i486,
315	this would be defined as TRUE.
316
317	On some processor families, the setting of the THIS_CPU_FAMILY_HAS_FPU
318	macro may be derived from compiler predefinitions. This can be used when
319	the compiler distinguishes the individual CPU models for this CPU family
320	as distinctly as RTEMS requires. Often RTEMS needs to need more about the
321	CPU model than the compiler because of differences at the system level
322	such as caching, interrupt structure.
323
324	@subsection CPU_ALL_TASKS_ARE_FP Macro Setting
325
326	The CPU_ALL_TASKS_ARE_FP macro is set to TRUE or FALSE based upon the
327	answer to the following question: Are all tasks RTEMS_FLOATING_POINT tasks
328	implicitly? If this macro is set TRUE, then all tasks and threads are
329	assumed to have a floating point context. In the Classic API, this is
330	equivalent to setting the RTEMS_FLOATING_POINT task attribute on all
331	rtems_task_create calls. If the CPU_ALL_TASKS_ARE_FP macro is set to
332	FALSE, then the RTEMS_FLOATING_POINT task attribute in the Classic API is
333	honored.
334
335	The rationale for this macro is that if a function that an application
336	developer would not think utilize the FP unit DOES, then one can not
337	easily predict which tasks will use the FP hardware. In this case, this
338	option should be TRUE. So far, the only CPU families for which this macro
339	has been to TRUE are the HP PA-RISC and PowerPC. For the HP PA-RISC, the
340	HP C compiler and gcc both implicitly use the floating point registers to
341	perform integer multiplies. For the PowerPC, this feature macro is set to
342	TRUE because the printf routine saves a floating point register whether or
343	not a floating point number is actually printed. If the newlib
344	implementation of printf were restructured to avoid this, then the PowerPC
345	port would not have to have this option set to TRUE.
346
347	The following example illustrates how the CPU_ALL_TASKS_ARE_FP is set on
348	the PowerPC. On this CPU family, this macro is set to TRUE if the CPU
349	model has hardware floating point.
350
351	@example
352	#if (CPU_HARDWARE_FP == TRUE)
353	#define CPU_ALL_TASKS_ARE_FP TRUE
354	#else
355	#define CPU_ALL_TASKS_ARE_FP FALSE
356	#endif
357	@end example
358
359	NOTE: If CPU_HARDWARE_FP is FALSE, then this should be FALSE as well.
360
361	@subsection CPU_USE_DEFERRED_FP_SWITCH Macro Setting
362
363	The CPU_USE_DEFERRED_FP_SWITCH macro is set based upon the answer to the
364	following question: Should the saving of the floating point registers be
365	deferred until a context switch is made to another different floating
366	point task? If the floating point context will not be stored until
367	necessary, then this macro should be set to TRUE. When set to TRUE, the
368	floating point context of a task will remain in the floating point
369	registers and not disturbed until another floating point task is switched
370	to.
371
372	If the CPU_USE_DEFERRED_FP_SWITCH is set to FALSE, then the floating point
373	context is saved each time a floating point task is switched out and
374	restored when the next floating point task is restored. The state of the
375	floating point registers between those two operations is not specified.
376
377	There are a couple of known cases where the port should not defer saving
378	the floating point context. The first case is when the compiler generates
379	instructions that use the FPU when floating point is not actually used.
380	This occurs on the HP PA-RISC for example when an integer multiply is
381	performed. On the PowerPC, the printf routine includes a save of a
382	floating point register to support printing floating point numbers even if
383	the path that actually prints the floating point number is not invoked.
384	In both of these cases, deferred floating point context switches can not
385	be used. If the floating point context has to be saved as part of
386	interrupt dispatching, then it may also be necessary to disable deferred
387	context switches.
388
389	Setting this flag to TRUE results in using a different algorithm for
390	deciding when to save and restore the floating point context. The
391	deferred FP switch algorithm minimizes the number of times the FP context
392	is saved and restored. The FP context is not saved until a context switch
393	is made to another, different FP task. Thus in a system with only one FP
394	task, the FP context will never be saved or restored.
395
396	The following illustrates setting the CPU_USE_DEFERRED_FP_SWITCH macro on
397	a processor family such as the M68K or i386 which can use deferred
398	floating point context switches.
399
400	@example
401	#define CPU_USE_DEFERRED_FP_SWITCH TRUE
402	@end example
403
404	@subsection Floating Point Context Data Structure
405
406	The Context_Control_fp contains the per task information for the floating
407	point unit. The organization of this structure may be a list of floating
408	point registers along with any floating point control and status registers
409	or it simply consist of an array of a fixed number of bytes. Defining the
410	floating point context area as an array of bytes is done when the floating
411	point context is dumped by a "FP save context" type instruction and the
412	format is either not completely defined by the CPU documentation or the
413	format is not critical for the implementation of the floating point
414	context switch routines. In this case, there is no need to figure out the
415	exact format -- only the size. Of course, although this is enough
416	information for RTEMS, it is probably not enough for a debugger such as
417	gdb. But that is another problem.
418
419	@example
420	typedef struct @{
421	double some_float_register;
422	@} Context_Control_fp;
423	@end example
424
425
426	On some CPUs with hardware floating point support, the Context_Control_fp
427	structure will not be used.
428
429	@subsection Size of Floating Point Context Macro
430
431	The CPU_CONTEXT_FP_SIZE macro is set to the size of the floating point
432	context area. On some CPUs this will not be a "sizeof" because the format
433	of the floating point area is not defined -- only the size is. This is
434	usually on CPUs with a "floating point save context" instruction. In
435	general, though it is easier to define the structure as a "sizeof"
436	operation and define the Context_Control_fp structure to be an area of
437	bytes of the required size in this case.
438
439	@example
440	#define CPU_CONTEXT_FP_SIZE sizeof( Context_Control_fp )
441	@end example
442
443	@subsection Start of Floating Point Context Area Macro
444
445	The _CPU_Context_Fp_start macro is used in the XXX routine and allows the initial pointer into a floating point context area (used to save the floating point context) to be at an arbitrary place in the floating point context area. This is necessary because some FP units are designed to have their context saved as a stack which grows into lower addresses. Other FP units can be saved by simply moving registers into offsets from the base of the context area. Finally some FP units provide a "dump context" instruction which could fill in from high to low or low to high based on the whim of the CPU designers. Regardless, the address at which that floating point context area pointer should start within the actual floating point context area varies between ports and this macro provides a clean way of addressing this.
446
447	This is a common implementation of the _CPU_Context_Fp_start routine which
448	is suitable for many processors. In particular, RISC processors tend to
449	use this implementation since the floating point context is saved as a
450	sequence of store operations.
451
452	@example
453	#define _CPU_Context_Fp_start( _base, _offset ) \
454	( (void *) _Addresses_Add_offset( (_base), (_offset) ) )
455	@end example
456
457	In contrast, the m68k treats the floating point context area as a stack
458	which grows downward in memory. Thus the following implementation of
459	_CPU_Context_Fp_start is used in that port:
460
461
462	@example
463	XXX insert m68k version here
464	@end example
465
466	@subsection Initializing a Floating Point Context
467
468	The _CPU_Context_Initialize_fp routine initializes the floating point
469	context area passed to it to. There are a few standard ways in which to
470	initialize the floating point context. The simplest, and least
471	deterministic behaviorally, is to do nothing. This leaves the FPU in a
472	random state and is generally not a suitable way to implement this
473	routine. The second common implementation is to place a "null FP status
474	word" into some status/control register in the FPU. This mechanism is
475	simple and works on many FPUs. Another common way is to initialize the
476	FPU to a known state during _CPU_Initialize and save the context (using
477	_CPU_Context_save_fp_context) into the special floating point context
478	_CPU_Null_fp_context. Then all that is required to initialize a floating
479	point context is to copy _CPU_Null_fp_context to the destination floating
480	point context passed to it. The following example implementation shows
481	how to accomplish this:
482
483	@example
484	#define _CPU_Context_Initialize_fp( _destination ) \
485	@{ \
486	((Context_Control_fp ) ((void *) _destination)) = \
487	_CPU_Null_fp_context; \
488	@}
489	@end example
490
491	The _CPU_Null_fp_context is optional. A port need only include this variable when it uses the above mechanism to initialize a floating point context. This is typically done on CPUs where it is difficult to generate an "uninitialized" FP context. If the port requires this variable, then it is declared as follows:
492
493	@example
494	Context_Control_fp _CPU_Null_fp_context;
495	@end example
496
497
498	@subsection Saving a Floating Point Context
499
500	The _CPU_Context_save_fp_context routine is responsible for saving the FP
501	context at *fp_context_ptr. If the point to load the FP context from is
502	changed then the pointer is modified by this routine.
503
504	Sometimes a macro implementation of this is in cpu.h which dereferences
505	the ** and a similarly named routine in this file is passed something like
506	a (Context_Control_fp *). The general rule on making this decision is to
507	avoid writing assembly language.
508
509	@example
510	void _CPU_Context_save_fp(
511	void **fp_context_ptr
512	)
513	@end example
514
515	@subsection Restoring a Floating Point Context
516
517	The _CPU_Context_restore_fp_context is responsible for restoring the FP
518	context at *fp_context_ptr. If the point to load the FP context from is
519	changed then the pointer is modified by this routine.
520
521	Sometimes a macro implementation of this is in cpu.h which dereferences
522	the ** and a similarly named routine in this file is passed something like
523	a (Context_Control_fp *). The general rule on making this decision is to
524	avoid writing assembly language.
525
526	@example
527	void _CPU_Context_restore_fp(
528	void **fp_context_ptr
529	);
530	@end example
531

Note: See TracBrowser for help on using the repository browser.

Download in other formats: