257 lines
9.1 KiB
Plaintext
257 lines
9.1 KiB
Plaintext
|
==========================
|
||
|
KERNEL ABIS FOR METAG ARCH
|
||
|
==========================
|
||
|
|
||
|
This document describes the Linux ABIs for the metag architecture, and has the
|
||
|
following sections:
|
||
|
|
||
|
(*) Outline of registers
|
||
|
(*) Userland registers
|
||
|
(*) Kernel registers
|
||
|
(*) System call ABI
|
||
|
(*) Calling conventions
|
||
|
|
||
|
|
||
|
====================
|
||
|
OUTLINE OF REGISTERS
|
||
|
====================
|
||
|
|
||
|
The main Meta core registers are arranged in units:
|
||
|
|
||
|
UNIT Type DESCRIPTION GP EXT PRIV GLOBAL
|
||
|
======= ======= =============== ======= ======= ======= =======
|
||
|
CT Special Control unit
|
||
|
D0 General Data unit 0 0-7 8-15 16-31 16-31
|
||
|
D1 General Data unit 1 0-7 8-15 16-31 16-31
|
||
|
A0 General Address unit 0 0-3 4-7 8-15 8-15
|
||
|
A1 General Address unit 1 0-3 4-7 8-15 8-15
|
||
|
PC Special PC unit 0 1
|
||
|
PORT Special Ports
|
||
|
TR Special Trigger unit 0-7
|
||
|
TT Special Trace unit 0-5
|
||
|
FX General FP unit 0-15
|
||
|
|
||
|
GP registers form part of the main context.
|
||
|
|
||
|
Extended context registers (EXT) may not be present on all hardware threads and
|
||
|
can be context switched if support is enabled and the appropriate bits are set
|
||
|
in e.g. the D0.8 register to indicate what extended state to preserve.
|
||
|
|
||
|
Global registers are shared between threads and are privilege protected.
|
||
|
|
||
|
See arch/metag/include/asm/metag_regs.h for definitions relating to core
|
||
|
registers and the fields and bits they contain. See the TRMs for further details
|
||
|
about special registers.
|
||
|
|
||
|
Several special registers are preserved in the main context, these are the
|
||
|
interesting ones:
|
||
|
|
||
|
REG (ALIAS) PURPOSE
|
||
|
======================= ===============================================
|
||
|
CT.1 (TXMODE) Processor mode bits (particularly for DSP)
|
||
|
CT.2 (TXSTATUS) Condition flags and LSM_STEP (MGET/MSET step)
|
||
|
CT.3 (TXRPT) Branch repeat counter
|
||
|
PC.0 (PC) Program counter
|
||
|
|
||
|
Some of the general registers have special purposes in the ABI and therefore
|
||
|
have aliases:
|
||
|
|
||
|
D0 REG (ALIAS) PURPOSE D1 REG (ALIAS) PURPOSE
|
||
|
=============== =============== =============== =======================
|
||
|
D0.0 (D0Re0) 32bit result D1.0 (D1Re0) Top half of 64bit result
|
||
|
D0.1 (D0Ar6) Argument 6 D1.1 (D1Ar5) Argument 5
|
||
|
D0.2 (D0Ar4) Argument 4 D1.2 (D1Ar3) Argument 3
|
||
|
D0.3 (D0Ar2) Argument 2 D1.3 (D1Ar1) Argument 1
|
||
|
D0.4 (D0FrT) Frame temp D1.4 (D1RtP) Return pointer
|
||
|
D0.5 Call preserved D1.5 Call preserved
|
||
|
D0.6 Call preserved D1.6 Call preserved
|
||
|
D0.7 Call preserved D1.7 Call preserved
|
||
|
|
||
|
A0 REG (ALIAS) PURPOSE A1 REG (ALIAS) PURPOSE
|
||
|
=============== =============== =============== =======================
|
||
|
A0.0 (A0StP) Stack pointer A1.0 (A1GbP) Global base pointer
|
||
|
A0.1 (A0FrP) Frame pointer A1.1 (A1LbP) Local base pointer
|
||
|
A0.2 A1.2
|
||
|
A0.3 A1.3
|
||
|
|
||
|
|
||
|
==================
|
||
|
USERLAND REGISTERS
|
||
|
==================
|
||
|
|
||
|
All the general purpose D0, D1, A0, A1 registers are preserved when entering the
|
||
|
kernel (including asynchronous events such as interrupts and timer ticks) except
|
||
|
the following which have special purposes in the ABI:
|
||
|
|
||
|
REGISTERS WHEN STATUS PURPOSE
|
||
|
=============== ======= =============== ===============================
|
||
|
D0.8 DSP Preserved ECH, determines what extended
|
||
|
DSP state to preserve.
|
||
|
A0.0 (A0StP) ALWAYS Preserved Stack >= A0StP may be clobbered
|
||
|
at any time by the creation of a
|
||
|
signal frame.
|
||
|
A1.0 (A1GbP) SMP Clobbered Used as temporary for loading
|
||
|
kernel stack pointer and saving
|
||
|
core context.
|
||
|
A0.15 !SMP Protected Stores kernel stack pointer.
|
||
|
A1.15 ALWAYS Protected Stores kernel base pointer.
|
||
|
|
||
|
On UP A0.15 is used to store the kernel stack pointer for storing the userland
|
||
|
context. A0.15 is global between hardware threads though which means it cannot
|
||
|
be used on SMP for this purpose. Since no protected local registers are
|
||
|
available A1GbP is reserved for use as a temporary to allow a percpu stack
|
||
|
pointer to be loaded for storing the rest of the context.
|
||
|
|
||
|
|
||
|
================
|
||
|
KERNEL REGISTERS
|
||
|
================
|
||
|
|
||
|
When in the kernel the following registers have special purposes in the ABI:
|
||
|
|
||
|
REGISTERS WHEN STATUS PURPOSE
|
||
|
=============== ======= =============== ===============================
|
||
|
A0.0 (A0StP) ALWAYS Preserved Stack >= A0StP may be clobbered
|
||
|
at any time by the creation of
|
||
|
an irq signal frame.
|
||
|
A1.0 (A1GbP) ALWAYS Preserved Reserved (kernel base pointer).
|
||
|
|
||
|
|
||
|
===============
|
||
|
SYSTEM CALL ABI
|
||
|
===============
|
||
|
|
||
|
When a system call is made, the following registers are effective:
|
||
|
|
||
|
REGISTERS CALL RETURN
|
||
|
=============== ======================= ===============================
|
||
|
D0.0 (D0Re0) Return value (or -errno)
|
||
|
D1.0 (D1Re0) System call number Clobbered
|
||
|
D0.1 (D0Ar6) Syscall arg #6 Preserved
|
||
|
D1.1 (D1Ar5) Syscall arg #5 Preserved
|
||
|
D0.2 (D0Ar4) Syscall arg #4 Preserved
|
||
|
D1.2 (D1Ar3) Syscall arg #3 Preserved
|
||
|
D0.3 (D0Ar2) Syscall arg #2 Preserved
|
||
|
D1.3 (D1Ar1) Syscall arg #1 Preserved
|
||
|
|
||
|
Due to the limited number of argument registers and some system calls with badly
|
||
|
aligned 64-bit arguments, 64-bit values are always packed in consecutive
|
||
|
arguments, even if this is contrary to the normal calling conventions (where the
|
||
|
two halves would go in a matching pair of data registers).
|
||
|
|
||
|
For example fadvise64_64 usually has the signature:
|
||
|
|
||
|
long sys_fadvise64_64(i32 fd, i64 offs, i64 len, i32 advice);
|
||
|
|
||
|
But for metag fadvise64_64 is wrapped so that the 64-bit arguments are packed:
|
||
|
|
||
|
long sys_fadvise64_64_metag(i32 fd, i32 offs_lo,
|
||
|
i32 offs_hi, i32 len_lo,
|
||
|
i32 len_hi, i32 advice)
|
||
|
|
||
|
So the arguments are packed in the registers like this:
|
||
|
|
||
|
D0 REG (ALIAS) VALUE D1 REG (ALIAS) VALUE
|
||
|
=============== =============== =============== =======================
|
||
|
D0.1 (D0Ar6) advice D1.1 (D1Ar5) hi(len)
|
||
|
D0.2 (D0Ar4) lo(len) D1.2 (D1Ar3) hi(offs)
|
||
|
D0.3 (D0Ar2) lo(offs) D1.3 (D1Ar1) fd
|
||
|
|
||
|
|
||
|
===================
|
||
|
CALLING CONVENTIONS
|
||
|
===================
|
||
|
|
||
|
These calling conventions apply to both user and kernel code. The stack grows
|
||
|
from low addresses to high addresses in the metag ABI. The stack pointer (A0StP)
|
||
|
should always point to the next free address on the stack and should at all
|
||
|
times be 64-bit aligned. The following registers are effective at the point of a
|
||
|
call:
|
||
|
|
||
|
REGISTERS CALL RETURN
|
||
|
=============== ======================= ===============================
|
||
|
D0.0 (D0Re0) 32bit return value
|
||
|
D1.0 (D1Re0) Upper half of 64bit return value
|
||
|
D0.1 (D0Ar6) 32bit argument #6 Clobbered
|
||
|
D1.1 (D1Ar5) 32bit argument #5 Clobbered
|
||
|
D0.2 (D0Ar4) 32bit argument #4 Clobbered
|
||
|
D1.2 (D1Ar3) 32bit argument #3 Clobbered
|
||
|
D0.3 (D0Ar2) 32bit argument #2 Clobbered
|
||
|
D1.3 (D1Ar1) 32bit argument #1 Clobbered
|
||
|
D0.4 (D0FrT) Clobbered
|
||
|
D1.4 (D1RtP) Return pointer Clobbered
|
||
|
D{0-1}.{5-7} Preserved
|
||
|
A0.0 (A0StP) Stack pointer Preserved
|
||
|
A1.0 (A0GbP) Preserved
|
||
|
A0.1 (A0FrP) Frame pointer Preserved
|
||
|
A1.1 (A0LbP) Preserved
|
||
|
A{0-1},{2-3} Clobbered
|
||
|
|
||
|
64-bit arguments are placed in matching pairs of registers (i.e. the same
|
||
|
register number in both D0 and D1 units), with the least significant half in D0
|
||
|
and the most significant half in D1, leaving a gap where necessary. Further
|
||
|
arguments are stored on the stack in reverse order (earlier arguments at higher
|
||
|
addresses):
|
||
|
|
||
|
ADDRESS 0 1 2 3 4 5 6 7
|
||
|
=============== ===== ===== ===== ===== ===== ===== ===== =====
|
||
|
A0StP -->
|
||
|
A0StP-0x08 32bit argument #8 32bit argument #7
|
||
|
A0StP-0x10 32bit argument #10 32bit argument #9
|
||
|
|
||
|
Function prologues tend to look a bit like this:
|
||
|
|
||
|
/* If frame pointer in use, move it to frame temp register so it can be
|
||
|
easily pushed onto stack */
|
||
|
MOV D0FrT,A0FrP
|
||
|
|
||
|
/* If frame pointer in use, set it to stack pointer */
|
||
|
ADD A0FrP,A0StP,#0
|
||
|
|
||
|
/* Preserve D0FrT, D1RtP, D{0-1}.{5-7} on stack, incrementing A0StP */
|
||
|
MSETL [A0StP++],D0FrT,D0.5,D0.6,D0.7
|
||
|
|
||
|
/* Allocate some stack space for local variables */
|
||
|
ADD A0StP,A0StP,#0x10
|
||
|
|
||
|
At this point the stack would look like this:
|
||
|
|
||
|
ADDRESS 0 1 2 3 4 5 6 7
|
||
|
=============== ===== ===== ===== ===== ===== ===== ===== =====
|
||
|
A0StP -->
|
||
|
A0StP-0x08
|
||
|
A0StP-0x10
|
||
|
A0StP-0x18 Old D0.7 Old D1.7
|
||
|
A0StP-0x20 Old D0.6 Old D1.6
|
||
|
A0StP-0x28 Old D0.5 Old D1.5
|
||
|
A0FrP --> Old A0FrP (frame ptr) Old D1RtP (return ptr)
|
||
|
A0FrP-0x08 32bit argument #8 32bit argument #7
|
||
|
A0FrP-0x10 32bit argument #10 32bit argument #9
|
||
|
|
||
|
Function epilogues tend to differ depending on the use of a frame pointer. An
|
||
|
example of a frame pointer epilogue:
|
||
|
|
||
|
/* Restore D0FrT, D1RtP, D{0-1}.{5-7} from stack, incrementing A0FrP */
|
||
|
MGETL D0FrT,D0.5,D0.6,D0.7,[A0FrP++]
|
||
|
/* Restore stack pointer to where frame pointer was before increment */
|
||
|
SUB A0StP,A0FrP,#0x20
|
||
|
/* Restore frame pointer from frame temp */
|
||
|
MOV A0FrP,D0FrT
|
||
|
/* Return to caller via restored return pointer */
|
||
|
MOV PC,D1RtP
|
||
|
|
||
|
If the function hasn't touched the frame pointer, MGETL cannot be safely used
|
||
|
with A0StP as it always increments and that would expose the stack to clobbering
|
||
|
by interrupts (kernel) or signals (user). Therefore it's common to see the MGETL
|
||
|
split into separate GETL instructions:
|
||
|
|
||
|
/* Restore D0FrT, D1RtP, D{0-1}.{5-7} from stack */
|
||
|
GETL D0FrT,D1RtP,[A0StP+#-0x30]
|
||
|
GETL D0.5,D1.5,[A0StP+#-0x28]
|
||
|
GETL D0.6,D1.6,[A0StP+#-0x20]
|
||
|
GETL D0.7,D1.7,[A0StP+#-0x18]
|
||
|
/* Restore stack pointer */
|
||
|
SUB A0StP,A0StP,#0x30
|
||
|
/* Return to caller via restored return pointer */
|
||
|
MOV PC,D1RtP
|