Function calling conventions and other conventions regarding the use of
machine registers and stack slots.
Require Import Coqlib.
Require Import Decidableplus.
Require Import AST.
Require Import Events.
Require Import Locations.
Require Import Compopts.
Require Archi.
Classification of machine registers
Machine registers (type
mreg in module
Locations) are divided in
the following groups:
-
Temporaries used for spilling, reloading, and parallel move operations.
-
Allocatable registers, that can be assigned to RTL pseudo-registers.
These are further divided into:
-
Callee-save registers, whose value is preserved across a function call.
-
Caller-save registers that can be modified during a function call.
We follow the ARM application binary interface (EABI) in our choice
of callee- and caller-save registers.
Definition is_callee_save (
r:
mreg):
bool :=
match r with
|
R0 |
R1 |
R2 |
R3 |
R12 =>
false
|
R4 |
R5 |
R6 |
R7 |
R8 |
R9 |
R10 |
R11 =>
true
|
F0 |
F1 |
F2 |
F3 |
F4 |
F5 |
F6 |
F7 =>
false
|
F8 |
F9 |
F10 |
F11 |
F12 |
F13 |
F14 |
F15 =>
true
end.
Definition int_caller_save_regs :=
R0 ::
R1 ::
R2 ::
R3 ::
R12 ::
nil.
Definition float_caller_save_regs :=
F0 ::
F1 ::
F2 ::
F3 ::
F4 ::
F5 ::
F6 ::
F7 ::
nil.
Definition int_callee_save_regs :=
R4 ::
R5 ::
R6 ::
R7 ::
R8 ::
R9 ::
R10 ::
R11 ::
nil.
Definition float_callee_save_regs :=
F8 ::
F9 ::
F10 ::
F11 ::
F12 ::
F13 ::
F14 ::
F15 ::
nil.
Definition destroyed_at_call :=
List.filter (
fun r =>
negb (
is_callee_save r))
all_mregs.
Definition dummy_int_reg :=
R0.
(* Used in Coloring. *)
Definition dummy_float_reg :=
F0.
(* Used in Coloring. *)
Definition callee_save_type :=
mreg_type.
Definition is_float_reg (
r:
mreg):
bool :=
match r with
|
R0 |
R1 |
R2 |
R3
|
R4 |
R5 |
R6 |
R7
|
R8 |
R9 |
R10 |
R11 |
R12 =>
false
|
F0 |
F1 |
F2 |
F3 |
F4 |
F5 |
F6 |
F7
|
F8 |
F9 |
F10 |
F11 |
F12 |
F13 |
F14 |
F15 =>
true
end.
How to use registers for register allocation.
In classic ARM mode, we favor the use of caller-save registers,
using callee-save registers only when no caller-save is available.
In Thumb mode, we additionally favor integer registers R0 to R3 over the other
integer registers, as they lead to more compact instruction encodings.
When -fno-callee-save-allocation is in effect (i.e.
Compopts.callee_save_allocation returns false), callee-save
registers are removed from the remaining_* lists, so the IRC
allocator falls through to a stack spill instead of allocating one.
Record alloc_regs :=
mk_alloc_regs {
preferred_int_regs:
list mreg;
remaining_int_regs:
list mreg;
preferred_float_regs:
list mreg;
remaining_float_regs:
list mreg
}.
Definition drop_callee_save_if_disabled (
l:
list mreg) :
list mreg :=
if Compopts.callee_save_allocation tt
then l
else List.filter (
fun r =>
negb (
is_callee_save r))
l.
Definition allocatable_registers (_ :
unit) :=
if thumb tt then
{|
preferred_int_regs :=
R0 ::
R1 ::
R2 ::
R3 ::
nil;
remaining_int_regs :=
drop_callee_save_if_disabled
(
R4 ::
R5 ::
R6 ::
R7 ::
R12 ::
R8 ::
R9 ::
R10 ::
R11 ::
nil);
preferred_float_regs :=
float_caller_save_regs;
remaining_float_regs :=
drop_callee_save_if_disabled float_callee_save_regs |}
else
{|
preferred_int_regs :=
int_caller_save_regs;
remaining_int_regs :=
drop_callee_save_if_disabled int_callee_save_regs;
preferred_float_regs :=
float_caller_save_regs;
remaining_float_regs :=
drop_callee_save_if_disabled float_callee_save_regs |}.
Function calling conventions
The functions in this section determine the locations (machine registers
and stack slots) used to communicate arguments and results between the
caller and the callee during function calls. These locations are functions
of the signature of the function and of the call instruction.
Agreement between the caller and the callee on the locations to use
is guaranteed by our dynamic semantics for Cminor and RTL, which demand
that the signature of the call instruction is identical to that of the
called function.
Calling conventions are largely arbitrary: they must respect the properties
proved in this section (such as no overlapping between the locations
of function arguments), but this leaves much liberty in choosing actual
locations.
Location of function result
The result value of a function is passed back to the caller in
registers R0 or F0 or R0,R1, depending on the type of the
returned value. We treat a function without result as a function
with one integer result.
For the "softfloat" convention, results of FP types should be passed
in R0 or R0,R1. This doesn't fit the CompCert register model,
so we have code in arm/TargetPrinter.ml that inserts additional moves
to/from F0.
Concerning endianness for 64bit values in register pairs, the contents
of the registers is as if the value had been loaded from memory
representation with a single LDM instruction.
Definition loc_result (
s:
signature) :
rpair mreg :=
match proj_sig_res s with
|
Tint |
Tany32 =>
One R0
|
Tfloat |
Tsingle |
Tany64 =>
One F0
|
Tlong =>
if Archi.big_endian
then Twolong R0 R1
else Twolong R1 R0
end.
The result registers have types compatible with that given in the signature.
Lemma loc_result_type:
forall sig,
subtype (
proj_sig_res sig) (
typ_rpair mreg_type (
loc_result sig)) =
true.
Proof.
The result locations are caller-save registers
Lemma loc_result_caller_save:
forall (
s:
signature),
forall_rpair (
fun r =>
is_callee_save r =
false) (
loc_result s).
Proof.
If the result is in a pair of registers, those registers are distinct and have type Tint at least.
Lemma loc_result_pair:
forall sg,
match loc_result sg with
|
One _ =>
True
|
Twolong r1 r2 =>
r1 <>
r2 /\
proj_sig_res sg =
Tlong
/\
subtype Tint (
mreg_type r1) =
true /\
subtype Tint (
mreg_type r2) =
true
/\
Archi.ptr64 =
false
end.
Proof.
The location of the result depends only on the result part of the signature
Lemma loc_result_exten:
forall s1 s2,
s1.(
sig_res) =
s2.(
sig_res) ->
loc_result s1 =
loc_result s2.
Proof.
Location of function arguments
For the "hardfloat" configuration, we use the following calling conventions,
adapted from the ARM EABI-HF:
-
The first 4 integer arguments are passed in registers R0 to R3.
-
The first 2 long integer arguments are passed in an aligned pair of
two integer registers.
-
The first 8 single- and double-precision float arguments are passed
in registers F0...F7
-
Extra arguments are passed on the stack, in Outgoing slots, consecutively
assigned (1 word for an integer or single argument, 2 words for a float
or a long), starting at word offset 0.
This convention is not quite that of the ARM EABI-HF, whereas single float
arguments are passed in 32-bit float registers. Unfortunately,
this does not fit the data model of CompCert. In
PrintAsm.ml
we insert additional code around function calls that moves
data appropriately.
Definition int_param_regs :=
R0 ::
R1 ::
R2 ::
R3 ::
nil.
Definition float_param_regs :=
F0 ::
F1 ::
F2 ::
F3 ::
F4 ::
F5 ::
F6 ::
F7 ::
nil.
Definition ireg_param (
n:
Z) :
mreg :=
match list_nth_z int_param_regs n with Some r =>
r |
None =>
R0 end.
Definition freg_param (
n:
Z) :
mreg :=
match list_nth_z float_param_regs n with Some r =>
r |
None =>
F0 end.
Fixpoint loc_arguments_hf
(
tyl:
list typ) (
ir fr ofs:
Z) {
struct tyl} :
list (
rpair loc) :=
match tyl with
|
nil =>
nil
| (
Tint |
Tany32)
as ty ::
tys =>
if zlt ir 4
then One (
R (
ireg_param ir)) ::
loc_arguments_hf tys (
ir + 1)
fr ofs
else One (
S Outgoing ofs ty) ::
loc_arguments_hf tys ir fr (
ofs + 1)
| (
Tfloat |
Tany64)
as ty ::
tys =>
if zlt fr 8
then One (
R (
freg_param fr)) ::
loc_arguments_hf tys ir (
fr + 1)
ofs
else let ofs :=
align ofs 2
in
One (
S Outgoing ofs ty) ::
loc_arguments_hf tys ir fr (
ofs + 2)
|
Tsingle ::
tys =>
if zlt fr 8
then One (
R (
freg_param fr)) ::
loc_arguments_hf tys ir (
fr + 1)
ofs
else One (
S Outgoing ofs Tsingle) ::
loc_arguments_hf tys ir fr (
ofs + 1)
|
Tlong ::
tys =>
let ohi :=
if Archi.big_endian then 0
else 1
in
let olo :=
if Archi.big_endian then 1
else 0
in
let ir :=
align ir 2
in
if zlt ir 4
then Twolong (
R (
ireg_param (
ir +
ohi))) (
R (
ireg_param (
ir +
olo))) ::
loc_arguments_hf tys (
ir + 2)
fr ofs
else let ofs :=
align ofs 2
in
Twolong (
S Outgoing (
ofs +
ohi)
Tint) (
S Outgoing (
ofs +
olo)
Tint) ::
loc_arguments_hf tys ir fr (
ofs + 2)
end.
For the "softfloat" configuration, as well as for variable-argument functions
in the "hardfloat" configuration, we use the default ARM EABI (not HF)
calling conventions:
-
The first 4 integer arguments are passed in registers R0 to R3.
-
The first 2 long integer arguments are passed in an aligned pair of
two integer registers.
-
The first 2 double-precision float arguments are passed in F0 or F2
-
The first 4 single-precision float arguments are passed in F0...F3
-
Integer arguments and float arguments are kept in sync so that
they can all be mapped back to R0...R3 in PrintAsm.ml.
-
Extra arguments are passed on the stack, in Outgoing slots, consecutively
assigned (1 word for an integer or single argument, 2 words for a float
or a long), starting at word offset 0.
This convention is not quite that of the ARM EABI, whereas every float
argument are passed in one or two integer registers. Unfortunately,
this does not fit the data model of CompCert. In
PrintAsm.ml
we insert additional code around function calls and returns that moves
data appropriately.
Fixpoint loc_arguments_sf
(
tyl:
list typ) (
ofs:
Z) {
struct tyl} :
list (
rpair loc) :=
match tyl with
|
nil =>
nil
| (
Tint|
Tany32)
as ty ::
tys =>
One (
if zlt ofs 0
then R (
ireg_param (
ofs + 4))
else S Outgoing ofs ty)
::
loc_arguments_sf tys (
ofs + 1)
| (
Tfloat|
Tany64)
as ty ::
tys =>
let ofs :=
align ofs 2
in
One (
if zlt ofs 0
then R (
freg_param (
ofs + 4))
else S Outgoing ofs ty)
::
loc_arguments_sf tys (
ofs + 2)
|
Tsingle ::
tys =>
One (
if zlt ofs 0
then R (
freg_param (
ofs + 4))
else S Outgoing ofs Tsingle)
::
loc_arguments_sf tys (
ofs + 1)
|
Tlong ::
tys =>
let ohi :=
if Archi.big_endian then 0
else 1
in
let olo :=
if Archi.big_endian then 1
else 0
in
let ofs :=
align ofs 2
in
Twolong (
if zlt ofs 0
then R (
ireg_param (
ofs+
ohi+4))
else S Outgoing (
ofs+
ohi)
Tint)
(
if zlt ofs 0
then R (
ireg_param (
ofs+
olo+4))
else S Outgoing (
ofs+
olo)
Tint)
::
loc_arguments_sf tys (
ofs + 2)
end.
loc_arguments s returns the list of locations where to store arguments
when calling a function with signature s.
Definition loc_arguments (
s:
signature) :
list (
rpair loc) :=
match Archi.abi with
|
Archi.Softfloat =>
loc_arguments_sf (
proj_sig_args s) (-4)
|
Archi.Hardfloat =>
if s.(
sig_cc).(
cc_vararg)
then loc_arguments_sf (
proj_sig_args s) (-4)
else loc_arguments_hf (
proj_sig_args s) 0 0 0
end.
Argument locations are either non-temporary registers or Outgoing
stack slots at nonnegative offsets.
Definition loc_argument_acceptable (
l:
loc) :
Prop :=
match l with
|
R r =>
is_callee_save r =
false
|
S Outgoing ofs ty =>
ofs >= 0 /\ (
typealign ty |
ofs)
| _ =>
False
end.
Definition loc_argument_charact (
ofs:
Z) (
l:
loc) :
Prop :=
match l with
|
R r =>
is_callee_save r =
false
|
S Outgoing ofs' ty =>
ofs' >=
ofs /\
typealign ty = 1
| _ =>
False
end.
Remark ireg_param_caller_save:
forall n,
is_callee_save (
ireg_param n) =
false.
Proof.
Remark freg_param_caller_save:
forall n,
is_callee_save (
freg_param n) =
false.
Proof.
Remark loc_arguments_hf_charact:
forall tyl ir fr ofs p,
In p (
loc_arguments_hf tyl ir fr ofs) ->
forall_rpair (
loc_argument_charact ofs)
p.
Proof.
Remark loc_arguments_sf_charact:
forall tyl ofs p,
In p (
loc_arguments_sf tyl ofs) ->
forall_rpair (
loc_argument_charact (
Z.max 0
ofs))
p.
Proof.
Lemma loc_arguments_acceptable:
forall (
s:
signature) (
p:
rpair loc),
In p (
loc_arguments s) ->
forall_rpair loc_argument_acceptable p.
Proof.
Global Hint Resolve loc_arguments_acceptable:
locs.
Lemma loc_arguments_main:
loc_arguments signature_main =
nil.
Proof.
Normalization of function results and parameters
No normalization needed.
Definition return_value_needs_normalization (
t:
xtype) :=
false.
Definition parameter_needs_normalization (
t:
xtype) :=
false.