Review S: ARM stacks & subroutines

S1: [1] [2] [3] [4] // S2: [1] // S3: [1] [2] [3] [4] [5] [6] [7] // S4: [1] [2] [3]

Problem S1.1

An ARM assembly language subroutine can be called from multiple locations within a program. From wherever it is called, the subroutine should return back to the instruction following the one that entered the subroutine. How does an ARM subroutine determine where to return after completing its computation? Your answer should describe where it can find the return location and how the value is normally placed there.

To enter a subroutine, ARM code uses the BL instruction, which places the address of the instruction after BL into the “link register,” R14. Upon finishing, then, the subroutine knows that it can look into R14 to find which instruction should be executed next.

(In practice, the subroutine will often place R14 on the stack, since R14 would change whenever the subroutine calls any other subroutines.)

Problem S1.4

In the following ARM assembly language subroutine using the ARM Procedure Call Standard, we take a value n found in R0 and raise it to the tenth power (n¹⁰), placing the result back into R0. The subroutine accomplishes this by calling a subroutine pow, which takes the value in R0 and raises it to the power found in R1, placing the value of a^b into R0.

toTenth MOV R1, #10

        BL pow

        MOV PC, LR

Even though pow abides by the ARM Procedure Call Standard and works correctly, we find that this toTenth subroutine never returns. Explain why precisely this happens, and describe what you would do to repair it (without necessarily resorting to the precise ARM assembly code you would add or change).

When we execute “BL pow”, the address of the following instruction (“MOV PC, LR”) is placed into the link register. Once pow returns, the link register still contains the address of that instruction, so when we execute “MOV PC, LR”, we are simply resetting PC to point again to that same MOV instruction, and it will end up repeating that single instruction indefinitely.

To repair it, we would need to store LR somehow (probably on the program stack) before entering the pow subroutine. In returning, you would need to ensure that this stored value of LR is placed into PC.

Problem S3.4

Suppose a subroutine gcd, using the ARM Procedure Call Standard, already exists elsewhere in a program to compute the greatest common denominator of two integer parameters. We want to use this to compose a subroutine gcd3 that computes the greatest common denominator of \emph{three} integer parameters. Describe in English what goes wrong with the following solution, and show how to repair the assembly language so that it works correctly.

gcd3 BL gcd     ; R0 = gcd(param0, param1)

     MOV R1, R2

     BL gcd     ; R0 = gcd(R0, param2)

     MOV PC, LR ; return R0

In entering gcd3, R2 and LR will have values that the subroutine later uses: The second line needs R2, while the last one needs LR. However, the first BL instruction will lead to both values being lost: The BL instruction will itself overwrite LR as it enters gcd to hold the address of gcd3's second instruction (so gcd knows where to resume), while the gcd subroutine is allowed to change all caller-save registers, including R2. Consequently, the second and fourth lines will use the wrong values for R2 and LR.

The solution is to push R4 and LR onto the stack upon entering the subroutine, using the callee-save register R4 to remember the third parameter over the subroutine call, and then restoring R4 and LR just before returning.

gcd3 STMDB SP!, {R4, LR}

     MOV R4, R2

     BL gcd     ; R0 = gcd(param0, param1)

     MOV R1, R4

     BL gcd     ; R0 = gcd(R0, param2)

     LDMIA SP!, {R4, LR}

     MOV PC, LR ; return R0

Problem S3.6

Assuming we already have an ARM subroutine named sin, translate the following C function into an ARM subroutine following the ARM Procedure Call Standard.

int addsin(int a, int b) {

  int fa = sin(a);

  int fb = sin(b);

  return fa + fb;

}

addsin STMDB SP!, {R4, R5, LR}

       MOV R4, R1

       BL sin

       MOV R5, R0

       MOV R0, R4

       BL sin

       ADD R0, R0, R5

       LDMIA SP!, {R4, R5, PC}

Problem S3.7

In the following ARM assembly code using the ARM Procedure Call Standard, we wish to compute n¹⁰ + n⁹ + n⁸ + … + n¹, where n is the value found in R0 upon entering the addPows subroutine. However, it doesn't work because the call to pow ends up changing the registers R0 through R3, and this code depends on R1, R2, and R3 remaining unchanged.

What specific ARM assembly code would you insert or change so that this works, while still conforming to the ARM Procedure Call Standard?

addPows MOV R1, #10

        MOV R2, #0

        MOV R3, R0

again   MOV R0, R3

        BL pow

        ADD R2, R2, R0

        SUBS R1, R1, #1

        BNE again

        MOV R0, R2

        MOV PC, LR

We need to stash the values of these registers so that they can be recovered following completion of the subroutine call. You can store the register values on the program stack by inserting “STMDB SP!, {R1-R3}” preceding the “BL pow” instruction, and you can restore their values by inserting “LDMIA SP!, {R1-R3}” following the BL instruction.

Problem S4.1

Complete the partial fragment below for setting the last value of a linked list to 0, where R4 is initially the address of the list's first node. Each node holds its integer data in the first four bytes, and in the next four bytes is the address of the node following it in the list. The last node has “0” marked as the address of the following node.

loop   ; load address of node following R4 into R5; i.e., r5 = r4->next



       TST R5, R5

       MOVNE R4, R5

       BNE loop

       MOV R5, #0

       ; change value in node R4 to be R5; i.e., r4->value = r5

The short solution: LDR R5, [R4, #4] and then STR R5, [R4].

The first instruction might be replaced by two lines:

       ADD R0, R4, #4

       LDR R5, [R0]

Problem S4.2

Complete the partial fragment below for finding the sum of the integer values of a linked list, where R0 is initially the address of the list's first node. Each node holds its integer value in the first four bytes, and in the next four bytes is the address of the node following it in the list. The list's last node is marked by having 0 as the address of its following node.

sumList   MOV R1, #0           ; R1 accumulates sum of nodes seen

sumLoop                        ; TODO: R1 += R0->value





sumNext                        ; TODO: R0 = R0->next





          TST R0, R0           ; if R0 isn't 0, repeat loop for following node

          BNE sumLoop

          ; R1 will now hold sum of list's integer values

sumList   MOV R1, #0           ; R1 should be sum after completing loop

sumLoop   LDR R2, [R0]         ; R1 += R0->value

          ADD R1, R1, R2

sumNext   LDR R0, [R0, #4]     ; R0 = R0->next

          TST R0, R0           ; if R0 isn't 0, repeat loop for following node

          BNE sumLoop

Review S: ARM stacks & subroutines

Problem S1.1

Problem S1.2

Problem S1.3

Problem S1.4

Problem S2.1

Problem S3.1

Problem S3.2

Problem S3.3

Problem S3.4

Problem S3.5

Problem S3.6

Problem S3.7

Problem S4.1

Problem S4.2

Problem S4.3