To ORGH or not to ORGH?

cgracey · 2019-04-12 13:50

LOC is really only useful in hub-exec code, unless you're just exploiting it to load 20-bit constants. That it assembles under ORG is there for the possibility, but it's not practical. LOC was intended only as a mechanism to allow hub code to get addresses of its internal data structures.

evanh · 2019-04-12 15:51

cgracey wrote: »

Dave, the PC is known at runtime, so LOC can give you an absolute address when your code only knows the relative offset from the PC. By writing hub-exec code which uses relative branches and relative LOC's, you can have an entire PASM application which can sit at any byte offset in hub memory in which it fits, and it will execute without any external pointers being required. You only need to JMP to the start of the code.

Emphasising the PA register content is always resolved to absolute:

loc  pa, #label testing
=========================
Cogexec
   PC    Op-code  PA-data  Cog-dref Hub-dref
0000000b fe9ffffc 00000008 0badf00d 00000000
0000000f fe900100 00000110 ff000002 deadbeef
00000011 fe8004a8 000004a8 00000000 cafef00d

Hubexec
   PC    Op-code  PA-data  Cog-dref Hub-dref
00000404 fe800008 00000008 0badf00d 00000000
00000414 fe800110 00000110 ff000002 deadbeef
0000041c fe900088 000004a8 00000000 cafef00d

evanh · 2019-04-12 15:58

Note: The PC there is the address of the LOC opcode. The program counter value required for PC-relative calculation is at the subsequent longword.

Dave Hein · 2019-04-12 16:21

OK, got it. LOC loads the absolute address independent of whether the address is encoded as absolute or relative.

Chip, are you going to change PNut to match the changes in fastspin for the LOC instruction?

Rayman · 2019-04-12 17:38

I thought LOC was just there to save one instruction...

potatohead · 2019-04-12 17:40

That intent was seen in Moto chips, 6809, 68k, and the instruction was LEA.

Load Effective Address.

For PC relative, you specified that address mode.

LEA would deliver the effective address 9f any combination of addressing modes into a register.

LEA PC, X

Address = current PC at time of execute, plus X.

Relocatable, reentrant code was straightforward to write. Drop it anywhere, run it, move it, everything just worked.

P2 benefits from this thinking.

Dave Hein · 2019-04-12 18:54

Now that I understand how LOC works I can see that it would be useful for position-independent-code. Unfortunately, it can only write to PA, PB, PTRA or PTRB. It would be nice if it could write to a larger range of cog memory, but that would have gobbled up a lot of instruction space. Maybe in the P3 there could be a LOC instruction with a smaller hub address range, and a larger range of destinations.

p2gcc generates code that loads addresses into one of the 15 general purpose registers, such as "mov r3, ##variable". A C source file is compiled into an object file starting at address $400. It must then be relocated to another address by the linker. The linker doesn't have to worry about the calls and jumps since they use relative address, but it does have to adjust variable addresses. With LOC the variable addresses could be relative also.

I would have to move the address from the LOC destination register to a general purpose register, so the generated code would look like this:

    loc pb, #variable
    mov r3, pb

However, this would take the same number of instructions as I currently use with the with the long source value.

evanh · 2019-04-12 23:05

Dave Hein wrote: »

Chip, are you going to change PNut to match the changes in fastspin for the LOC instruction?

You can forge ahead on the two buggy cases as they were unusable for everyone.

The one case that could be improved is hubexec referencing hubram data below $400. Currently, this is getting encoded as absolute when it could be encoded as PC-relative.

cgracey · 2019-04-13 00:20

evanh wrote: »

Dave Hein wrote: »

Chip, are you going to change PNut to match the changes in fastspin for the LOC instruction?

You can forge ahead on the two buggy cases as they were unusable for everyone.

The one case that could be improved is hubexec referencing hubram data below $400. Currently, this is getting encoded as absolute when it could be encoded as PC-relative.

Yes, I agree.

evanh · 2019-04-13 01:31

Looking at Fastspin, even though it resolves fine, there is more PC-relative cases than expected. And I think this carries over to branching instructions too.

Amusingly, all cases could default to PC-relative and you wouldn't know they were unless examining the machine code.

 loc  pa, #label testing  (fastspin v3.9.25)
=========================
 cogexec
   PC    Op-code  PA-data    Cog-dref Hub-dref LUT-dref
0000000f fe9ffff8 00000008   11111111 00000000 fa00b7f6
00000011 fe9000fe 00000110   fd60062a 22222222 00000000
00000013 fe800434 00000434   00000000 33333333 f607ee0a
00000015 fe90038a 000003a0   2e202e20 00000000 44444444

 hubexec
   PC    Op-code  PA-data    Cog-dref Hub-dref LUT-dref
00000410 fe800008 00000008   11111111 00000000 fa00b7f6
00000418 fe800110 00000110   fd60062a 22222222 00000000
00000420 fe900010 00000434   00000000 33333333 f607ee0a
00000428 fe8003a0 000003a0   2e202e20 00000000 44444444

 lutexec
   PC    Op-code  PA-data    Cog-dref Hub-dref LUT-dref
00000280 fe9ffd87 00000008   11111111 00000000 fa00b7f6
00000282 fe9ffe8d 00000110   fd60062a 22222222 00000000
00000284 fe800434 00000434   00000000 33333333 f607ee0a
00000286 fe900119 000003a0   2e202e20 00000000 44444444

EDIT: Updated with easier to read memory data.

Dave Hein · 2019-04-13 02:41

I added the fix to p2asm for LOC relative addresses in the cog mode.

evanh · 2019-04-13 02:50

https://github.com/davehein/p2gcc

Yep, that matches fastspin result.

 loc  pa, #label testing  (p2asm v0.015)
=========================
 cogexec
   PC    Op-code  PA-data    Cog-dref Hub-dref LUT-dref
0000000f fe9ffff8 00000008   11111111 00000000 fa00b7f6
00000011 fe9000fe 00000110   fd60062a 22222222 00000000
00000013 fe800434 00000434   00000000 33333333 f607ee0a
00000015 fe90038a 000003a0   2e202e20 00000000 44444444

 hubexec
   PC    Op-code  PA-data    Cog-dref Hub-dref LUT-dref
00000410 fe800008 00000008   11111111 00000000 fa00b7f6
00000418 fe800110 00000110   fd60062a 22222222 00000000
00000420 fe900010 00000434   00000000 33333333 f607ee0a
00000428 fe8003a0 000003a0   2e202e20 00000000 44444444

 lutexec
   PC    Op-code  PA-data    Cog-dref Hub-dref LUT-dref
00000280 fe9ffd87 00000008   11111111 00000000 fa00b7f6
00000282 fe9ffe8d 00000110   fd60062a 22222222 00000000
00000284 fe800434 00000434   00000000 33333333 f607ee0a
00000286 fe900119 000003a0   2e202e20 00000000 44444444

evanh · 2019-04-13 03:07

Here's pnut:

 loc  pa, #label testing  (pnut v32i)
=========================
 cogexec
   PC    Op-code  PA-data    Cog-dref Hub-dref LUT-dref
0000000f fe9fffe0 000ffff0   47ee18aa 00000000 2884fd76
00000011 fe9003f8 0000040a   fdb008fc 024afe80 f100b25b
00000013 fe800434 00000434   00000000 33333333 f607ee0a
00000015 fe900e28 00000e3e   00000000 1d1594ee c607ee31

 hubexec
   PC    Op-code  PA-data    Cog-dref Hub-dref LUT-dref
00000410 fe800008 00000008   11111111 00000000 fa00b7f6
00000418 fe800110 00000110   fd60062a 22222222 00000000
00000420 fe900010 00000434   00000000 33333333 f607ee0a
00000428 fe8003a0 000003a0   2e202e20 00000000 44444444

 lutexec
   PC    Op-code  PA-data    Cog-dref Hub-dref LUT-dref
00000280 fe9ff61c 000ff89d   00000000 59545544 00000000
00000282 fe9ffa34 000ffcb7   00000000 223a81d7 00000000
00000284 fe800434 00000434   00000000 33333333 f607ee0a
00000286 fe900464 000006eb   00000000 3d20736e 00000000

jmg · 2019-04-13 03:39

evanh wrote: »

Yep, that matches fastspin result.
...
Here's pnut:

So that means p2asm and fastspin are now considered 'correct' and pnut is still to be fixed ?

evanh · 2019-04-13 04:07

Yes, in terms of LOC now works in all cases.

Although I think LOC and all branch long-immediates could all be encoded to PC-relative in all #label and #@label cases. Absolute encoding of those instructions doesn't seem to have a specific purpose. Ah, absolute does have a purpose when an actual piece of PIC wants a hard coded specific address. Won't be common I don't think.

ersmith · 2019-04-13 10:56

evanh wrote: »

Although I think LOC and all branch long-immediates could all be encoded to PC-relative in all #label and #@label cases.

LOC probably could, in theory, be encoded relative in all cases. But branches are handled differently from LOC and if I recall correctly doing a relative branch that crosses HUB/COG/LUT domains is forbidden by the hardware. (There's internal state about how PCs are handled in relative branches, and I wouldn't want to mess with that.)

Cluso99 · 2019-04-13 11:27

There are times when LOC should be absolute and not relative. Examples are using hub tables, and we will probably have some fixed areas for mailboxes.
I would rather always declare absolute or relative. The programmer should have the right to use what they require.

evanh · 2019-04-13 12:41

ersmith wrote: »

evanh wrote: »

Although I think LOC and all branch long-immediates could all be encoded to PC-relative in all #label and #@label cases.

LOC probably could, in theory, be encoded relative in all cases. But branches are handled differently from LOC and if I recall correctly doing a relative branch that crosses HUB/COG/LUT domains is forbidden by the hardware. (There's internal state about how PCs are handled in relative branches, and I wouldn't want to mess with that.)

Challenge accepted and met!

I replaced

		jmp	#loctesthub

at the end of my cogram code, cogram address $17, with

		long	$fd900fa0

This successfully jumps to hub address $400 ($1000 at byte scale) and runs the second part of the LOC test code placed there. Calculated as offset = $400 * 4 - ($17 + 1) * 4

It has some peculiar effect on the effective memory map. Still have to get my head around the implication ...

evanh · 2019-04-13 12:43

Or you could write it as offset = ($400 - $17 - 1) * 4

EDIT: New version

 		long	$fd900000 + (($400-$-1)*4)&$fffff

EDIT: And to relative jump to cogexec from hubexec

		long	$fd900000 + (9-$-4)&$fffff

Cog address 9 is start of my LOC testing code.

evanh · 2019-04-13 12:45

Cluso99 wrote: »

I would rather always declare absolute or relative. The programmer should have the right to use what they require.

#\ is always there for forcing absolute.

To ORGH or not to ORGH?

Comments