_COGMEM Automatic Variables

Kye · 2014-12-28 20:57

Is it possible to have lots of _COGMEM automatic variables? I tried defining a bunch of _COGMEM variables outside of a _NAKED main function and I see that code was generated to store to these variables in the asm output. However, it looks like the compiler just stores to the variables and then keeps using its own local or stack variable for computation. My goal was to have it use the _COGMEM variable instead of using the stack.

Can you extend the register space using _COGMEM variables or does the compiler only benefit from those variables when they have initialized data? The compiler is wasting a huge amount of time loading and storing data to the stack versus just having a copy of the variable stored in cogram.

ersmith · 2014-12-29 09:46

You can have as many COGMEM variables as will fit, but they aren't automatic variables (in either the technical C sense of variables declared local to a function, or in the sense of variables used automatically by the compiler). You have to mention them explicitly in expressions in order to make the compiler use them.

If any kind of optimization is enabled then the compiler will keep variables in its own registers (r0-r15) as much as possible, and only spill them out to the stack when it runs out of room. The stack only gets used if you forget to include -O (or -Os, or some other -O variant) on the command line, or if you have a really huge number of local variables. In the latter case it might be worth splitting your function up into smaller, simpler pieces.

Kye · 2014-12-29 14:02

With -Os on cogmem variables are not used at all. The compiler will store a value in them once on init and then keep the variable in a register. I have a lot of static variables in a function I have to deal with and I was trying to use cogmem to keep stuff from being put on the stack. Breaking the code into functions won't help.

Kye · 2014-12-29 16:48

Okay, I've figured out something that works about as optimal as you can get.

1. GCC refuses to uses cogmem variables equivalently as registers and will try to cache everything in its stack. This makes cogmem variables useless. The compiler will do one store to them for init purposes and that's it.
2. If GCC runs out of register space it pushes registers onto the stack. This will waste all your code space in cogmode if this happens due to all the register thrashing going on. Instead of storing a variable back in cogmem it will push it to the stack.
3. If you declare the cogmem variable as volatile then gcc can't cache it in its registers/overflow_stack and will have to move the value in and out of registers. While this method uses up two more moves than normal it saves the pushes and pops to and from the stack. I suppose this is the best you can get.

The https://code.google.com/p/propgcc/wiki/PropGccInDepth article says not to declare your cogmem variable as volatile. You must instead do the opposite for performance/space gain.

ersmith · 2014-12-29 19:39

Kye wrote: »

With -Os on cogmem variables are not used at all. The compiler will store a value in them once on init and then keep the variable in a register. I have a lot of static variables in a function I have to deal with and I was trying to use cogmem to keep stuff from being put on the stack. Breaking the code into functions won't help.

Could you post some sample code? That's not the behavior I'm seeing, but I'm using pretty simple functions. For example:

#include <propeller.h>

_COGMEM int scale;
_COGMEM int offset;

int myfunc(int x)
{
    return (x<<scale) + offset;
}

produces:

	.text
	.balign	4
	.global	_myfunc
_myfunc
	shl	r0, _scale
	add	r0, _offset
	mov	pc,lr
	.section .kernel,"aw",@progbits
_offset
	long	0
_scale
	long	0

which looks correct to me, and there's no stack or compiler registers involved (other than the parameter x).

ersmith · 2014-12-29 19:47

Ah, I think I see the problem -- gcc thinks that COGMEM variables are more expensive to access than registers, so while it's happy to use them for single operations, if they appear inside a loop then it wants to use registers instead. I'll see if I can fix that.

Kye · 2014-12-30 12:08

Still working on the code... Will post soon.

Kye · 2014-12-30 18:43

Attached is my code. I compile with "propeller-elf-gcc.exe -Os -ffunction-sections -fdata-sections -m32bit-doubles -mfcache -std=c99 -mcog -r -xc"

Right now, the code should be able to complete the inner loop in less than 500 clock cycles (I counted ~470). I needed this code to be that fast so that it doesn't miss quadrature encoder samples. If you make the cogmem variables non volatile you'll see the horror with all the stack pushes and pops.

ersmith · 2015-01-02 17:51

Thanks for posting the example, Kye. Using "volatile" with _COGMEM is a good work-around for now. Fixing gcc to treat _COGMEM as equal to a register looks like it will take a while. For simple things everything is fine, but once the code gets complicated or involves loops the optimizer tries to keep the values in registers, and that's turning out to be hard to change :-(.

_COGMEM Automatic Variables

Comments