Question for Chip re ROM code

Cluso99 · 2014-08-10 21:38

Chip,
Would it be possible for you to post the ROM data files for the Font, Sin, Log, etc. You have already posted the interpreter, booter and runner thanks.

How did you build them (instructions)?

I am trying to rebuild the whole ROM file without scrambling and then remove the scrambling from the FPGA source.
I presume I change this

wire [31:0] ramq    = !rd ? mem_q : {mem_q[03], mem_q[07], mem_q[21], mem_q[12],    // unscramble rom data if cog loading
                                      mem_q[06], mem_q[19], mem_q[04], mem_q[17],
                                     mem_q[20], mem_q[15], mem_q[08], mem_q[11],
                                     mem_q[00], mem_q[14], mem_q[30], mem_q[01],
                                     mem_q[23], mem_q[31], mem_q[16], mem_q[05],
                                     mem_q[09], mem_q[18], mem_q[25], mem_q[02],
                                     mem_q[28], mem_q[22], mem_q[13], mem_q[27],
                                     mem_q[29], mem_q[24], mem_q[26], mem_q[10]};

to this...

wire [31:0] ramq    = !rd ? mem_q[31:0];

Then I can start to change the booter and interpreter.
Thanks Chip

cgracey · 2014-08-10 21:56

There is no source data for the sin and log tables. They were created from FOR loops in Delphi, running through a range with some scale applied. I could post that code tomorrow.

The font, though, is in a simple graphics file. Some Delphi code assembles the binary data from the image. I can post those things, too.

That line would become:

wire [31:0] ramq = mem_q[31:0];

The ? ternary operator picks the first or second option, separated by a colon, based on whether the value preceding the ? was true or false. In this case you wouldn't be making any decision, but just doing an assignment.

Cluso99 · 2014-08-10 22:31

Thanks Chip.
It would be just as easy for me to dump the ROM from a P1 to get everything from $8000..$F003.

How did you build the interpreter, booter and runner so they began at $F004 ? Did you just fill longs up to $F004 in PropTool?

Since the first option for the wire.... was just
wire [31:0] ramq = !rd ? mem_q : {mem_q[03],....
would this also be correct...
wire [31:0] ramq = mem_q;

cgracey · 2014-08-11 12:26

Cluso99 wrote: »

Thanks Chip.
It would be just as easy for me to dump the ROM from a P1 to get everything from $8000..$F003.

How did you build the interpreter, booter and runner so they began at $F004 ? Did you just fill longs up to $F004 in PropTool?

Since the first option for the wire.... was just
wire [31:0] ramq = !rd ? mem_q : {mem_q[03],....
would this also be correct...
wire [31:0] ramq = mem_q;

You need to put a 'PUB somename' at the top of the file and compile it (Alt-L). There you will find the data of interest nestled into the framework of a Spin image. You can do something like 'LONG $77777777' before and after the area of interest so you'll be able to easily see where it is.

Here is the code from Delphi that installed the various sections of ROM into the final image:

procedure TEditorForm.InstallBackground;
var
  i: integer;
begin
  for i := 0 to $FFFF do MemoryBuffer[i] := 0;
end;

procedure TEditorForm.InstallFont(Address: integer);
var
  FontBitmap: TBitmap;
  xx, yy, x, y: integer;
  Buff: cardinal;
begin
  FontBitmap := TBitmap.Create;
  FontBitmap.LoadFromFile('FONT.BMP');
  for yy := 0 to 7 do
  begin
      for xx := 0 to 15 do
      begin
          for y := 0 to 31 do
          begin
              for x := 0 to 15 do Buff := (Buff shl 2 +
                  Ord(FontBitmap.Canvas.Pixels[(xx shl 1 + 0) * 17 + 1 + 15 - x, yy * 33 + 1 + y] = clBlack) +
                  Ord(FontBitmap.Canvas.Pixels[(xx shl 1 + 1) * 17 + 1 + 15 - x, yy * 33 + 1 + y] = clBlack) shl 1);
              PIntegerArray(@MemoryBuffer[Address])[yy shl 9 + xx shl 5 + y] := Buff;
          end;
      end;
  end;
  FontBitmap.Free;
end;

procedure TEditorForm.InstallLogTable(Address: integer);
var
  i, s: integer;
  f: extended;
begin
  for i := 0 to $7FF do
  begin
    f := Round(Log2(1 + i / $800) * $10000);
    s := Trunc(f);
    PWordArray(@MemoryBuffer[Address])[i] := s;
  end;
end;

procedure TEditorForm.InstallAntiLogTable(Address: integer);
var
  i, s: integer;
  f: extended;
begin
  for i := 0 to $7FF do
  begin
    f := Round(Power(2, i / $800) * $10000);
    s := Trunc(f);
    PWordArray(@MemoryBuffer[Address])[i] := s;
  end;
end;

procedure TEditorForm.InstallSinTable(Address: integer);
var
  i, s: integer;
  f: extended;
begin
  for i := 0 to $800 do
  begin
    f := Round(Sin((i / $800) * (pi / 2)) * $FFFF);
    s := Trunc(f);
    PWordArray(@MemoryBuffer[Address])[i] := s;
  end;
end;

function TEditorForm.InstallInterpreter(Address: integer): integer;
var
  f: file;
  Size, i: Integer;
begin
  AssignFile(f, 'INTERPRETER.obj');
  try
    Reset(f, 1);
    Size := FileSize(f);
    BlockRead(f, MemoryBuffer2[Address], Size);
    Result := Size;
  finally
    CloseFile(f);
  end;
end;

procedure TEditorForm.InstallObj(Filename: string; Address: integer);
var
  f: file;
  Size, i: Integer;
begin
  AssignFile(f, Filename);
  try
    Reset(f, 1);
    Size := FileSize(f);
    BlockRead(f, MemoryBuffer2[0], Size);
  finally
    CloseFile(f);
    for i := Address to Size-1 do MemoryBuffer[i] := MemoryBuffer2[i];
  end;
end;

procedure TEditorForm.InstallRunner;
var
  f: file;
  Size: Integer;
begin
  AssignFile(f, 'RUNNER.obj');
  try
    Reset(f, 1);
    Size := FileSize(f);
    BlockRead(f, MemoryBuffer[$10000 - Size], Size);
  finally
    CloseFile(f);
  end;
end;

And here is the font.bmp file:

font.bmp

Cluso99 · 2014-08-11 13:44

Thanks Chip.
Last night I wrote a spin program to dump the bytes from Rom in DAT format for compilation. I also have the unscrambling working in this as well. I just need to find out which parts need to be unscrambled. I can work out what sections need to be unscrambled - interpreter and possibly booter and runner.

Licensing idea...
Could the forum rules be modified to be MIT licensed by default unless clearly stated otherwise as GPL3. An exception shall be verilog codefor the P1 and P2 which shall be GPL3 unless clearly stated otherwise.

cgracey · 2014-08-11 14:18

Cluso99 wrote: »

Thanks Chip.
Last night I wrote a spin program to dump the bytes from Rom in DAT format for compilation. I also have the unscrambling working in this as well. I just need to find out which parts need to be unscrambled. I can work out what sections need to be unscrambled - interpreter and possibly booter and runner.

Licensing idea...
Could the forum rules be modified to be MIT licensed by default unless clearly stated otherwise as GPL3. An exception shall be verilog codefor the P1 and P2 which shall be GPL3 unless clearly stated otherwise.

We're working on some policy change. Ken said it will be done soon.

About what to unscramble: It's just the interpreter and booter code. The runner code is Spin byte codes. Only program code loaded into a cog from ROM gets unscrambled. So, you could unscramble those sections (interpreter and booter) and then reduce that big ? ternary expression to just a straight assignment.

Cluso99 · 2014-08-11 14:37

cgracey wrote: »

We're working on some policy change. Ken said it will be done soon.

About what to unscramble: It's just the interpreter and booter code. The runner code is Spin byte codes. Only program code loaded into a cog from ROM gets unscrambled. So, you could unscramble those sections (interpreter and booter) and then reduce that big ? ternary expression to just a straight assignment.

Thanks Chip. That is what I have been aiming towards.

Since you have been changing the pnut compiler for P2, would it be easy for you to release a P1 version with hub ram limitation lifted to 64KB or more? I am not asking for the orgh etc, just to remove the 32KB hub limitation. I presume Roy will do this for openspin but I havent started to use it yet.

cgracey · 2014-08-11 14:45

Cluso99 wrote: »

Thanks Chip. That is what I have been aiming towards.

Since you have been changing the pnut compiler for P2, would it be easy for you to release a P1 version with hub ram limitation lifted to 64KB or more? I am not asking for the orgh etc, just to remove the 32KB hub limitation. I presume Roy will do this for openspin but I havent started to use it yet.

There are some Spin address limitations which limit it to 32KB. I can't remember where these limitations are, at the moment, but I know they are hard and fast. As soon as you try to exceed them, you open a Pandora's box. The trick may be designing some enhancements which don't break the current tools. Even the booter expects a Spin image. It's a showstopper to start changing it all around. In hindsight, it would have been good to design things so that they were more easily expanded. At the time, though, I figured it would be a miracle enough if I got it all working, at all, so I exploited the limitations to make things small. When I started working on the Prop2 Spin, there were many really deep changes.

Bob Lawrence (VE1RLL) · 2014-08-11 15:02

re; Delphi,

Delphi is great. :cool: I'm having a lot of fun with the latest Delphi version (XE6 ) and making apps for Android and iOS. Hopefully, I can do a few Propeller chip projects with a Android interface later.

Re:Here is the code from Delphi that installed the various sections of ROM into the final image:

Thanks! Chip. It's very interesting. I'll try it with XE6 sometime.

Cluso99 · 2014-08-11 19:22

cgracey wrote: »

There are some Spin address limitations which limit it to 32KB. I can't remember where these limitations are, at the moment, but I know they are hard and fast. As soon as you try to exceed them, you open a Pandora's box. The trick may be designing some enhancements which don't break the current tools. Even the booter expects a Spin image. It's a showstopper to start changing it all around. In hindsight, it would have been good to design things so that they were more easily expanded. At the time, though, I figured it would be a miracle enough if I got it all working, at all, so I exploited the limitations to make things small. When I started working on the Prop2 Spin, there were many really deep changes.

Hindsight is a wonderful thing.
I was amazed at how you managed to squeeze all that interpreter code into 496 longs.
I don't recall any 32KB limitations but thanks for the heads up.

I don't think any user code calls the booter, but user code does call the interpreter. Therefore, maintaining this entry point is important.
Anyway, once it is soft and unscrambled, we can play and experiment with it.

For a P1.5 version I would expect that the encryption from P2 would be nice, perhaps even mandatory. But that can be decided later.
More important for you to get back to P2 asap.

Another question while we have time...

I have been thinking that for hubexec, the existing JMPRET instruction could be absolute provided it is currently executing from cog, and relative is it is currently executing from hub (ie PC > $1FF which would be PC[x:9] <> 0 and an easy test). Obviously the compiler would need to know but this could be discussed later. Would need to think about the return address, but the goto address would be fine.
Any thoughts?

cgracey · 2014-08-11 19:49

Cluso99 wrote: »

I have been thinking that for hubexec, the existing JMPRET instruction could be absolute provided it is currently executing from cog, and relative is it is currently executing from hub (ie PC > $1FF which would be PC[x:9] <> 0 and an easy test). Obviously the compiler would need to know but this could be discussed later. Would need to think about the return address, but the goto address would be fine.
Any thoughts?

I was thinking about the same thing and I recall that there was some complication to it. Maybe it had to with branching between cog and hub code.

Cluso99 · 2014-08-11 21:25

cgracey wrote: »

I was thinking about the same thing and I recall that there was some complication to it. Maybe it had to with branching between cog and hub code.

IMHO we would still require an absolute JMP/CALL/RET.

I think I am going to play with this first. I will make one of my cogs 4KB instead of 2KB and test that out using the existing JMPRET as relative for extended cog addresses.

Cluso99 · 2014-08-12 02:28

Chip & others,

Here are the ROM files created from the P8X32A. I would be pleased if someone could do a cursory check with the Delphi files as I know nothing about Delphi.
My main concern is to ensure I have the bytes in the correct order. FYI I have checked the starting and ending bytes for the Interpreter, booter and runner.I have incuded the spin program that I used to dump the various ROM sections.

P8X32A_ROM_DUMP_20140812.zip

There are some holes in the ROM that has data and I don't know what they are for, if anything. I have posted those (unscrambled) in the code window.

''--------------------------------------------------------------------------------------------------------------
''Propeller P8X32A ROM Unknown sections   $F002..$F003
''                                        $F7A4..$F7FF
''                                        $FBA4..$FF6F
''--------------------------------------------------------------------------------------------------------------
  byte            $CA, $02                                                             ' $F002    ..
''--------------------------------------------------------------------------------------------------------------
  byte                      $C1, $C9, $30, $08, $08, $00, $00, $00, $04, $2B, $50, $35 ' $F7A4      ..0......+P5
  byte  $45, $64, $40, $35, $09, $4D, $34, $03, $40, $CC, $0A, $05, $C5, $D9, $48, $28 ' $F7B0  Ed@5.M4.@.....H(
  byte  $2B, $02, $9C, $C2, $91, $CD, $24, $00, $09, $15, $30, $00, $2B, $4B, $B5, $C2 ' $F7C0  +.....$...0.+K..
  byte  $09, $4F, $1E, $02, $8D, $83, $78, $21, $C0, $09, $38, $06, $81, $D9, $34, $21 ' $F7D0  .O....x!..8...4!
  byte  $C1, $F0, $08, $1D, $E7, $92, $DD, $7C, $C1, $49, $30, $04, $09, $EB, $4A, $12 ' $F7E0  .......|.I0...J.
  byte  $41, $05, $10, $05, $00, $00, $10, $00, $81, $CB, $26, $20, $C7, $DF, $D8, $FD ' $F7F0  A.........& ....
''--------------------------------------------------------------------------------------------------------------
  byte                      $09, $C9, $30, $00, $E4, $D9, $59, $37, $E7, $DB, $DD, $79 ' $FB94      ..0...Y7...y
  byte  $C0, $0D, $3C, $00, $E6, $9B, $DD, $78, $41, $C9, $30, $0A, $40, $8B, $3C, $20 ' $FBA0  ..<....xA.0.@.<
  byte  $C5, $D9, $48, $28, $C1, $D9, $08, $00, $09, $CD, $34, $0D, $E7, $93, $DD, $79 ' $FBB0  ..H(......4....y
  byte  $00, $4E, $10, $00, $C1, $CD, $30, $02, $81, $CB, $66, $00, $C4, $BD, $08, $05 ' $FBC0  .N....0...f.....
  byte  $C1, $B5, $08, $2C, $09, $44, $34, $00, $E7, $93, $DD, $78, $E6, $DB, $DD, $79 ' $FBD0  ...,.D4....x...y
  byte  $80, $0C, $30, $11, $01, $49, $14, $06, $C7, $DF, $D8, $FD, $C5, $D9, $48, $28 ' $FBE0  ..0..I........H(
  byte  $C5, $74, $78, $11, $41, $4F, $30, $04, $09, $05, $34, $02, $01, $80, $00, $01 ' $FBF0  .tx.AO0...4.....
  byte  $23, $4D, $9C, $C2, $A1, $59, $11, $03, $C1, $D0, $08, $20, $41, $C9, $30, $0A ' $FC00  #M...Y..... A.0.
  byte  $CC, $E8, $24, $23, $19, $4D, $34, $01, $0C, $23, $54, $33, $09, $EF, $18, $0F ' $FC10  ..$#.M4..#T3....
  byte  $2B, $CB, $94, $C6, $09, $4D, $34, $03, $41, $C9, $38, $06, $08, $0F, $10, $04 ' $FC20  +....M4.A.8.....
  byte  $01, $80, $00, $01, $01, $49, $14, $06, $91, $CD, $24, $00, $91, $CB, $22, $23 ' $FC30  .....I....$..."#
  byte  $2B, $CB, $94, $C6, $81, $CD, $34, $2D, $C7, $DF, $D8, $FD, $00, $DC, $38, $00 ' $FC40  +.....4-......8.
  byte  $C0, $0B, $34, $01, $45, $6D, $50, $35, $C1, $F9, $08, $1C, $41, $54, $38, $05 ' $FC50  ..4.EmP5....AT8.
  byte  $2B, $CB, $94, $C6, $81, $CB, $26, $10, $A1, $D9, $95, $62, $2A, $09, $B0, $C0 ' $FC60  +.....&....b*...
  byte  $81, $C9, $26, $01, $C5, $91, $48, $28, $C5, $D9, $48, $28, $85, $7D, $70, $11 ' $FC70  ..&...H(..H(.}p.
  byte  $C5, $D9, $48, $28, $45, $6D, $50, $35, $19, $C0, $08, $00, $2B, $CB, $9C, $C3 ' $FC80  ..H(EmP5....+...
  byte  $27, $6F, $DC, $F5, $C1, $CB, $30, $23, $2B, $CF, $94, $C0, $45, $6D, $50, $35 ' $FC90  'o....0#+...EmP5
  byte  $00, $DC, $38, $00, $21, $4B, $B1, $44, $40, $8B, $3C, $20, $C1, $D9, $08, $00 ' $FCA0  ..8.!K.D@.< ....
  byte  $0D, $C9, $14, $02, $45, $25, $50, $31, $08, $0A, $30, $04, $81, $CB, $26, $20 ' $FCB0  ....E%P1..0...&
  byte  $91, $CB, $22, $23, $C1, $D9, $48, $3C, $09, $5B, $30, $00, $81, $CB, $34, $2B ' $FCC0  .."#..H<.[0...4+
  byte  $41, $C9, $30, $0A, $09, $49, $14, $06, $E3, $5D, $11, $C0, $45, $6D, $54, $35 ' $FCD0  A.0..I...]..EmT5
  byte  $09, $CD, $14, $00, $05, $6B, $50, $37, $09, $00, $0E, $02, $09, $CD, $38, $00 ' $FCE0  .....kP7......8.
  byte  $8D, $83, $78, $21, $41, $C9, $38, $06, $92, $99, $63, $E6, $C5, $D9, $48, $28 ' $FCF0  ..x!A.8...c...H(
  byte  $95, $4B, $09, $6F, $2B, $CB, $94, $C6, $0D, $C9, $14, $02, $88, $5C, $28, $00 ' $FD00  .K.o+........\(.
  byte  $2B, $CB, $9C, $C6, $C1, $D9, $48, $04, $41, $54, $38, $05, $89, $4D, $32, $04 ' $FD10  +.....H.AT8..M2.
  byte  $84, $7C, $78, $11, $2F, $6F, $F4, $F3, $9B, $F0, $9C, $D9, $C1, $D9, $08, $20 ' $FD20  .|x./o.........
  byte  $91, $CD, $24, $00, $09, $5D, $10, $04, $41, $DF, $34, $28, $C0, $0D, $3C, $00 ' $FD30  ..$..]..A.4(..<.
  byte  $27, $69, $DC, $F4, $2B, $CB, $94, $C6, $E6, $9B, $DD, $78, $C1, $D9, $08, $20 ' $FD40  'i..+......x...
  byte  $45, $6D, $54, $35, $C1, $05, $30, $00, $A1, $D9, $95, $62, $91, $CF, $20, $02 ' $FD50  EmT5..0....b.. .
  byte  $09, $CF, $1C, $02, $41, $CD, $3C, $04, $E7, $DB, $DD, $79, $80, $E4, $70, $3D ' $FD60  ....A.<....y..p=
  byte  $C1, $D9, $08, $20, $05, $69, $50, $37, $41, $54, $38, $05, $08, $8D, $30, $00 ' $FD70  ... .iP7AT8...0.
  byte  $45, $60, $78, $30, $81, $CD, $26, $00, $41, $54, $38, $05, $C4, $3D, $78, $11 ' $FD80  E`x0..&.AT8..=x.
  byte  $41, $C9, $38, $06, $81, $CF, $30, $06, $C1, $D9, $08, $20, $E7, $92, $DD, $7C ' $FD90  A.8...0.... ...|
  byte  $40, $4C, $14, $02, $81, $CF, $30, $06, $09, $4D, $30, $01, $41, $80, $38, $28 ' $FDA0  @L....0..M0.A.8(
  byte  $C0, $0D, $3C, $00, $27, $6D, $CC, $F5, $09, $40, $30, $02, $09, $A1, $30, $00 ' $FDB0  ..<.'m...@0...0.
  byte  $81, $CB, $26, $20, $27, $6F, $D8, $F7, $01, $4D, $38, $00, $19, $C0, $08, $02 ' $FDC0  ..& 'o...M8.....
  byte  $05, $69, $50, $37, $48, $0D, $30, $00, $C0, $0D, $3C, $00, $09, $49, $14, $06 ' $FDD0  .iP7H.0...<..I..
  byte  $E6, $DA, $DD, $79, $41, $81, $3C, $08, $C4, $FC, $08, $34, $81, $CB, $22, $22 ' $FDE0  ...yA.<....4..""
  byte  $C0, $98, $08, $31, $2B, $CB, $9C, $C6, $41, $90, $30, $28, $0D, $C9, $14, $02 ' $FDF0  ...1+...A.0(....
  byte  $C4, $FC, $08, $34, $C4, $BC, $48, $28, $45, $26, $54, $37, $C5, $F4, $08, $09 ' $FE00  ...4..H(E&T7....
  byte  $80, $05, $30, $11, $45, $60, $7C, $30, $C1, $49, $30, $04, $81, $C9, $34, $23 ' $FE10  ..0.E`|0.I0...4#
  byte  $A7, $7D, $FC, $57, $F5, $DD, $59, $33, $41, $CD, $3C, $04, $81, $CB, $26, $20 ' $FE20  .}.W..Y3A.<...&
  byte  $09, $4D, $30, $00, $24, $22, $C5, $BB, $00, $CC, $34, $00, $E7, $93, $DD, $79 ' $FE30  .M0.$"....4....y
  byte  $C5, $6F, $70, $36, $09, $4D, $30, $03, $01, $95, $3C, $02, $45, $6D, $50, $35 ' $FE40  .op6.M0...<.EmP5
  byte  $E0, $BD, $DD, $5E, $00, $DC, $38, $00, $09, $44, $34, $00, $81, $49, $22, $04 ' $FE50  ...^..8..D4..I".
  byte  $79, $9F, $AF, $FF, $FF, $FF, $FF, $FF, $05, $6B, $54, $35, $24, $26, $C5, $BF ' $FE60  y........kT5$&..
  byte  $45, $6D, $54, $35, $41, $49, $04, $02, $40, $D0, $30, $08, $C5, $D9, $48, $28 ' $FE70  EmT5AI..@.0...H(
  byte  $40, $4C, $14, $02, $2E, $62, $B4, $E8, $C0, $09, $38, $06, $C1, $D9, $08, $00 ' $FE80  @L...b....8.....
  byte  $09, $CD, $54, $00, $09, $C4, $3C, $00, $09, $5B, $30, $00, $C5, $74, $78, $11 ' $FE90  ..T...<..[0..tx.
  byte  $E6, $9B, $DD, $7C, $89, $15, $20, $00, $A1, $59, $11, $03, $A7, $7D, $FC, $57 ' $FEA0  ...|.. ..Y...}.W
  byte  $C1, $91, $08, $20, $C0, $D8, $08, $20, $09, $15, $30, $00, $81, $CB, $26, $20 ' $FEB0  ... ... ..0...&
  byte  $09, $4D, $30, $01, $C5, $D1, $48, $28, $00, $CC, $34, $00, $81, $CB, $26, $10 ' $FEC0  .M0...H(..4...&.
  byte  $41, $CD, $3C, $06, $C1, $CF, $30, $02, $2B, $CF, $94, $C0, $09, $CD, $30, $01 ' $FED0  A.<...0.+.....0.
  byte  $91, $C9, $24, $01, $2B, $CB, $9C, $C3, $2A, $8B, $94, $C3, $2B, $4F, $94, $C2 ' $FEE0  ..$.+...*...+O..
  byte  $08, $8D, $30, $00, $E6, $DA, $DD, $7C, $45, $25, $50, $31, $49, $49, $32, $00 ' $FEF0  ..0....|E%P1II2.
  byte  $3D, $3D, $3D, $3D, $3D, $3D, $3D, $3D, $3D, $3D, $3D, $3D, $3D, $3D, $3D, $3D ' $FF00  ================
  byte  $20, $43, $6F, $70, $79, $72, $69, $67, $68, $74, $20, $32, $30, $30, $35, $20 ' $FF10   Copyright 2005
  byte  $20, $50, $61, $72, $61, $6C, $6C, $61, $78, $2C, $20, $49, $6E, $63, $2E, $20 ' $FF20   Parallax, Inc.
  byte  $2D, $2D, $2D, $2D, $2D, $2D, $2D, $2D, $2D, $2D, $2D, $2D, $2D, $2D, $2D, $2D ' $FF30  ----------------
  byte  $77, $77, $77, $2E, $70, $61, $72, $61, $6C, $6C, $61, $78, $2E, $63, $6F, $6D ' $FF40  www.parallax.com
  byte  $3D, $3D, $3D, $3D, $3D, $3D, $3D, $3D, $3D, $3D, $3D, $3D, $3D, $3D, $3D, $3D ' $FF50  ================
  byte  $08, $8E, $18, $00, $91, $CB, $20, $03, $C5, $D9, $48, $28, $2B, $CB, $9C, $E2 ' $FF60  ...... ...H(+...
''--------------------------------------------------------------------------------------------------------------

Cluso99 · 2014-08-13 13:38

Chip,
Just to bump this in case you missed it.
In my last post, I asked about the other code in the Rom. Obviously there is a copyright message. Are the others just random or are they used for something?

pik33 · 2014-08-13 21:41

Bob Lawrence (VE1RLL) wrote: »

re; Delphi,

Delphi is great. :cool: I'm having a lot of fun with the latest Delphi version (XE6 ) and making apps for Android and iOS. Hopefully, I can do a few Propeller chip projects with a Android interface later.

Re:Here is the code from Delphi that installed the various sections of ROM into the final image:

Thanks! Chip. It's very interesting. I'll try it with XE6 sometime.

There is the Lazarus. It is Delphi compatible. It is free. BST was made with it

jmg · 2014-08-13 22:04

Cluso99 wrote: »

Last night I wrote a spin program to dump the bytes from Rom in DAT format for compilation. I also have the unscrambling working in this as well. I just need to find out which parts need to be unscrambled. I can work out what sections need to be unscrambled

I'm curious how much logic the silicon un-scramble was costing ?

Cluso99 · 2014-08-13 22:37

jmg wrote: »

I'm curious how much logic the silicon un-scramble was costing ?

In EP4CE22F17 & Quartus 14 it is using
59 LEs
43 Combination
2 Registers

cgracey · 2014-08-14 09:57

Cluso99 wrote: »

Chip,
Just to bump this in case you missed it.
In my last post, I asked about the other code in the Rom. Obviously there is a copyright message. Are the others just random or are they used for something?

Sorry this took me so long. I had to go back into the Delphi files to drum up this code. Here is an expanded set of routines which will answer your questions:

procedure TEditorForm.ComposeROM;
const
  Info: string = '================' +
                 ' Copyright 2005 ' +
                 ' Parallax, Inc. ' +
                 '----------------' +
                 'www.parallax.com' +
                 '================' ;
var
  i: integer;
  sum: byte;
begin
  // install rom items
  InstallBackground;
  InstallFont($8000);
  InstallLogTable($C000);
  InstallAntiLogTable($D000);
  InstallSinTable($E000);
  InstallInterpreter($F004);
  InstallBooter($F800);
  InstallRunner;
  // install info string
  for i := 0 to Length(Info)-1 do MemoryBuffer[$FF00+i] := Byte(Info[1+i]);
  // install rom checksum byte
  sum := 0;
  MemoryBuffer[$F002] := 0;
  for i := $8000 to $FFFF do sum := sum + MemoryBuffer[i];
  MemoryBuffer[$F002] := -sum;
  // save rom file
  SaveFile('ROM', @MemoryBuffer[$8000], $8000);
  // show bitstream (diagnostic)
  //ShowBitStream;
end;

procedure TEditorForm.InstallBackground;
var
  Size, i: integer;
  Background: array[0..$1FF] of cardinal;
begin
  // use scrambled interpreter longs as background
  Size := InstallInterpreter($8000) shr 2;
  for i := 0 to $1FF do Background[i] := PIntegerArray(@MemoryBuffer[$8000])[i];
  RandSeed := $6E3973BF;
  for i := 0 to $1FFF do PIntegerArray(@MemoryBuffer[$8000])[i] := Background[Random(Size)];
end;

procedure TEditorForm.InstallFont(Address: integer);
var
  FontBitmap: TBitmap;
  xx, yy, x, y: integer;
  Buff: cardinal;
begin
  FontBitmap := TBitmap.Create;
  FontBitmap.LoadFromFile('FONT.BMP');
  for yy := 0 to 7 do
  begin
      for xx := 0 to 15 do
      begin
          for y := 0 to 31 do
          begin
              for x := 0 to 15 do Buff := (Buff shl 2 +
                  Ord(FontBitmap.Canvas.Pixels[(xx shl 1 + 0) * 17 + 1 + 15 - x, yy * 33 + 1 + y] = clBlack) +
                  Ord(FontBitmap.Canvas.Pixels[(xx shl 1 + 1) * 17 + 1 + 15 - x, yy * 33 + 1 + y] = clBlack) shl 1);
              PIntegerArray(@MemoryBuffer[Address])[yy shl 9 + xx shl 5 + y] := Buff;
          end;
      end;
  end;
  FontBitmap.Free;
end;

procedure TEditorForm.InstallLogTable(Address: integer);
var
  i, s: integer;
  f: extended;
begin
  for i := 0 to $7FF do
  begin
    f := Round(Log2(1 + i / $800) * $10000);
    s := Trunc(f);
    PWordArray(@MemoryBuffer[Address])[i] := s;
  end;
end;

procedure TEditorForm.InstallAntiLogTable(Address: integer);
var
  i, s: integer;
  f: extended;
begin
  for i := 0 to $7FF do
  begin
    f := Round(Power(2, i / $800) * $10000);
    s := Trunc(f);
    PWordArray(@MemoryBuffer[Address])[i] := s;
  end;
end;

procedure TEditorForm.InstallSinTable(Address: integer);
var
  i, s: integer;
  f: extended;
begin
  for i := 0 to $800 do
  begin
    f := Round(Sin((i / $800) * (pi / 2)) * $FFFF);
    s := Trunc(f);
    PWordArray(@MemoryBuffer[Address])[i] := s;
  end;
end;

function TEditorForm.InstallInterpreter(Address: integer): integer;
var
  f: file;
  Size, i: Integer;
begin
  AssignFile(f, 'INTERPRETER.obj');
  try
    Reset(f, 1);
    Size := FileSize(f);
    BlockRead(f, MemoryBuffer[Address], Size);
    for i := 0 to Size shr 2 - 1 do
      Scramble(PIntegerArray(@MemoryBuffer[Address])[i]);
    Result := Size;
  finally
    CloseFile(f);
  end;
end;

procedure TEditorForm.InstallBooter(Address: integer);
var
  f: file;
  Size, i: Integer;
begin
  AssignFile(f, 'BOOTER.obj');
  try
    Reset(f, 1);
    Size := FileSize(f);
    BlockRead(f, MemoryBuffer[Address], Size);
    for i := 0 to Size shr 2 - 1 do
      Scramble(PIntegerArray(@MemoryBuffer[Address])[i]);
  finally
    CloseFile(f);
  end;
end;

procedure TEditorForm.InstallRunner;
var
  f: file;
  Size: Integer;
begin
  AssignFile(f, 'RUNNER.obj');
  try
    Reset(f, 1);
    Size := FileSize(f);
    BlockRead(f, MemoryBuffer[$10000 - Size], Size);
  finally
    CloseFile(f);
  end;
end;

procedure TEditorForm.Scramble(var x: integer);
begin
  x :=  x shr 14 and 1 shl 31 +
        x shr 17 and 1 shl 30 +
        x shr 03 and 1 shl 29 +
        x shr 07 and 1 shl 28 +
        x shr 04 and 1 shl 27 +
        x shr 01 and 1 shl 26 +
        x shr 09 and 1 shl 25 +
        x shr 02 and 1 shl 24 +
        x shr 15 and 1 shl 23 +
        x shr 06 and 1 shl 22 +
        x shr 29 and 1 shl 21 +
        x shr 23 and 1 shl 20 +
        x shr 26 and 1 shl 19 +
        x shr 10 and 1 shl 18 +
        x shr 24 and 1 shl 17 +
        x shr 13 and 1 shl 16 +
        x shr 22 and 1 shl 15 +
        x shr 18 and 1 shl 14 +
        x shr 05 and 1 shl 13 +
        x shr 28 and 1 shl 12 +
        x shr 20 and 1 shl 11 +
        x shr 00 and 1 shl 10 +
        x shr 11 and 1 shl 09 +
        x shr 21 and 1 shl 08 +
        x shr 30 and 1 shl 07 +
        x shr 27 and 1 shl 06 +
        x shr 12 and 1 shl 05 +
        x shr 25 and 1 shl 04 +
        x shr 31 and 1 shl 03 +
        x shr 08 and 1 shl 02 +
        x shr 16 and 1 shl 01 +
        x shr 19 and 1 shl 00 ;
end;

Cluso99 · 2014-08-14 14:50

Many thanks Chip

Everything makes sense now. I can create a complete clean unscrambled ROM file(s) in spin from a P1.

rogloh · 2014-08-14 18:19

I've been looking at the ROM as well recently and have a question. When does the runner code actually get run by the "run" bytecode in SPIN? I see in the interpreter source code (amazing piece of work by the way) that the run bytecode (opcode 0x15) calls it but none of the SPIN code I experimented with and decoded seems to issue this bytecode. I thought it might be used for launching a spin COG to setup some stack frame stuff but even that code calls coginit instead, (both cognew and coginit operations - pasm and spin).

The reason I ask is that on a DE-0 nano (with its effectively usable 66kB of block RAM) I'd expect for some applications it might be nice to be able to allocate 48kB for hub SRAM, 16kB for COGRAM and the final 2kB for the SPIN interpreter launched at bootup. The problem with this is if "run" is ever executed it seems there needs to be this runner bytecode located in memory somewhere as well. That pushes us just past the 2kB limit if we want it to fit everything in that 2kB block. I'm thinking the original loader code may not be needed if we can boot directly from FPGA RAM already pre-initialized at startup, so we are getting very close to the 2kB target range except for this runner.src thing. When is it used?

Dave Hein · 2014-08-14 20:15

The runner code is used to set up the stack and the values of PBASE, VBASE, DBASE, PCURR and DCURR when starting a Spin cog. The following code

PUB main
  cognew(test1, 0)

PUB test1

generates the following bytecodes.

Addr : 001C:          37 00  : Constant Mask Y=0 00000002
Addr : 001E:             35  : Constant 1 $00000000
Addr : 001F:             15  : Run           
Addr : 0020:             2C  : CogInit(Id, Addr, Ptr)
Addr : 0021:             32  : Return

Cluso99 · 2014-08-14 20:27

rogloh wrote: »

I've been looking at the ROM as well recently and have a question. When does the runner code actually get run by the "run" bytecode in SPIN? I see in the interpreter source code (amazing piece of work by the way) that the run bytecode (opcode 0x15) calls it but none of the SPIN code I experimented with and decoded seems to issue this bytecode. I thought it might be used for launching a spin COG to setup some stack frame stuff but even that code calls coginit instead, (both cognew and coginit operations - pasm and spin).

The reason I ask is that on a DE-0 nano (with its effectively usable 66kB of block RAM) I'd expect for some applications it might be nice to be able to allocate 48kB for hub SRAM, 16kB for COGRAM and the final 2kB for the SPIN interpreter launched at bootup. The problem with this is if "run" is ever executed it seems there needs to be this runner bytecode located in memory somewhere as well. That pushes us just past the 2kB limit if we want it to fit everything in that 2kB block. I'm thinking the original loader code may not be needed if we can boot directly from FPGA RAM already pre-initialized at startup, so we are getting very close to the 2kB target range except for this runner.src thing. When is it used?

There are 3 pieces to the puzzle...
Interpreter $F004-F7A3
Booter $F800-FB93
Runner $FF70-FFFF
There is also a copyright message and web address which can be shrunk but should remain.

The booter is a pasm program and runs first. That is what determines whether to download new code into hub ram (and possibly program to eeprom), or load from eeprom if valid, else stop.

The runner is (I think) called by the interpreter when you start a new spin program.

So currently at least, you must have 4KB devoted to ROM.

BTW I have tried to increase the hub RAM to 50KB and it fails

rogloh · 2014-08-14 21:38

Dave Hein wrote: »
The runner code is used to set up the stack and the values of PBASE, VBASE, DBASE, PCURR and DCURR when starting a Spin cog. The following code
PUB main
  cognew(test1, 0)

PUB test1
generates the following bytecodes.
Addr : 001C:          37 00  : Constant Mask Y=0 00000002
Addr : 001E:             35  : Constant 1 $00000000
Addr : 001F:             15  : Run           
Addr : 0020:             2C  : CogInit(Id, Addr, Ptr)
Addr : 0021:             32  : Return 

Thanks!

I just tried it and got your result. Previously I was incorrectly using a function in another object and getting a different result from this. The first two lines below in "start" was how I was testing it and it wasn't doing the right thing for me, while the third line worked. I don't tend to run SPIN objects, typically just the first one in COG 0 and I usually spawn PASM COGs instead, so I am not so familiar with how it works.

OBJ
  main    : "Main"

PUB start
 cognew(main.start,0)
 cognew(@entry,0)
 cognew(local,0)

PUB local

DAT
entry long 0



I get this result


23                       cognew(main.start,0)
Addr : 0110:             34  : Constant 0 $FFFFFFFF
Addr : 0111:             00  : Drop Anchor Push 
Addr : 0112:       06 03 01  : Call Obj.Sub 3 1
Addr : 0115:             35  : Constant 1 $00000000
Addr : 0116:             2C  : CogInit(Id, Addr, Ptr)
24                       cognew(@entry,0)
Addr : 0117:             34  : Constant 0 $FFFFFFFF
Addr : 0118:       C7 80 FC  : Memory Op Long PBASE + ADDRESS Address = 00FC
Addr : 011B:             35  : Constant 1 $00000000
Addr : 011C:             2C  : CogInit(Id, Addr, Ptr)
Addr : 011D:          37 00  : Constant Mask Y=0 00000002 2
25                       cognew(local,0)
Addr : 011F:             35  : Constant 1 $00000000
Addr : 0120:             15  : Run           
Addr : 0121:             2C  : CogInit(Id, Addr, Ptr)
Addr : 0122:             32  : Return

rogloh · 2014-08-14 21:41

I am wondering if the booter code (if some of it is stlil required) may actually be automatically loaded into the initial 2kB COG 0 RAM by the FPGA initial configuration file (like upper memory can be). If this is done automatically upon FPGA reset this might free up ~2kB of RAM. It only needs to run once upon Prop reset right? I am assuming both resets are coupled of course, so the FPGA reload and prop reset happen together. I suspect this would often be the case in an FPGA design.

Cluso99 · 2014-08-14 22:51

rogloh wrote: »

I am wondering if the booter code (if some of it is stlil required) may actually be automatically loaded into the initial 2kB COG 0 RAM by the FPGA initial configuration file (like upper memory can be). If this is done automatically upon FPGA reset this might free up ~2kB of RAM. It only needs to run once upon Prop reset right? I am assuming both resets are coupled of course, so the FPGA reload and prop reset happen together. I suspect this would often be the case in an FPGA design.

Though you could do this, it isnt going to free up 2KB of hub. The interpreter is 496 longs and the runner is 24 longs = 520 longs! And now you have moved the interpreter which is really a no-no because a number of programs call it directly

As for reset, I cannot see why it wouldn't always call the booter.

Why do you feel the need for more than 44KB of hub plus 4KB ROM in the emulation?

rogloh · 2014-08-14 23:43

I was just trying to figure out how to free up maximum hub memory and freeing a total of 48kB hub RAM on DE-0 nano seemed like a nice round number for buffer sizes fitting into a power of two (that being the lower 32kB hub already supported by existing tools PLUS an additional 16kB extra space above that to pretty much do with as you please, potentially available as general purpose hub RAM instead of having to be read only data).

It seems the spare 2kB could only work if there is no runner bytecode called ie. no spin spawning other spin cogs, and that's too limiting. A customized interpreter with runner code held elsewhere in low memory may possibly allow it, but it won't be compatible with standard setups. Maybe some LEs could be used to hold the 24 longs worth of bytecode stored near $FFFF, but that gets fairly wasteful of LEs.

PS. I wasn't planning on moving the interpreter btw, it would want to stay at $F004 where it normally lives.

If it is not simple/possible to get the spare 2kB to fit with the intepreter it's probably not worth pursuing the idea. I have another idea for that 2kB now anyway, as a palette RAM...

Tubular · 2014-08-15 04:50

Hmmm that's an interesting idea Rogloh. Use the palette ram to boot and then later as a lookup table. Neat.

Cluso99 · 2014-08-15 05:22

rogloh,
I don't think you understand me.I found that you can only configure a total of 64KB, not 66KB. I don't know why, but it fails.
You are going to require 4KB of hub from $F000-$FFFF from what I can see.
While you may get away with putting the booter into cog, the interpreter and runner combined are > 2KB by a small amount.

What I have currently is all 48KB of hub RAM, with the 32-48KB being remapped as 48-64KB as well. ie it is double mapped.
This way I can preload the whole 16KB of ROM (which also appears in 32-48KB) and then overwrite what I dont want as RAM.

This is great for testing, but ultimately I don't think this can work in real silicon.

Cluso99 · 2014-08-15 07:37

Tonight I finally created the rom file in a format that $readmemh was happy with. It did not like the @xxxx adress at the start of each line.
I havve spaces and comments beginning with // and they seem fine.
The file I am generating (well it just outputs from spin on a P1 to the terminal, and i cut and paste into proptool and save as a *.spin file.
The verilog is loading the file into the rom section without complaining. Hoping i can test its working correctly over the w/e.
Didnt get time to post the code and file before i retired for the evening, so i will do that in the morning.

BTW i have found that you can check the syntax by just compiling/checking a single file. Saves heaps of time. So i am gradually finding my way around both quartus and verilog

rogloh · 2014-08-15 07:50

Tubular wrote: »

Hmmm that's an interesting idea Rogloh. Use the palette ram to boot and then later as a lookup table. Neat.

Well actually that would be ideal, but to date Cluso has found issues with being able to use that last 2kB of the 66kB total I was originally talking about. I don't know why it wouldn't be usable memory but he wasn't able to make it work as extra hub RAM it seems (at least yet). Maybe something else in the P1 design stops us from using it, from what I can tell in the data sheets it's supposed to be there inside the Cyclone IV part.

The palette RAM reuse idea could still work in the loader part of the top 4kB of hub memory. It needs to be a dual ported RAM though as my future SDRAM/gfx controller will need to read from it at high speed independently from the prop core. It only needs to be 256 x 18 bits/24 bits wide, which is only taking 1kB so it wouldn't necessarily clobber the runner.src byte code.

jmg · 2014-08-15 14:40

rogloh wrote: »

I am wondering if the booter code (if some of it is stlil required) may actually be automatically loaded into the initial 2kB COG 0 RAM by the FPGA initial configuration file (like upper memory can be)..

This is a good idea, and you could actually FPGA-Boot-preload all 1~8 COGS and any of the HUB using $readmemh.
That could be useful for final FPGA field deployments, as it saves a boot step, but is less suited to any eventual ASIC.

Better may be a dual approach,
a) * Support pre-load of all memory areas, using Verilog $readmemh options.
b) * Develop a minimal P2-like tiny boot ROM, that flips in on Prop-RESET, and then removes itself.
As in P2, the sole purpose of this, is to locally load-run more code. This code could have i2c and SPI choices.

a) is suited to FPGA and especially FLASH based FPGA/CPLD, and b) is closer to real silicon in P1 and P2.

Question for Chip re ROM code

Comments