Prop2 Family

cgracey · 2016-05-27 08:18

In our discussions with pjv, we realized that the Prop2 can be stripped down, pretty much exactly like we've been doing on various FPGA builds, and very significant savings can be realized in fabrication. And by using plain I/O's in place of some smart pins, the edge of the die can be shrunk greatly, while maintaining the same number of I/O's.

Smart pins take 330um, while plain I/O's can take only 100um. By making variants where groups of 4 pins alternate between smart and plain, the pad frame can be reduced dramatically. Then, the guts can be shrunk, as well, by reducing the cogs and the hub RAM.

To synthesize these variants, it's only a matter of parameters in my top-level Verilog module. Then, Treehouse must take our several existing pads, along with a new plain I/O, and assemble pad frames which accept the synthesized guts. Once we get everything scripted, with the constraints set for each type of pad, it becomes very easy (and inexpensive) to make variants.

OnSemi's cost for a 180nm mask set is about $60k, which is way less than I was thinking. I was adding in the initial production run cost, to make it more than that, in my head. Anyway, with simple-to-make layout variations and shuttle runs, we can make all these for a fraction of what we had assumed separate chips would cost.

We're on track for making the big one, first, but as soon as we prove that it works, we can start generating these other designs with little effort and moderate money.

Here's a set of pinouts for what seems like a well-divided family of parts. The smallest part will cost only ~20% of what the big one will. This way, you can prototype with the big guy and then target one that is sufficient. Note that each TQFP package size has two pin-compatible variants, allowing you to use a bigger or smaller part on the same PCB design, as long as your smart pins are in the same place where the lower version has its smart pins (as opposed to plain I/O's).

On second thought, that 28-pin version can have 4 cogs. Two is too few.

cgracey · 2016-05-27 08:20

These variants all use the 16-slot egg-beater, so there's really no need to constrain cogs to a power of two. We could have three cogs, if we wanted. Keeping the eggbeater as is keeps things simple and objects' timing consistent.

Roy Eltham · 2016-05-27 08:54

I really like this family of Prop2 stuff.

Will these all run at the same clock rate? Or will the reduced stuff make it possible to run smaller ones faster?

Would a variant of the large size that is less cogs but more memory be possible? Say 8 cog, 1MB hub, 64 smart (or 32 smart 32 plain), or whatever combination that could fit 1MB hub (or more?).
More hub ram is always useful, even with less cogs.

Heater. · 2016-05-27 09:00

Sounds great. I too wondered about the more RAM option.

cgracey · 2016-05-27 09:04

Roy Eltham wrote: »

I really like this family of Prop2 stuff.

Will these all run at the same clock rate? Or will the reduced stuff make it possible to run smaller ones faster?

Would a variant of the large size that is less cogs but more memory be possible? Say 8 cog, 1MB hub, 64 smart (or 32 smart 32 plain), or whatever combination that could fit 1MB hub (or more?).
More hub ram is always useful, even with less cogs.

On the big pad frame, we could do more RAM and less cogs, but there's not enough room for 1MB.

They should all run at the same speed. The critical path is pretty much in the CORDIC, already.

cgracey · 2016-05-27 09:08

Here's something neat: Do you see all those VDD pins? They are for the 1.8V core. If/when we get to denser processes, those pins become lower voltage, while the I/O supply VIO stays at 3.3V, or maybe goes down to 2.5V. This pin out is durable.

Roy Eltham · 2016-05-27 09:13

Well, 640k or 768k would be useful also.

ErNa · 2016-05-27 09:15

Would allow to run msdos limits ;-)

evanh · 2016-05-27 09:22

I can see the 28 pin being demanded in both DIP and SOIC packages. I suspect the 44 pin variants won't sell unless they are notably cheaper than the 100 pin variants.

Hehe, the myriad of combinations is beginning ...

evanh · 2016-05-27 09:24

Oops, that 28 pin package still has an underside GND! Won't be possible for DIP.

Brian Fairchild · 2016-05-27 09:28

52-pin TQFP ?!?!?!

Please, everyone else uses either 48-pin or 64-pin and those are the sizes for which cheap adapters are available.

I guess the 28-pin is DIP? Why not pair that with a 0.8mm 32-pin QFP

Brian Fairchild · 2016-05-27 09:28

In fact, for family completeness, why not a DIP 40 paired with a 0.8mm QFP 44-pin?

evanh · 2016-05-27 09:39

Ha, yes, 52 pins, that's all about that underside GND pad again.

jmg · 2016-05-27 09:40

Brian Fairchild wrote: »

52-pin TQFP ?!?!?!

Please, everyone else uses either 48-pin or 64-pin and those are the sizes for which cheap adapters are available.

I guess the 28-pin is DIP? Why not pair that with a 0.8mm 32-pin QFP

I think TQFP48 is tight, to fit the targeted 32 io and supplies.
TQFP64 is more common than TQFP52, which may make it cheaper.
There is also a 0,4mm TQFP64 ( as well as 0.5 and 0.8mm ).

28 pins is unusual these days, a better choice would be TQFP32, as mentioned.
That is 0,8mm, which is wave solderable, and much thinner than SO28L

I noticed NXP seem to have pruned all DIP packages from their LOGIC families.

DIP moduiles can make sense, but DIP ICs are dinosaurs.

jmg · 2016-05-27 09:43

cgracey wrote: »

Here's something neat: Do you see all those VDD pins? They are for the 1.8V core. If/when we get to denser processes, those pins become lower voltage, while the I/O supply VIO stays at 3.3V, or maybe goes down to 2.5V. This pin out is durable.

Did you talk to OnSemi about Wide-Vcc, like 1.8~5.5V, for the smaller parts ?

Makes sense to use the fastest process possible for the largest variant, but as these scale down, power and voltage become more important.
Of course, at the same time total power envelopes drop, making the Core Regulator more practical on smaller parts.

evanh · 2016-05-27 09:49

JMG,
I can see the old logic families not selling DIPs now, they aren't exciting for hobby/educational material, but there is plenty of peops still buying DIP micro's. I'm sure Parallax can verify this.

All that said, the underside GND is a nice feature for the Prop2 because of its analogue needs.

Rayman · 2016-05-27 10:56

DIP is a dinosaur, but is it still good for education?

Does DIP P1 still sell?

I suppose it's the need for twin supplies that makes it not as easy as P1...

Can the baby version have an integrated LDO regulator for the core voltage?

Actually, I guess you can just make PCB with DIP footprint...

Cluso99 · 2016-05-27 10:57

Chip,
Would it be possible for all 4/5 variants to be mixed on the one wafer ???
Might therefore a mask set be ~$60K for the wafer ?

If so, it might be actually cheaper to have all the variants produced at one time. Of course, the numbers of each on a wafer would be fixed in whatever ratio you chose, but sometimes this is a cheaper overall cost due to the sharing of the mask costs.

Cluso99 · 2016-05-27 11:02

I would be interested to know what would fit with 1MB hub ram ???

eg 12 Cogs + 32 smart pins + 32 I/O + 1MB hub ram ???

evanh · 2016-05-27 11:56

Cluso99 wrote: »

I would be interested to know what would fit with 1MB hub ram ???

eg 12 Cogs + 32 smart pins + 32 I/O + 1MB hub ram ???

Chip said no to even 8 Cogs, 32+32 I/O, 1MB. I'd guess it would require much more drastic changes like removal of LUTs and more.

Ale · 2016-05-27 13:45

A 28 DIP variant would be great !, specially if it has 4 cogs and 64 kBytes SRAM

. If not it will be compared to the ATMega328

. Sadly it needs 29 pins... What about a 32 narrow PDIP instead ?
I like it

, really.... did you know (of course you do!), there is PLCC52

Ramon · 2016-05-27 14:55

Great news! I will really love to see that 28 pin P2 made. DIP, SOIC, TSSOP, whatever ...

However, you will need to check if the microchip patent is still valid or not. Check this:

http://microcontroller.com/news/luminary_micro_lawsuit.asp
http://www.finetpat.com.tw/default.asp?ID=186
http://www.google.com/patents/US5847450

If not sure, you can choose a package with at least 32 pins.

kbash · 2016-05-27 15:22

This is a wonderful idea Chip! This is exactly what I was hoping might come to be!

The possibility of designing with a full feature chip then easily pruning down for production cost savings gives a nice "Buffer" zone for guys like me who are never quite sure how much "Horsepower" we might need. Introducing a new product with smaller (cheaper) chips but having a solid, easy upward migration path gives another level of comfort too.

I think that keeping this "family" of P2's in mind will make it much easier to attract new designers to the Parallax bandwagon.

Onward and upward!

Heater. · 2016-05-27 15:23

Do frikken what? !!

That patent is about:

“Microcontroller Having An N-bit Data Bus Width with Less than N I/O Pins and A Method Therefore”

WTF?

kwinn · 2016-05-27 15:37

Heater. wrote: »

Do frikken what? !!

That patent is about:

“Microcontroller Having An N-bit Data Bus Width with Less than N I/O Pins and A Method Therefore”

WTF?

All I can say is the patent office needs to set a minimum IQ requirement and some level of knowledge of the area the patent applies to for whoever is making the decisions on what is patentable.

rabaggett · 2016-05-27 16:03

yes, X100 YES!

Heater. · 2016-05-27 17:23

kwinn,

Yep. In the old days patent offices used hire people like Einstein as patent clerks.

The whole patent idea should be scrapped. It has never worked.

Back in the day when it was time to build steam engines that could create rotary motion to dive machines in factories James Watt was not allowed to do it the obvious way, use a crank shaft. Why because some jerk had a patent on using a crank handle in a steam engine. I'm mean really, the crank handle had been around for millenia. So james had to develop all kind of complex and inefficient mechanisms to do it, like sun and planet gears.

A hundred years ago someone wrote a book about the history of steam power, and noted how the patent system had held up the development of steam power by two decades.

Or they should just crowd source the whole patent application checking business. Think "patentoverflow.com". All us people out here could soon tell the them if their was prior art or if the "invention" was not actually trivially obvious to anyone versed in the arts.

Cluso99 · 2016-05-27 20:09

I presume we will be able to overclock the P2 provided we don't use the cordic etc that are in the critical paths ???

jmg · 2016-05-27 20:11

Rayman wrote: »

Can the baby version have an integrated LDO regulator for the core voltage?

Actually, I guess you can just make PCB with DIP footprint...

Yes, note Parallax do not make the Stamp, using DIP parts.

evanh wrote: »

... but there is plenty of peops still buying DIP micro's.

I'm sure DIPs still sell, but for existing designs, much less for NEW design starts ?

evanh wrote: »

All that said, the underside GND is a nice feature for the Prop2 because of its analogue needs.

I think that requirement kills DIP anyway, so the DIP discussion is moot.

The other small package choice is QFN32, but that cannot wave solder.
It comes down to PADDLE size, and actual package is an easy late-choice.

Provided the die fits, if someone wants enough QFN32, Parallax can run that package.

Cluso99 · 2016-05-27 21:18

Chip,

re 28 pin version
Not sure whether there are any specific reasons, but might it be more prudent to shift RESn to pin 14, and shift XI/XO down to pins 16/15 ???
This would mean that the pins 1..13 and 16..28 would match the QFP51 LHS & RHS pinouts.

In the smaller 8 cog versions, would the egg-beater be reduced to 8 wide, meaning normal hub would now be 1:8 rather than 1:16 clocks ie 2x the hub speed of the 16 cog version?
Similarly with the 4 cog version?
Would the 4 cog version actually derive much benefit from the egg-beater silicon? (hub exec would be ~50% speed without egg-beater)

cgracey wrote: »

In our discussions with pjv, we realized that the Prop2 can be stripped down, pretty much exactly like we've been doing on various FPGA builds, and very significant savings can be realized in fabrication. And by using plain I/O's in place of some smart pins, the edge of the die can be shrunk greatly, while maintaining the same number of I/O's.

Smart pins take 330um, while plain I/O's can take only 100um. By making variants where groups of 4 pins alternate between smart and plain, the pad frame can be reduced dramatically. Then, the guts can be shrunk, as well, by reducing the cogs and the hub RAM.

To synthesize these variants, it's only a matter of parameters in my top-level Verilog module. Then, Treehouse must take our several existing pads, along with a new plain I/O, and assemble pad frames which accept the synthesized guts. Once we get everything scripted, with the constraints set for each type of pad, it becomes very easy (and inexpensive) to make variants.

OnSemi's cost for a 180nm mask set is about $60k, which is way less than I was thinking. I was adding in the initial production run cost, to make it more than that, in my head. Anyway, with simple-to-make layout variations and shuttle runs, we can make all these for a fraction of what we had assumed separate chips would cost.

We're on track for making the big one, first, but as soon as we prove that it works, we can start generating these other designs with little effort and moderate money.

Here's a set of pinouts for what seems like a well-divided family of parts. The smallest part will cost only ~20% of what the big one will. This way, you can prototype with the big guy and then target one that is sufficient. Note that each TQFP package size has two pin-compatible variants, allowing you to use a bigger or smaller part on the same PCB design, as long as your smart pins are in the same place where the lower version has its smart pins (as opposed to plain I/O's).

On second thought, that 28-pin version can have 4 cogs. Two is too few.

David Betz · 2016-05-27 21:25

Interesting. The versions with fewer COGs would actually have lower latency in accessing the hub. That will mean that code written for the 16 COG version might not work correctly on a version with fewer COGs. Is that correct?

Prop2 Family

Comments