Shop OBEX P1 Docs P2 Docs Learn Events
Prop II info from the Webinar... — Parallax Forums

Prop II info from the Webinar...

Cluso99Cluso99 Posts: 18,069
edited 2011-11-13 15:51 in Propeller 1
The webinar finished 30 mins ago, with Chip's presentation the last. I understand the webinar will be posted but it is after 10pm at Parallax.

Here is what I recall...

It has been quite a learning curve with the layout. There are about 40,000 flip-flops in the Prop II. There were about 160*8=1280 in Prop I.

After the original chip layout simulation the critical path was showing only about 80MHz clocking. Chip has now refined the critical paths and it now looks like 180MHz.

There will be ~126KB Hub SRAM and ~2KB Hub ROM.
128 pins in QFP 14mm?
92 General purpose I/O
8 Cogs
64bit CNT register
Video will clock at ~230MHz
Maybe ~$12-$15 at launch
180nm

Expect the instruction set info to be posted on the forum shortly.

If the prop were done in 45nm then perhaps it would clock at 1GHz, have 2MB hub ram and 16 or 24 cogs. But this would require ~$1M+ for prototype layout costs vs ~$160K for 180nm. Anyone with a spare $1M ??? Could be some super ROI (return on investement) if you do! This could give the ARM a run for its money hey.

Perhaps we should start an account for Parallax to buy tickets in the lotto ;)
«1345

Comments

  • william chanwilliam chan Posts: 1,326
    edited 2011-10-27 22:47
    Video will clock at ~230MHz

    The Video runs on a separate clock?
  • Cluso99Cluso99 Posts: 18,069
    edited 2011-10-27 22:53
    Video runs on a separate clock now, via PLL. Guess it will be the same concept but allowing a higher output than now.

    Just remembered...

    Each counter is able to do 2 PWM outputs. There is a lot of additional functions in the counters (understatement).

    16x16 multiply in 1 clock; 32x32 multiply in 16 clocks; 32/16 divide in 16 clocks; Cordic ? clocks.
  • User NameUser Name Posts: 1,451
    edited 2011-10-28 00:29
    Thanks for posting this, Cluso. I had every intention of "tuning in" the webinar but got lost in other activities.

    Prop II is looking pretty exciting!

    180 nm works for me. I also like the projected introductory price. Kudos to Chip and Co.
  • BaggersBaggers Posts: 3,019
    edited 2011-10-28 01:56
    Thanks for the post Cluso :D, great news, and a great price for what is/will be a great chip!
    Now where was that $1m+ I had laying around?... if only! cos it would be worth it!
  • jmgjmg Posts: 15,184
    edited 2011-10-28 02:46
    Cluso99 wrote: »
    Each counter is able to do 2 PWM outputs. There is a lot of additional functions in the counters (understatement).

    and also Two quadrature counting channels.
    Less clear on Capture, and overflow handling features ?
    I hope more complete info on the counters goes up soon, so we can check they have not missed anything.

    Being able to run the core slower than the Counters, would be nice for power saving.
    With simple gate and steering done the right way, you can create a Auto-ranging reciprocal Frequency counter with two 32 bit counters,
    with a common capture enable ( SW SFR flag AND with Edge ), and a CaptDone flag.

    Such counters have very wide dynamic range (Hz to 10's MHz), with a fixed digits/second precision, so would make an ideal Obex..

    32 bits is smallish these days, so a system like on the System counter of reading > 32 bits would give Prop 2 and edge.
  • RaymanRayman Posts: 14,854
    edited 2011-10-28 03:30
    Thanks Cluso. I missed it too...

    Any update on timeline? (I'm guessing 6 months).
  • Cluso99Cluso99 Posts: 18,069
    edited 2011-10-28 03:48
    There was no timeline. My guess would be min 6 months (no inside info)

    i recall Chip saying that the video can be loaded and another load can be done without having to do a waitvid - meaning you can have a waiting set loaded in advance. Video can be 1080p.

    These counters are really going to be fantastic, along with the I/O pins.

    No guarantees with first iteration that the fuses will work, nor code decryption. Neither for intracog high speed comms (between chips).



    Wish I had a spare $1M :) There are not many certainties, but this would have to be one with great prospects!
  • David BetzDavid Betz Posts: 14,516
    edited 2011-10-28 06:30
    Cluso99 wrote: »
    If the prop were done in 45nm then perhaps it would clock at 1GHz, have 2MB hub ram and 16 or 24 cogs. But this would require ~$1M+ for prototype layout costs vs ~$160K for 180nm. Anyone with a spare $1M ??? Could be some super ROI (return on investement) if you do! This could give the ARM a run for its money hey.

    Perhaps we should start an account for Parallax to buy tickets in the lotto ;)

    That $1M sounds cheap compared to what I thought Chip said. Didn't he say more like $3M? I guess it doesn't matter one way or the other if they win the lottery though. :-)
  • tonyp12tonyp12 Posts: 1,951
    edited 2011-10-28 07:01
    "90 nm process refers to the level of CMOS process technology that was reached in the 2002–2003 timeframe"

    How about 90nm then, $500k (guessing) should not be hard to come up with.

    12cogs
    360mhz
    512k sram, 32k rom

    But maybe it's other technical issues arises when going smaller and Chip have to start from scratch.
  • Oldbitcollector (Jeff)Oldbitcollector (Jeff) Posts: 8,091
    edited 2011-10-28 11:00
    Most excellent! I had trouble connecting to the live event last night, so I'm looking forward to the "tape delay" version. :)

    Thanks for posting that!

    OBC
  • Oldbitcollector (Jeff)Oldbitcollector (Jeff) Posts: 8,091
    edited 2011-10-28 11:02
    Cluso99 wrote: »

    If the prop were done in 45nm then perhaps it would clock at 1GHz, have 2MB hub ram and 16 or 24 cogs. But this would require ~$1M+ for prototype layout costs vs ~$160K for 180nm. Anyone with a spare $1M ??? Could be some super ROI (return on investement) if you do! This could give the ARM a run for its money hey.

    Will they take a check? ;)
  • Ken GraceyKen Gracey Posts: 7,401
    edited 2011-10-28 11:27
    To clarify the numbers Chip mentioned, the current Propeller 2 design process is 180 nm. The test chip shuttle runs and first production lot are in excess of $500K but around $1M by the time we've got our characterization, package startup and early documentation efforts completed. The 45 nm process would support 1GHz per cog but would cost in excess of $3M (unverified, at least by me) of manufacturing costs. Indeed, the costs for designing and producing chips are very high.

    Ken Gracey
  • __red____red__ Posts: 470
    edited 2011-10-28 12:36
    I wonder how much of that cost is "tooling" verses the cost of the actual devices. IE, with a $3m cost - what would the cost per device be when it hit market.

    I'm REALLY excited because I think the P2 may hit a price/performance point that will beat both GPU and FPGA arrays in my application.
  • Cluso99Cluso99 Posts: 18,069
    edited 2011-10-28 13:13
    Thanks for the clarification costs Ken. I did think Chips prices sounded cheap. Of course none of that includes the man-years development costs and the hardware and software that has been bought along the way.

    If I win the lotto I would gladly sponsor the P3 (45nm version). And I don't think I would be the only one! IMHO the P3 would be an ARM killer for ipad clones and phones so long as the power consumption could be controlled. Nice to dream anyway ;)

    Ken: (OT) WOuld it be possible to get some sort of lower level block diagram of the counter logic (inc video logic) in the existing prop? I don't believe we have touched the surface of what these counters could be made to do, if only we just had more low-level info. I don't want to take Chip or Beau away from the PropII - could an FAE delve into this perhaps?
  • RaymanRayman Posts: 14,854
    edited 2011-10-28 13:37
    Marketing idea: Maybe they could use some of those fuses to disable 6 cogs and sell a cheaper, 2 cog version...
  • JRetSapDoogJRetSapDoog Posts: 954
    edited 2011-10-28 13:42
    Regarding a release timeframe, when asked about this, Chip safely stated that he didn't know. However, he indicated that there was maybe 8 weeks of addtional integration work (my words due to poor memory), and then that there'd probably be at least 2 iterations of testing (maybe that means at least one shuttle or sample run) before production could begin in earnest. He also said something to the effect that the integration work (my words again) and/or coordination with the synthesis team were proceding particularly well recently and, paraphrasing, picking up steam. Given all that, I'd guess that the above estimate of a minimum of six months is accurate since it's difficult to imagine how required test runs and testing could get done any faster. Also, if memory serves from reading this forum, there seems to be a foundry lead time of at least two to three months to enter mass production. Hope that helps shed light on the timeframe issue. In the nutshell, it looks like things are looking real good for a release sometime in 2012 (though perhaps we shouldn't expect anything before UPEW at the earliest).

    About other chip configurations and/or process technologies, Chip said something to the effect of, while those are interesting/exciting possibilities, the team needs to focus on following through with the current design, which, of course, makes sense and is what we have all been clamoring for. Still, I really like Tony's above-mentioned 12-cog, 512KB 90nm "compromise" design (i.e., the process technology between 45nm and 180nm) because the additional memory would be so useful for certain designs without requiring going off chip for more. I can't help but wonder what it would take to make that happen, maybe just a commitment to the change (and reduced pressure from us). It sounds like significant portions of the current design (memory elements, etc.) could be replicated and it sounds like their synthesis partner is flexible and used to working with bleeding-edge designs.

    Not to belabor this topic (please don't stone me), but, we've waited this long, so I, for one (perhaps the only one), could tolerate a few months of delay for a significantly enhanced version. I say this because, even though Parallax Semiconductor now exists as an entity, it doesn't sound easy for them to come out with multiple versions of chips, considering their current level of resources and the overhead costs associated with each chip design. Moreover, technology has advanced so much that a more powerful chip might be better received in the market place. On the other hand, based on what Chip said, producing other versions should be much more feasible now that they have attached the capabilities of the outside synthesis company. But I also heard Chip ever-so-slightly muse for a split second about a 64-bit architecture (who knows, maybe with more cog register address space or perhaps a third instruction register per instruction (src1, src2 and dest)), and perhaps something like that would more naturally justify the effort and cost to move to a 45nm process. Such a small process technology perhaps is a more natural candidate for a Prop III than a suped-up version of the current Prop II design (although bigger companies often produce such large chip families). As such, it might be the case that a somewhat more powerful/flexible Prop II chip (such as one meeting Tony's wish list) would fit better between such a possible Prop III and the current Prop. On the other hand, if Parallax has or can ramp-up the resources to do various versions, then sticking with the plan makes complete sense, and, arguably, sticking with the plan makes sense even if they can't marshal such resouces, too, because a Prop II of the current design represents a significant new device in and of itself.

    Lastly, of interest was Chip's comment that his increasing confidence in his team's ability to work with the synthesis company has led him to consider changes and/or features that he would have never comtemplated using only the old methods (as implementing them would have been impossible in practice), and that he had needed to unlearn some of the formerly applicable rules and/or expand his thinking. That's pretty interesting to me because we sometimes wall ourselves in unnecessarily without even realizing it.
  • Bill HenningBill Henning Posts: 6,445
    edited 2011-10-28 19:47
    Rayman wrote: »
    Marketing idea: Maybe they could use some of those fuses to disable 6 cogs and sell a cheaper, 2 cog version...

    I HATE marketing gimmicks like that. Intel/AMD do this, and it is a sign of companies not really wanting to innovate, but wanting squeeze the last bit of blood from stone by spending more to make crippled parts to sell for less.

    One cost-reduced design possibility would be 48 I/O's, 4 cogs, 64KB ram. It would be 1/2 the die size, and fit TQFP-64
  • SRLMSRLM Posts: 5,045
    edited 2011-10-29 00:18
    Thank you for posting. Is there a target audience for the chip? It seems like in embedded systems world there is a strong divergence: high power systems (EDDIE, Linux based systems, etc) and very low performance systems (small AVRs, Arduino). The P2 seems to fit right in the middle.
  • frank freedmanfrank freedman Posts: 1,983
    edited 2011-10-29 02:33
    If someone were to start a pool on engineering sample availability, I would be picking christmas 2012. Just based on what chip had said plus a guestimate of complete testing / validation of all functionality. These test suites were what I was questioning regarding tools for P2.

    Frank
  • Cluso99Cluso99 Posts: 18,069
    edited 2011-10-29 03:34
    Whatever we finally get in the PropII it will be amazing. Meanwhile, the current Prop continues to reveal newfound treasures and thus uses never before considered.
  • RaymanRayman Posts: 14,854
    edited 2011-10-29 04:47
    I HATE marketing gimmicks like that. Intel/AMD do this, and it is a sign of companies not really wanting to innovate, but wanting squeeze the last bit of blood from stone by spending more to make crippled parts to sell for less.

    I think it makes a lot of sense... The market is very tight these days and they just said how much $$ and time it takes to create a new design. But, if you can use the same design for two products, you really save.
    Plus, the customers benefit from having a choice. If they would sell more chips and make more money doing this, isn't that a win, win?
    Maybe another way to go it to limit the operating frequency on one version.
    Or, you could have a package with less I/O pins and increased pin spacing (easier to solder by hand).

    One the other hand, Parallax does seem to stand out by having just one version of a chip.
  • cgraceycgracey Posts: 14,256
    edited 2011-10-29 09:28
    I'll restate some things from the Thursday webinar, for clarity.

    We seem to be about eight weeks from completing the main synthesized logic block that forms the brains of the next Propeller chip. I don't know how long it will take from there to get the production chip out.

    The mask cost for this 180nm process is about $150k. Currently, the design is synthesizing and laying out at 180MHz. That means 180 MIPS per cog, for a total of 1,440 MIPS per chip. I'm really pleased with how things are moving along. It's been a big learning experience. To put 180 32-bit MIPS into perspective for us old-timers, an Apple II would have taken (if I remember) eight instructions to add two 32-bit (4-byte) values together (existing in zero-page, for efficiency). Eight clocks at 1MHz would mean 125,000 32-bit adds per second. If you divide 180,000,000 by 125,000, you get 1,440. So, a single cog could be said to have 1,440 times the computational ability of an Apple II. And there are eight of those! And when it comes to multiplying, instead of 1,440x, the speed increase jumps to probably 60,000x.

    For fun, I asked the company that is doing our synthesis work to pass our design through a 40nm-process design flow (TSMC 45G, which I mistakenly called '45nm' in the webinar). I was interested to hear how fast the design could run at 40nm. I was thinking it would top out at ~600MHz. It turns out that it closes timing at 1GHz! That's 1,000 MIPS per cog. Imagine 1,000 instructions every microsecond. If we were to build a version of the Propeller in this 40nm process, it would probably have 16 cogs and 2MB of hub RAM. Imagine 16,000 MIPS per chip! That's 100x the performance of the current Propeller chip. The only big roadblock to realizing this is the $3M required for the mask set.

    Back to the current project... Below is a colored map of the currently-synthesizing 180nm logic block that includes the eight cogs and hub circuitry. Each color represents the circuitry of a particular cog. Nobody told the tools to make a pie. They just did that, since each cog's support memories are located in a circular pattern around the perimeter of the block, and each cog mainly connects to its own internal circuitry. Where they come together in the center, there is a lot of wiring congestion. At first, we tried to arrange the cogs as rectangular blocks arrayed left to right, with their memories overhead. It wouldn't route because of the many lateral wires required to connect them all together. This rotational layout is much better, because the cogs can converge in a common point and this reduces the heck out of the wiring required among them.

    die.png


    The dark cyan splotches in the bottom two pie slices belong to the cogs they are within, but because some logic paths were optimized, they lost the label associations that tied them to the cogs they belonged to. In this mass of logic, there are 378k standard cells and over 200k nets. The actual cell area is 8.05 square mm. The layout area is about 14 square mm to afford some interstitial spaces for buffering and the clock tree. It would take person forever to do this, let alone the optimization and path reduction to get the whole thing to meet timing requirements. It took their computers only several hours to do this (along with a few $M of software).
    954 x 942 - 165K
    die.png 165.1K
  • LoopyBytelooseLoopyByteloose Posts: 12,537
    edited 2011-10-29 10:05
    Thanks Chip, and thanks Ken. Product development of any kind takes real courage as it requires endless patience. This is all rather exciting and amazing to follow - and an honor that you let us in on the actual development. It is wonderful to know that one day soon, the Propeller II will be challenging us with new ideas and new horizons.

    I am hoping that you will have something by Chinese New Years, but waiting is half the fun.
  • RaymanRayman Posts: 14,854
    edited 2011-10-29 10:09
    That's a cool image Chip. Why isn't it symmetric? You should put those computers back to work until they come up with 8-fold symmetry :)
  • Heater.Heater. Posts: 21,230
    edited 2011-10-29 10:13
    Is it normal now a days to have such organic looking chip layouts? Over the years from time to time one gets to see chip floor plans and they have always been regimented rectangular areas. This is something else, at least from where I'm looking.
  • mindrobotsmindrobots Posts: 6,506
    edited 2011-10-29 10:20
    Thanks for sharing! With speeds like that, I'll have REALLY fast poorly designed code!!!

    1ghz.....hmmmm, if every prop customer chipped in $20 or $50 how much could we raise? Think of it as open sourced funding or user community funding. It could take a bite out of the R&D cost for Prop 3 and our return on investment would be having the chip. The rewards of investment funding don't always need to be some % growth on your money. A Prop 3 has certain "intangible" benefits to many of us including growth and success for Parallax which really benefits all of us.

    Open up the "Kickstarter" project. I'm in at the $50 level!
  • rod1963rod1963 Posts: 752
    edited 2011-10-29 11:32
    It would take 60,000 Prop customers each paying $50 to get to the $3 million. Good luck with that.

    Still I don't get where is the market for a stripped down PropII? Because you will be competing directly with the likes of the M4 ARM's from ST and Freescale along with the offerings from TI and Microchip for market share.
  • Martin HodgeMartin Hodge Posts: 1,246
    edited 2011-10-29 11:44
    You know, if Makerbot can get $10M in venture capital for their burned plywood printers, why can't Parallax get a measly $3M for a possible game-changing silicon IP? I guess it really does boil down to "who you know".
  • rod1963rod1963 Posts: 752
    edited 2011-10-29 12:04
    Martin

    If Parallax scores a decent design win with the Prop II then they'll have the cash to do the 1Ghz version. It will be interesting to see the Prop II going head to head with some of the ARM designs out there like the Da Vinci(which I think Parallax is targeting with the PropII).
  • frank freedmanfrank freedman Posts: 1,983
    edited 2011-10-29 12:15
    You know, if Makerbot can get $10M in venture capital for their burned plywood printers, why can't Parallax get a measly $3M for a possible game-changing silicon IP? I guess it really does boil down to "who you know".
    ]
    Actually has anyone ever met a venture capitalist or group that would not demand a certain amount of control and guaranteed performance milestones? HHmmmm, neither have I. Would Chip make lots of money? Probably. Would he loose control of the company? Sharper players than he have. Would it be as fun for him? Hell no. He has a great product now, designed at the his pace for his own good reasons. Are we happy? Yes, to paraphrase Jim Kinkaid "but like children everywhere, prepared to ask for more". Hence the drooling or whatever over the prop 2.

    Not having the Prop 2 right now will not kill the company, I can do stuff with it I can not do as easily as say a PIC, but for dsp or compute intensive the Prop is not my choice. Pick the most suitable part. Seems lots of people like the suitability of the prop as is; and when the prop 2 comes out will do the same. Maybe less the hobbiests understand that, but those doing this for a living (should) have this outlook cold. Or they quickly become hobby level.

    Just my $0.02USD worth. By the way, who will be starting the release date pool????

    Frank
Sign In or Register to comment.