Dark Silicon
cgracey
Posts: 14,155
David Betz sent me some things to read about the RISK V project that were interesting. Linking from those resources, I started reading about what has been termed "Dark Silicon", which is an outcome of processes shrinking faster than energy-per-function, so you can practically only clock a portion of your silicon at full speed, due to heat buildup or shear power constraints. In 8 years' time, only 1/16 the area of a leading-edge chip will be able to clock at full speed. This leads to interesting future design considerations. Here is a paper about it:
http://darksilicon.org/papers/papers/taylor_dark_silicon_horsemen_dac_2012.pdf
http://darksilicon.org/papers/papers/taylor_dark_silicon_horsemen_dac_2012.pdf
Comments
It's linked from the one you put here Chip. Tons of other good stuff there, but this is one I'm reading with great interest! Explains a whole lot.
So, turning 99.9% of the silicon into memory blocks is not a bad way to use the space and still keep on shrinking.
EDIT: Hmm, maybe they count memory blocks as being dark ...
Only joking, have a relaxing Easter.
Right, yep, they do seem to do exactly that. But at the same time they also list memory blocks as an example of using the space productively ... but only as cache, they also note adding more cache runs out of puff when it gets too big.
The Propeller would be in heaven with a few hundred megs of HubRAM.
It also contains some great arguments for 16 cogs, the HUB memory design, smart pins, etc...
Those of us who were asking for other blocks, math, SERDES, etc... aren't on a bad path either.
Chip doing some reading and getting some perspective is a good thing.
Is this a promise? Spring gets more wonderful each year.
Ok, sorry, I'm being a jerk, but I couldn't resist.
It's really just the total amount of work that takes the time. Be patient.
Now that's certainly making some presumptions. While I'm sure Chip enjoys a break now and then I'd be surprised if he's ready to stop. I've certainly noticed Chip, and Ken too, has learnt quite a lot in the last year, some of which has had impact on progress. That's not the actions of someone getting stuck.
No dramas prior to shuttle run. One year tops - And half of that time is people saying, is it ready yet?
At which point Chip went very much quieter here and restarted with a new design that sounds a lot more elegant as far as we know so far.
Last night I got the new transfer/DDS block finished and I hooked it into the cog. It shuttles bytes/word/longs of I/O pin data to/from hub RAM at up to 32 bits per clock. It also drives DACs at those rates and performs DDS/Goertzel operations. It uses a 256x32 look-up RAM for outputting pixel-type and DDS/Goertzel data. All cogs can utilize the full bandwidth without affecting the others. I need to thoroughly test it now and then get onto the next things: hub execution (not much code, but challenging to think about), hub-based CORDIC (straightforward), and smart pins (not hard, but rather open-ended).
I hope that in two months' time, I'll have an FPGA image ready. We are making a final PCB for our Cyclone V -A7 board now. We've already proven it and developed its loader which uses 2Mbps FTDI USB serial talking to a Prop 1. It loads about 35x-70x faster than Quartus' built-in programmer (3 seconds to load straight into the FPGA, 6 seconds to load into flash for cold booting). Our -A7 board will support all 16 cogs and 512KB hub RAM. The DE2-115 will fit ~12 cogs and 256KB hub RAM. All this memory and I/O bandwidth, plus hub exec, is going to be really fun to work with.
I think you'll find the thermal issues were present even in the first shuttle run. The forum requested features were just a bonus.
I thought by the time that 1st shuttle run had been done, there had already been significant addition/lobbying for inclusions already?
Hard to say, but certainly too many cooks in the kitchen.
For some reason though, I'm still concerned with the either inability or unwillingness to simply output a 1 page post on what the current feature set is, and generally where Chip is at.
From a PM POV, I sense rightly or wrongly an apparent lack of interest in prioritizing some simple project communication with the most rabid of all Prop fans on the planet.
Even the most basic of PM's on a project like this, and almost anything smaller, would be having at least a weekly meeting tracking what is done, what isn't, milestone hit or roadblocks encountered, etc,etc.
Any of the knowledgeable engineers at Parallax involved in the P2 should be able to take some meeting minutes, and be able to draft up a simple post which would answer 80-90% of the big question people have.
But maybe I am missing something simple, like Parallax' timeline. Maybe a late Q4, early Q1-2 '16 is what being aimed for?
Thats fine, I'd just hate to see the forum lose some of its more advanced users/volunteers over a simple lack of communication.
Turns out the shuttle run was two years ago! Gee, time flies. As far as I can tell Chip and Beau had worked on the design themselves up till that point. There was plenty of banter when Chip released the FPGA images and lots of ideas flying around but the shuttle run was already set at that stage.
Just two. Now, just the one. Plus others performing the extra jobs outside of design work.
Progress updates have occurred, you can consider Chips latest above as the one Ken promised. Again, there is so few cooks involved there isn't anyone who specialises in keeping documentation for the design while it's still in flux. That'll come later.
PS: There was documentation on the specs but it went out of date pretty quick.
Chip did the architecture piece by piece and supplied updates as he went. Forum comments were mostly of the what can it do rather than it should do such and such. What's quite notable to me now is there is a full two years of tasters there. Again, longer than I thought.
I have no idea how this relates to any shuttle run in the time line but there was certainly a lot ideas being presented by forum members, many of which seemed to have been added into the design, over the 300 odd pages of this thread http://forums.parallax.com/showthread.php/141706-Propeller-II[url]
In fact I suggested turning the task switching mechanism from a programmer controlled one to an automatic instruction by instruction thread interleaving one. Which to my amazement Chip implemented with a week or two.
See here:
[/url]http://forums.parallax.com/showthread.php/141706-Propeller-II?p=1117188&viewfull=1#post1117188
and here:
http://forums.parallax.com/showthread.php/141706-Propeller-II?p=1122235&viewfull=1#post1122235
So I could claim that is at least 3 cooks in the kitchen. I am very sure you can find more in that thread.
Feature was piled upon feature until the edifice collapsed.
I'm sure Chip now has a much more elegant design without all that helpful input.