Anyone have any practical experience with POSIT math?
cgracey
Posts: 14,231
in Propeller 2
Any experience out there with POSIT math, as opposed to FLOAT math?
Comments
https://forums.parallax.com/discussion/166008/john-gustafson-presents-beyond-floating-point-next-generation-computer-arithmetic
https://cs.rutgers.edu/~santosh.nagarakatte/papers/cordic-cf-2020.pdf
https://cs.rutgers.edu/~santosh.nagarakatte/papers/PositDebug-PLDI-2020-preprint.pdf
https://arxiv.org/pdf/2006.00364.pdf
Addendum:
https://carrv.github.io/2020/papers/CARRV2020_paper_5_MV.pdf
https://posithub.org/
https://posithub.org/docs/PDS/PositEffortsSurvey.html
https://arxiv.org/pdf/2007.05344.pdf
Addendum: Fresh Posit standard:
https://00248909357804553642.googlegroups.com/attach/7967db941064b/PositStandard4.9.pdf?part=0.1.1&view=1&view=1&vt=ANaJVrEmv4tqVsYp-qE_uU3ckUN2-lApdt4y7vAGB-PlXl2yrduHGDX8KteDpL9A8YC0R7K16GQQUcy3UpdWOzzAZXfxoUIpJatn2YBu3RvXnWGoqpgLTdw
I think that precision advantage of Posits is vastly overestimated. It might be useful to mathematicians when you have to handle formulas as abstract theory and you need a best fit for any possible case. However, in the real world the error of measuring is almost always worse that the error of calculation (if done right!).
If you are clever you often can make assumptions about where to expect best accuracy and which cases are to be avoided. Example: It is much better to completely avoid cases where the numbers become very big or very small instead of relying on the number format to handle them.
The example with the scalar product of a = (3.2e7...) b = (4.0e7...) is just plain wrong (as Heater already found out). 32bit IEEE floating point math gives the correct result. Precision for even much bigger numbers could be maintained by re-ordering the calculations and doing the subtraction of large numbers first before adding the small ones.
And I doubt that the variable bit size of exponent and mantissa is very efficient. It might be for hardware implementations but it is surely not very helpful for software emulation. Most of the complexity of the IEEE format comes from the special cases, denormalized numbers, NaN and so on. Geting rid of those and instead using a single "error" flag would make things much simpler.
But I agree that the support of "inline" floating point math (software emulated for the P2 and maybe in hardware for the future P3) would be a big advantage, no matter if it'd be IEEE or Posits. Although almost anything cyn be done with fixed point math floats make the code much simpler, easier to read and maintain and can avoid most of the bugs (overflows...)
Without hardware support for either, like the Propellers, the question then is can a Prop1/2 do Posits quicker than IEEE floats? I'd guess that's what Chip is wanting to know.
https://eee.hku.hk/~hso/Publications/jaiswal_date_2018.pdf
https://hal.inria.fr/hal-01959581v4/document
Maybe you can give me some guidance here. If so I'd certainly appreciate it!
I'm currently fighting precision issues dealing with GPS coordinates in FlexBASIC. Single-precision IEEE floats just wont cut it, and the implementation of IEEE double-precision math in any of the Propeller compilers is still way out on the horizon.
Real example: The GPS module I'm dealing with outputs a longitude of 118.123456. That coordinate reflects an actual, repeatable, point on the ground. It's not just some "funny number". (In actuality this ZED-F9 GPS module does *centimeter* RTK accuracy, but I'm not using it that way). So the GPS output is easily accurate to 1/1,000,000 of a degree (ie, about 4 inches). By the time I run this position through single-precision math, that 1/1,000,000 of a degree gets devalued to about 36 feet CPE just in the conversion process. Run a few trig calcs and the cumulative error goes to almost 100 feet in the worst case.
Not to hijack this thread, but how would you deal with this? It seems like either Posits or IEEE double-precision or DEC64/fixed-point would be a better answer, but maybe I'm wrong?
I've now manually installed their test package and run the plot demo (sin, cos and atan) in that PDF. There is instructions on last page. It's my first time explicitly knowing I'm using Python.
Their instructions are for Ubuntu 18.04 but I'm using 20.04 so not all the listed components had to be newly installed. For example Python3 is already installed. Only had to add matplotlib and libmpfr-dev. The slow part was downloading their "artifact" source code from their webpage. Very slow responses.
My C is obviously rusty because it doesn't make sense to assign a value into an auto variable just before returning. Namely, the "uZ.ui = uiA;"
Here's the union definition:
The fastest data type that is at least 32 bit wide - i.e. it doesn't need to have the defined under/overflow behaviour of the normal uint32_t
The circumference of the earth is ~40,000km so 32 bit fixed point math would give you ~1cm resolution. As Evan already said it won't work with single precision IEEE floats because they only have 24 bits mantissa. Posits may give you some more bits but definitely less than 32.
By clever use of the formulas, re-ordering the operands and so on you can avoid unnecessarily loosing precision. But you can't beat the best case. Even the best number format can't pack more information into the available bits.
Scaling polar coordinates as fixed point numbers in such a way that 2^32 is one full circle would have the advantage that you can add angles without having to care about overflows, BTW.
Using floats (worse: radians as floats!) for angles is just deeply flawed in most cases.
I had never considered that. Then for short distance/vector calcs, you could avoid most of the trig functions entirely and just do a "run over rise" type of math using a local grid (Pythagorean style). It wouldnt be pretty, but on a small scale it would probably work just fine. Food for thought there.
So true. Unfortunately, I'm sort of stuck with them since they are the natural units for all of my spherical trig functions. I just don't have the math background needed to do much in the way of workarounds. There are some games that I might play, but the essential flaw remains. At one point I even tried to subsitute a scaled Taylor expansion for a couple of the primative functions, but that has its own set of problems (convergence) and it's really slow if you want any precision (which was the entire point!).
I believe @ersmith has been pondering implementing a fixed-point math option in the Flex suite. If/when that becomes a reality, it'll be a very happy day for me.
Sorry to have hijacked this thread, but I really appreciate your responses.
In C only unsigned integers have a defined overflow behavior, which is the "natural roll-over" you describe. Signed integers can do anything at all upon overflow. Taking only the bottom 32 bits is a common behavior (common enough that many people think it's what will always happen) but implementations are free to use saturating arithmetic (overfloat gives MAXINT), to throw an exception, or indeed to return a random number upon overflow.
Another alternative is to use two floats to represent the numbers, a high and low part. You can google for "double-double arithmetic". This is most commonly used as a "poor mans quad precision" with two doubles, but the same principle can be used with two floats to implement a kind of double.
It's a form of casting between datatypes. So the value being assigned to integer uZ.ui is then immediately used as return value posit uZ.p.