tg at gmplib.org
Mon Jan 14 12:16:48 CET 2013
nisse at lysator.liu.se (Niels Möller) writes:
And for neon instructions, cycle numbers are in
Seems it should be able to do one vmull per cycle. Not sure how to
get latency from the given table, but maybe 6 cycles.
If that is true, and the NEON unit runs at the same clock as the main CPU,
and if we can sum up the products at that speed, we could expect a 4-fold
improvement compared to the current GMP code for ARM.
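
To make the "sum things up" part concrete, here is a rough sketch in portable C of what a mul_1-style loop built on vmull.u32 would have to do. It is a model only, not NEON code and not GMP's implementation: vmull_u32_model and mul_1_model are hypothetical names, limbs are assumed to be 32 bits, and n is assumed to be even. One vmull.u32 produces two independent 32x32 -> 64-bit products; the carry propagation afterwards is the part that would need to keep up with one vmull per cycle.

```c
#include <stddef.h>
#include <stdint.h>

/* Portable model of a single vmull.u32: two independent
   32x32 -> 64-bit products, one per lane. */
static void vmull_u32_model(uint64_t out[2],
                            const uint32_t a[2], const uint32_t b[2])
{
    out[0] = (uint64_t)a[0] * b[0];
    out[1] = (uint64_t)a[1] * b[1];
}

/* Hypothetical mul_1-style loop: rp[0..n-1] = up[0..n-1] * v,
   returning the most significant limb. Assumes n is even. */
static uint32_t mul_1_model(uint32_t *rp, const uint32_t *up,
                            size_t n, uint32_t v)
{
    const uint32_t vv[2] = { v, v };   /* multiplier copied to both lanes */
    uint64_t carry = 0;                /* always stays below 2^32 */

    for (size_t i = 0; i < n; i += 2) {
        uint64_t p[2];
        vmull_u32_model(p, up + i, vv);

        /* The summation step: fold each 64-bit product into the carry
           chain. p[k] + carry cannot overflow 64 bits because
           p[k] <= (2^32 - 1)^2 and carry < 2^32. */
        for (int k = 0; k < 2; k++) {
            uint64_t t = p[k] + carry;
            rp[i + k] = (uint32_t)t;   /* low 32 bits are the result limb */
            carry = t >> 32;           /* high 32 bits carry onward */
        }
    }
    return (uint32_t)carry;
}
```

The multiplies in the two lanes are independent, but the carry chain is serial, so whether the whole loop runs at multiplier speed depends on how cheaply that chain can be done.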