Sandybridge addmul_N challenge
Niels Möller
nisse at lysator.liu.se
Thu Feb 23 21:09:32 CET 2012
nisse at lysator.liu.se (Niels Möller) writes:
> So the recurrency, for one iteration, seems to be just 3 cycles. But the
> loop mixer doesn't find anything faster then 6.36 cycles for one
> iteration, or 3.18 per limb product. Which isn't too bad (a slight
> improvement over 3.24, which I think is the best reported earlier), but
> stubbornly above 3 c/l.
One update. I have now tried unrolling four times. Then I've seen one
sequence running at 6.16 cycles per iteration, or 3.08 c/l.
See shell:~nisse/hack/loopmix/lms/addmul_2-nisse-2.nlms.
Regards,
/Niels
--
Niels Möller. PGP-encrypted email is preferred. Keyid C0B98E26.
Internet email is subject to wholesale government surveillance.
More information about the gmp-devel
mailing list