mpn_sqrtrem{1,2}

Torbjörn Granlund tg at gmplib.org
Thu Mar 9 15:09:52 UTC 2017


"Marco Bodrato" <bodrato at mail.dm.unipi.it> writes:

  >   After the patch.
  >   real	3m47.247s
  
  That is the time to compute and check 2^33 square roots, but it can be reduced.
  
Computing just 2^32 square roots would be another great speedup...

  > Almost 2x speedup, awesome!
  
  Yes, Adrien spotted a real problem. The sqrtrem2 function was "hidden"
  under a heavy wrapper. Needed for large operands, but too heavy for
  smaller ones.
  
  Now, I believe he will be able to gain another 2x by improving sqrtrem2; I
  did not touch that core function, only the wrappers !-)
  
Nice!

And then another 2x might be gained by using the division-free iteration
(for A^{-0.5}) with mulhi/mullo/mulmid.
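
For reference, the division-free iteration can be sketched as below. This is
only an illustration in double precision, not GMP code: the function names
rsqrt_newton and sqrt_division_free, the starting guess, and the step count are
assumptions made for the example. A limb-level version would presumably work in
fixed point and use mulhi/mullo/mulmid to compute only the parts of each product
that are actually needed.

```c
/* Newton iteration for y ~ a^(-1/2), applied to f(y) = 1/y^2 - a:
 *     y <- y * (3 - a*y*y) / 2
 * The halving is a multiplication by 0.5 (a shift in fixed point), so the
 * loop contains no division.  It converges when 0 < y0 < sqrt(3/a).
 * Hypothetical helper for illustration; not part of GMP. */
double rsqrt_newton(double a, double y0, int steps)
{
    double y = y0;
    for (int i = 0; i < steps; i++)
        y = y * (1.5 - 0.5 * a * y * y);
    return y;
}

/* sqrt(a) = a * a^(-1/2): one more multiplication, still no division.
 * Hypothetical helper for illustration; not part of GMP. */
double sqrt_division_free(double a, double y0, int steps)
{
    return a * rsqrt_newton(a, y0, steps);
}
```

For a normalised to [1,4), a starting guess of 0.7 satisfies the convergence
condition, and since the error is roughly squared at each step, about ten steps
are more than enough to reach double precision.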

-- 
Torbjörn
Please encrypt, key id 0xC8601622

