mpn_sqrtrem{1,2} - patch for pure C implem
Adrien Prost-Boucle
adrien.prost-boucle at laposte.net
Tue Mar 28 19:46:33 UTC 2017
> The only branch is for the final correction at end of mpn_sqrtrem1.
> I tried with the previous mpn_sqrtrem1 version, which has a condition,
> and with an unconditional code that needs 2 multiplications.
>
> My version with unconditional correction had same speed.
> My version with current mpn_sqrtrem1 correction is slightly faster.
> So it's the "core" that is faster.
So the floating-point sqrtrem1 that I proposed before should be a bit faster
if it uses the correction with a condition, instead of the correction with only
unconditional instructions.
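
To make the two correction styles concrete, here is a minimal sketch in C. It is
not the code from the patch or from GMP: the function names sqrtrem_cond and
sqrtrem_uncond are hypothetical, the operand is a plain 32-bit value rather than
a limb, and it assumes the estimate s passed in is either the exact floor square
root or one too low.

```c
#include <stdint.h>

/* Illustrative only: hypothetical helpers, not GMP's mpn_sqrtrem1.
   Precondition (an assumption of this sketch): s == floor(sqrt(a))
   or s == floor(sqrt(a)) - 1. */

/* Final correction with a condition: one multiplication, one branch. */
static uint32_t
sqrtrem_cond (uint32_t a, uint32_t s, uint32_t *rem)
{
  uint64_t r = (uint64_t) a - (uint64_t) s * s;   /* remainder for the estimate */
  if (r > 2 * (uint64_t) s)                       /* (s+1)^2 <= a, so s is too low */
    {
      r -= 2 * (uint64_t) s + 1;                  /* a - (s+1)^2 */
      s++;
    }
  *rem = (uint32_t) r;
  return s;
}

/* Final correction without a condition: two multiplications, no branch. */
static uint32_t
sqrtrem_uncond (uint32_t a, uint32_t s, uint32_t *rem)
{
  uint64_t t = (uint64_t) s + 1;
  uint32_t fits = (t * t <= (uint64_t) a);        /* 1 if s must be incremented */
  s += fits;
  *rem = (uint32_t) ((uint64_t) a - (uint64_t) s * s);
  return s;
}
```

Both functions return the same root and remainder for any estimate that meets
the precondition, for example a = 17 with s = 3 or s = 4 gives root 4 and
remainder 1. Which one is faster depends on the machine and on how predictable
the branch is, so it has to be measured.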
Adrien