hgcd1/2

Laurent Desnogues laurent.desnogues at gmail.com
Tue Sep 3 09:08:03 UTC 2019


On Tue, Sep 3, 2019 at 10:49 AM Torbjörn Granlund <tg at gmplib.org> wrote:
>
> nisse at lysator.liu.se (Niels Möller) writes:
>
>   And this on a laptop with an Intel U4100 (5 years old?), so I'd assume
>   it doesn't have a particularly fast div instruction. Should we just
>   delete div1 ? On which architectures can we expect it to be beneficial?
>   It should be fairly easy to find out, if we define a HGCD_DIV1_METHOD
>   known to tuneup, to select between plain division and the div1 function.
>
> Interesting but not too surprising results.
>
> Intel ark doesn't seem to know any processor called "U4100" so I cannot
> figure out what generation it belongs to.

That most likely is 4100U, a 4th generation chip (aka Haswell).

> IIRC, Intel has not improved plain 64b/64b division since Haswell, which
> is older than 5 years.
>
> Again, if IIRC, small quotients may result in 16 cycle latency.  That's
> the lowest possible timing.
>
>
>
> --
> Torbjörn
> Please encrypt, key id 0xC8601622
> _______________________________________________
> gmp-devel mailing list
> gmp-devel at gmplib.org
> https://gmplib.org/mailman/listinfo/gmp-devel


More information about the gmp-devel mailing list