[PATCH] Optimize 32-bit sparc T1 multiply routines.

David Miller davem at davemloft.net
Sun Jan 6 01:54:36 CET 2013


From: Torbjorn Granlund <tg at gmplib.org>
Date: Sun, 06 Jan 2013 01:19:39 +0100

> David Miller <davem at davemloft.net> writes:
> 
>   Each load can issue in 1 cycle, there is a 4 cycle latency, the
>   loads will fully pipeline.  Therefore the overhead is around 3n.
>   
> At most one memop / cycle?

Yes.
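
The arithmetic behind the quoted estimate can be sketched as follows. This is only an illustrative model, not code from the patch. It assumes one load issues per cycle, a 4 cycle load latency, and (a guess at what "3n" refers to) three loads per limb. The function names are hypothetical.

```c
/* Illustrative timing model only; the names are hypothetical and
   nothing here is taken from the patch under discussion. */

/* Cycles until the last of `loads` back-to-back loads has delivered
   its data, assuming one load issues per cycle and each load has a
   latency of `latency` cycles.  The first load issues in cycle 0, the
   last in cycle loads-1, and its data arrives `latency` cycles later. */
static unsigned long
pipelined_load_cycles (unsigned long loads, unsigned long latency)
{
  if (loads == 0)
    return 0;
  return (loads - 1) + latency;
}

/* Cycles for the same loads if each one had to wait for the previous
   load to complete, i.e. with no pipelining at all. */
static unsigned long
serialized_load_cycles (unsigned long loads, unsigned long latency)
{
  return loads * latency;
}
```

Under the assumed three loads per limb, n = 10 limbs gives 30 loads: pipelined_load_cycles (30, 4) is 33, close to 3n = 30, while serialized_load_cycles (30, 4) is 120. The constant latency term shows up once, so for large n the cost is set by the one-memop-per-cycle issue limit.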

