[PATCH] T3/T4 sparc shifts, plus more timings
Torbjorn Granlund
tg at gmplib.org
Sun Mar 31 22:57:36 CEST 2013
David Miller <davem at davemloft.net> writes:
I'm going to play around with some things to try and fix this.
Interestingly, UltraSPARC-1 and UltraSPARC-2 would not group the
final cycle of the loop this way, because of it's requirement that
integer operations must occur in the first three instructions of
a group.
But the code might not run well on US1/2 because shift needs to come
before any other inter artithmetic op.
I think I'll commit the code to the top-level sparc64 directory now, then
remove ultrasparct1/[rl]shift{c,}.asm and but leave the files in
ultrasparc1234 for now.
US1/2 is not very important, but we should check the lshift and rshift
performance before we remove the special ultrasparc1234 code, not to
cause outright slowdown.
Reducing the function variants is a nice thing, but it is not the most
important goal.
--
Torbjörn
More information about the gmp-devel
mailing list