[PATCH 2/2] Optimize 64-bit mpn_add_N and mpn_sub_N for sparc T3 and later.
Torbjorn Granlund
tg at gmplib.org
Thu Mar 7 00:51:24 CET 2013
David Miller <davem at davemloft.net> writes:
From: Torbjorn Granlund <tg at gmplib.org>
Date: Wed, 06 Mar 2013 12:36:34 +0100
> I think all you T3/T4 changes are now in. Please check that I didn't
> mess something up.
>
> Thanks for this contribution!
Looks good, there is some trailing whitespace in the ChangeLog but
that's probably my fault:
+
* mpn/sparc32/ultrasparct1/mul_1.asm: Unroll main loop one time, add
T2/T3/T4 timings.
- * mpn/sparc32/ultrasparct1/addmul_1.asm: Likewise.
+ * mpn/sparc32/ultrasparct1/addmul_1.asm: Likewise.
* mpn/sparc32/ultrasparct1/submul_1.asm: Likewise.
I optimised submul_1.asm, and then edited both addmul_1 and submul_1 to
use as similar operand order as possible. Please test these using
tests/devel/try, and please time this new submul_1.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: addmul_1.asm
Type: application/octet-stream
Size: 1815 bytes
Desc: not available
URL: <http://gmplib.org/list-archives/gmp-devel/attachments/20130307/c5dc1f82/attachment.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: submul_1.asm
Type: application/octet-stream
Size: 1836 bytes
Desc: not available
URL: <http://gmplib.org/list-archives/gmp-devel/attachments/20130307/c5dc1f82/attachment-0001.obj>
-------------- next part --------------
--
Torbj?rn
More information about the gmp-devel
mailing list