[PATCH 2/2] Optimize 64-bit mpn_add_N and mpn_sub_N for sparc T3 and later.

Torbjorn Granlund tg at gmplib.org
Thu Mar 7 00:51:24 CET 2013


David Miller <davem at davemloft.net> writes:

  From: Torbjorn Granlund <tg at gmplib.org>
  Date: Wed, 06 Mar 2013 12:36:34 +0100
  
  > I think all your T3/T4 changes are now in. Please check that I didn't
  > mess something up.
  > 
  > Thanks for this contribution!
  
  Looks good, there is some trailing whitespace in the ChangeLog but
  that's probably my fault:
  
  +
   	* mpn/sparc32/ultrasparct1/mul_1.asm: Unroll main loop one time, add
   	T2/T3/T4 timings.
  -	* mpn/sparc32/ultrasparct1/addmul_1.asm: Likewise.	
  +	* mpn/sparc32/ultrasparct1/addmul_1.asm: Likewise.
   	* mpn/sparc32/ultrasparct1/submul_1.asm: Likewise.
   
I optimised submul_1.asm, and then edited both addmul_1.asm and
submul_1.asm so that their operand order is as similar as possible.
Please test both using tests/devel/try, and please time the new submul_1.
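
For anyone checking the results by hand, here is a minimal C sketch of
what the two routines are expected to compute, following the documented
behaviour of mpn_addmul_1 and mpn_submul_1.  It is not the code from the
attachments.  The names limb_t, ref_addmul_1 and ref_submul_1 are
hypothetical, and 32-bit limbs are assumed so that a 64-bit integer can
hold each intermediate product.

```c
#include <stddef.h>
#include <stdint.h>

typedef uint32_t limb_t;   /* assumed limb width; GMP's own type is mp_limb_t */

/* {rp,n} += {up,n} * v; returns the carry-out limb. */
static limb_t ref_addmul_1(limb_t *rp, const limb_t *up, size_t n, limb_t v)
{
    limb_t carry = 0;
    for (size_t i = 0; i < n; i++) {
        /* (2^32-1)^2 + 2*(2^32-1) = 2^64-1, so this sum fits in 64 bits. */
        uint64_t t = (uint64_t)up[i] * v + rp[i] + carry;
        rp[i] = (limb_t)t;
        carry = (limb_t)(t >> 32);
    }
    return carry;
}

/* {rp,n} -= {up,n} * v; returns the high product limb plus the borrow. */
static limb_t ref_submul_1(limb_t *rp, const limb_t *up, size_t n, limb_t v)
{
    limb_t carry = 0;
    for (size_t i = 0; i < n; i++) {
        /* (2^32-1)^2 + (2^32-1) < 2^64, so this sum fits in 64 bits. */
        uint64_t t = (uint64_t)up[i] * v + carry;
        limb_t lo = (limb_t)t;
        limb_t hi = (limb_t)(t >> 32);
        limb_t r = rp[i];
        rp[i] = r - lo;           /* wraps modulo 2^32 when r < lo */
        /* hi can be 2^32-1 only when lo is 0, so adding the borrow is safe. */
        carry = hi + (r < lo);
    }
    return carry;
}
```

The two loops differ only in the final add or subtract step, which is
the kind of symmetry that makes it easy to compare the two assembly
files side by side.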

-------------- next part --------------
A non-text attachment was scrubbed...
Name: addmul_1.asm
Type: application/octet-stream
Size: 1815 bytes
Desc: not available
URL: <http://gmplib.org/list-archives/gmp-devel/attachments/20130307/c5dc1f82/attachment.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: submul_1.asm
Type: application/octet-stream
Size: 1836 bytes
Desc: not available
URL: <http://gmplib.org/list-archives/gmp-devel/attachments/20130307/c5dc1f82/attachment-0001.obj>
-------------- next part --------------

-- 
Torbjörn

