[Gmp-commit] /var/hg/gmp: Fix timing table and add some comments.
mercurial at gmplib.org
Wed Mar 9 22:11:23 CET 2011
details: /var/hg/gmp/rev/1c891bd40cff
changeset: 14021:1c891bd40cff
user: Torbjorn Granlund <tege at gmplib.org>
date: Wed Mar 09 22:11:17 2011 +0100
description:
Fix timing table and add some comments.
diffstat:
mpn/x86_64/aorrlsh_n.asm | 11 +++++++----
1 files changed, 7 insertions(+), 4 deletions(-)
diffs (24 lines):
diff -r 22d94377cc74 -r 1c891bd40cff mpn/x86_64/aorrlsh_n.asm
--- a/mpn/x86_64/aorrlsh_n.asm Wed Mar 09 21:37:26 2011 +0100
+++ b/mpn/x86_64/aorrlsh_n.asm Wed Mar 09 22:11:17 2011 +0100
@@ -22,13 +22,16 @@
C cycles/limb
C AMD K8,K9 2.87 < 3.85 for lshift + add_n
C AMD K10 2.75 < 3.85 for lshift + add_n
-C Intel P4 14.6 > 7.33 for lshift + add_n
-C Intel core2 4 > 3.27 for lshift + add_n
-C Intel NHM 2.83 < 3.75 for lshift + add_n
-C Intel SBR 3.25 < 3.46 for lshift + add_n
+C Intel P4 22 > 7.33 for lshift + add_n
+C Intel core2 4.1 > 3.27 for lshift + add_n
+C Intel NHM 4.4 < 3.75 for lshift + add_n
+C Intel SBR 3.17 < 3.46 for lshift + add_n
C Intel atom ? < 8.75 for lshift + add_n
C VIA nano 4.7 < 6.25 for lshift + add_n
+C TODO
+C * Can we propagate carry into rdx instead of using a special carry register?
+C That could save enough insns to get to 10 cycles/iteration.
define(`rp', `%rdi')
define(`up', `%rsi')
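
For readers unfamiliar with the "lshift + add_n" column in the timing table: it gives the cost of getting the same result with two separate passes, a left shift followed by an addition. The C sketch below illustrates that comparison only; it is not GMP's code. It assumes 64-bit limbs, a shift count of 1 to 63, and that the combined routine computes rp = up + (vp << cnt) and returns the carry-out limb. The names limb_t, lshift_ref, add_n_ref and addlsh_n_sketch are hypothetical.

```c
#include <stddef.h>
#include <stdint.h>

typedef uint64_t limb_t;   /* assumption: one limb is 64 bits */

/* Pass 1 of the two-pass variant: rp = up << cnt, returns the bits shifted out.
   cnt must be in 1..63. */
limb_t lshift_ref(limb_t *rp, const limb_t *up, size_t n, unsigned cnt)
{
    limb_t shifted_out = 0;
    for (size_t i = 0; i < n; i++) {
        limb_t x = up[i];
        rp[i] = (x << cnt) | shifted_out;
        shifted_out = x >> (64 - cnt);
    }
    return shifted_out;
}

/* Pass 2 of the two-pass variant: rp = up + vp, returns the carry (0 or 1). */
limb_t add_n_ref(limb_t *rp, const limb_t *up, const limb_t *vp, size_t n)
{
    limb_t cy = 0;
    for (size_t i = 0; i < n; i++) {
        limb_t s = up[i] + vp[i];
        limb_t c1 = s < up[i];      /* carry out of the limb addition */
        limb_t r = s + cy;
        limb_t c2 = r < s;          /* carry out of adding the incoming carry */
        rp[i] = r;
        cy = c1 | c2;               /* at most one of the two can be set */
    }
    return cy;
}

/* Fused variant: rp = up + (vp << cnt) in a single pass over the operands.
   The return value combines the shifted-out bits and the final add carry. */
limb_t addlsh_n_sketch(limb_t *rp, const limb_t *up, const limb_t *vp,
                       size_t n, unsigned cnt)
{
    limb_t shifted_out = 0;
    limb_t cy = 0;
    for (size_t i = 0; i < n; i++) {
        limb_t v = vp[i];
        limb_t sh = (v << cnt) | shifted_out;
        shifted_out = v >> (64 - cnt);
        limb_t s = up[i] + sh;
        limb_t c1 = s < sh;
        limb_t r = s + cy;
        limb_t c2 = r < s;
        rp[i] = r;
        cy = c1 | c2;
    }
    return shifted_out + cy;
}
```

The fused loop reads each operand limb once and writes each result limb once, whereas the two-pass variant also stores and reloads the intermediate shifted value. That is presumably why the combined routine wins on most of the CPUs in the table.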