[Gmp-commit] /var/hg/www: 3 new changesets
mercurial at gmplib.org
mercurial at gmplib.org
Tue Apr 30 00:17:11 CEST 2013
details: /var/hg/www/rev/cb9228b7f203
changeset: 57:cb9228b7f203
user: Torbjorn Granlund <tege at gmplib.org>
date: Mon Apr 29 23:35:22 2013 +0200
description:
One more sparc item completed.
details: /var/hg/www/rev/30dc233c2048
changeset: 58:30dc233c2048
user: Torbjorn Granlund <tege at gmplib.org>
date: Mon Apr 29 23:36:23 2013 +0200
description:
Bring systems list up-to-date.
details: /var/hg/www/rev/c004de8197ac
changeset: 59:c004de8197ac
user: Torbjorn Granlund <tege at gmplib.org>
date: Tue Apr 30 00:16:59 2013 +0200
description:
Sparc addmul_1 update.
diffstat:
devel/asm.html | 14 +++++++-------
devel/sparc.html | 4 ++--
devel/testsystems.html | 9 +++++----
3 files changed, 14 insertions(+), 13 deletions(-)
diffs (94 lines):
diff -r 5e84e93031bf -r c004de8197ac devel/asm.html
--- a/devel/asm.html Mon Apr 29 18:22:29 2013 +0200
+++ b/devel/asm.html Tue Apr 30 00:16:59 2013 +0200
@@ -113,13 +113,13 @@
<tr> <td> rsh1add_n <td> <td> 4.5 <td> 5.25 <td> <td> <td> 2 <td> 2{1.67}<td>2.75{2.5}<td>3.25{2.7}<td>5.63 <td>3.1{2.67}<td>3.3{2.5}<td> 2.05 <td> 5.25 <td> 3 <td> (5) <td> #4 <td> 3.5 <td> 4.5 <td> 3.45 <td> <td> <td> (4) <td> (3.5) <td> 1.5 <td>3.64-3.7<td> 3.72 <td> 2.5[2]
<tr> <td> rsh1sub_n <td> <td> <td> <td> <td> <td> 2 <td> 2{1.67}<td>2.75{2.5}<td>3.25{2.7}<td>5.63 <td>3.1{2.67}<td>3.3{2.5}<td> 2.05 <td> 5.25 <td> 3 <td> (5) <td> #4 <td> 3.5 <td> 4.5 <td> 3.45 <td> <td> <td> (4.5) <td> (3.5) <td> 1.5 <td>3.64-3.7<td> 3.72 <td> 2.5[2]
-<tr> <td> cnd_add_n <td> 3.4 <td> 5 <td> 5.25 <td> 4.67 <td> 11 <td> 2.25 <td> 2 <td> 2.5 <td> 3.55 <td> 13 <td> 2.9 <td> 2.9 <td> 2.4 <td> 6.5 <td> 3 <td> <td> 2.25 <td> ? <td> 3 <td> ? <td> <td> <td> 3? <td> <td> 1.5 <td> 3 <td> 1.78 <td> 1.78
-<tr> <td> cnd_sub_n <td> 3.4 <td> 5 <td> 5.25 <td> 4.67 <td> 11 <td> 2.25 <td> 2 <td> 2.5 <td> 3.55 <td> 13 <td> 2.9 <td> 2.9 <td> 2.4 <td> 6.5 <td> 3 <td> <td> 2.25 <td> ? <td> 3 <td> ? <td> <td> <td> 3? <td> <td> 1.5 <td> 3 <td> 1.78 <td> 1.78
+<tr> <td> cnd_add_n <td> 3.4 <td> 5 <td> 5.25 <td> 4.67 <td> 11 <td> 2.25 <td> 2 <td> 2.5 <td> 3.55 <td> 13 <td> 2.9 <td> 2.9 <td> 2.4 <td> 6.5 <td> 3 <td> <td> 2.25 <td> ? <td> 3 <td> ? <td> <td> <td> 3? <td> <td> 1.5 <td> 3 <td> 1.78 <td> 1.78
+<tr> <td> cnd_sub_n <td> 3.4 <td> 5 <td> 5.25 <td> 4.67 <td> 11 <td> 2.25 <td> 2 <td> 2.5 <td> 3.55 <td> 13 <td> 2.9 <td> 2.9 <td> 2.4 <td> 6.5 <td> 3 <td> <td> 2.25 <td> ? <td> 3 <td> ? <td> <td> <td> 3? <td> <td> 1.5 <td> 3 <td> 1.78 <td> 1.78
<tr bgcolor="#f0f0f0"><td> mul_1 <td> 3.25 <td> 4 <td> 4.5 <td>4.16{3.75}<td>7.5 <td> 2.5 <td> 2.5 <td> 4 <td> 5 <td> 12.6 <td> 4 <td> 3.75 <td> 2.5 <td>19.75{17}<td> 4.25 <td> 6 <td> 7.25 <td> 7.25 <td> 13.5(8)<td> 2.9 <td> 18.25 <td> 68 <td> 3 <td> 2.25 <td> 2{1.5}<td> 3.25 <td> 2.25[2]<td> 2.25{1.35}
<tr bgcolor="#f0f0f0"><td> mul_1c <td> Y <td> Y <td> Y <td> Y <td> Y <td> Y <td> Y <td> Y <td> Y <td> Y <td> Y <td> Y <td> N <td> Y <td> Y <td> <td> Y <td> Y <td> Y <td> Y <td> N <td> <td> <td> N <td> [Y] <td> <td> <td>
-<tr bgcolor="#f0f0f0"><td> addmul_1 <td> 3.75 <td> 5{4} <td> 5 <td>5.21{4.75}<td>8 <td> 2.5 <td> 2.5 <td> 4.5 <td> 5 <td> 14.9 <td> 4.25 <td> 5{4} <td> 3.25 <td>21.25{19}<td> 5 <td> 9.5 <td> 8 <td> 8 <td> 12.25 <td> 3.77 <td> 17.3 <td> 74 <td>5.5(4.25)<td> 3.5 <td> 2(1.75)<td> 3.25 <td> 2 <td> 2{1.65}
-<tr bgcolor="#f0f0f0"><td> submul_1 <td> 3.75 <td> 6 <td> 6.5 <td> #5.5 <td> 8 <td> 2.5 <td> 2.5 <td> 4.5 <td> 5 <td> 14.9 <td> 4.25 <td> 5{4} <td> 3.25 <td>21.25{19}<td> 5 <td> 10.5 <td> 8.3 <td> 8.25 <td> 12.8 <td>4.9{4.3}<td> 22.75 <td> 74 <td>5.8(4.75)<td> 3.5 <td> 2.25(2)<td> 3.75 <td> 2.32 <td> 2.32(1.8)
+<tr bgcolor="#f0f0f0"><td> addmul_1 <td> 3.75 <td> 5{4} <td> 5 <td>5.21{4.75}<td>8 <td> 2.5 <td> 2.5 <td> 4.5 <td> 5 <td> 14.9 <td> 4.25 <td> 5{4} <td> 3.25 <td>21.25{19}<td> 5 <td> 9.5 <td> 8 <td> 8 <td> 12.25 <td> 3.77 <td> 17.3 <td> 74 <td> 4.25? <td> 3.5 <td> 2(1.75)<td> 3.25 <td> 2 <td> 2{1.65}
+<tr bgcolor="#f0f0f0"><td> submul_1 <td> 3.75 <td> 6 <td> 6.5 <td> #5.5 <td> 8 <td> 2.5 <td> 2.5 <td> 4.5 <td> 5 <td> 14.9 <td> 4.25 <td> 5{4} <td> 3.25 <td>21.25{19}<td> 5 <td> 10.5 <td> 8.3 <td> 8.25 <td> 12.8 <td>4.9{4.3}<td> 22.75 <td> 74 <td> 4.5? <td> 3.5 <td> 2.25(2)<td> 3.75 <td> 2.32 <td> 2.32(1.8)
<tr> <td> mul_2 <td> <td> (4) <td> (4) <td> <td> <td> 2.25 <td> 2.25 <td> 5{4} <td> 5.62{5}<td>13.5[12.3]<td>4 <td>3.83{3.67}<td>3.15 <td> 19.5 <td> 4.12 <td> <td> (4.75) <td> (4.75) <td> (5.5) <td> <td> <td> <td> 3.25(3)<td> (3) <td> 1.5 <td> 2.25 <td> #2.5{2} <td> #2.5{1}
<tr> <td> mul_3 <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> [1.333]<td> <td> <td>
<tr> <td> mul_4 <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> [1.25] <td> <td> <td>
@@ -139,8 +139,8 @@
<tr bgcolor="#f0f0f0"><td> sqr_diag_addlsh1<td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> 2 <td> <td> <td>
<tr bgcolor="#f0f0f0"><td> redc_1 <td> <td> <td> <td> <td> <td> 2.5 <td> 2.5 <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> * <td> * <td> * <td> * <td> <td> <td> <td> <td> * <td> * <td> * <td> *
<tr bgcolor="#f0f0f0"><td> redc_2 <td> <td> <td> <td> <td> <td> {2.375}<td> {2.375}<td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> * <td> * <td> * <td> *
-<tr> <td> lshift <td> 1.2 <td> 1.75 <td> 2 <td>1.75{1.46}<td>5 <td> 2.35 <td>1.8{1.3}<td>1.9{1.3}<td> 3.5{3}<td>3.33{2.7}<td> 1.27 <td>1.375[1.25]<td> 1.3 <td>4.5(2.5)<td> 3.25[2]<td> 2.25(1)<td> 2.33 <td> 2.25 <td> 4 <td> 2.15 <td> 2.5 <td> 17.5 <td> 3 <td> 1.75 <td> 1 <td> 3 <td> 2.92(1.5}<td> 1.5{>1}
-<tr> <td> rshift <td> 1.2 <td> 1.75 <td> 2 <td>1.75{1.46}<td>5 <td> 2.35 <td>1.8{1.3}<td>1.9{1.3}<td> 3.5{3}<td>3.33{2.7}<td> 1.27 <td>1.375[1.25]<td> 1.3 <td>4.5(2.5)<td> 3.25{2}<td> 2.25(1)<td> 2.33 <td> 2.25 <td> 3.5 <td> 2.15 <td> 2.5 <td> 17.5 <td> 3 <td> 1.75 <td> 1 <td> 3 <td> 2.92{1.5}<td> 1.5{>1}
+<tr> <td> lshift <td> 1.2 <td> 1.75 <td> 2 <td>1.75{1.46}<td>5 <td> 2.35 <td>1.8{1.3}<td>1.9{1.3}<td> 3.5{3}<td>3.33{2.7}<td> 1.27 <td>1.375[1.25]<td> 1.3 <td>4.5(2.5)<td> 3.25[2]<td> 2.25(1)<td> 2.33 <td> 2.25 <td> 4 <td> 2.15 <td> 2.5 <td> 17.5 <td> 3 <td> 1.75 <td> 1 <td> 3 <td>2.92(1.5}<td> 1.5{>1}
+<tr> <td> rshift <td> 1.2 <td> 1.75 <td> 2 <td>1.75{1.46}<td>5 <td> 2.35 <td>1.8{1.3}<td>1.9{1.3}<td> 3.5{3}<td>3.33{2.7}<td> 1.27 <td>1.375[1.25]<td> 1.3 <td>4.5(2.5)<td> 3.25{2}<td> 2.25(1)<td> 2.33 <td> 2.25 <td> 3.5 <td> 2.15 <td> 2.5 <td> 17.5 <td> 3 <td> 1.75 <td> 1 <td> 3 <td>2.92{1.5}<td> 1.5{>1}
<tr> <td> lshiftc <td> * <td> * <td> * <td> * <td> 5.5 <td> 2.75 <td> 2{1.5}<td>1.9{1.5}<td> 4{3.7}<td>4.15{3.5}<td> 1.5 <td> 1.75 <td> 1.45 <td> 5(3) <td>3.5{2.5}<td> 2.25 <td> 2.33 <td> 2.25 <td> 4 <td> 2.15 <td> 2.67 <td> 17 <td> 3.5 <td> * <td> 1.25 <td> 3.5 <td>3.53{1.75}<td> 1.75{>1}
<tr> <td> copyd <td> 0.75-1 <td> #2 <td> #2 <td>0.73{0.5}<td>1.75{0.5}<td>1 <td> 1[0.85]<td> 1.36 <td> 1.5 <td>2.8[2.3]<td>0.52-0.8<td>0.52-0.64<td> 0.52 <td>1.16-1.66<td> 1.1 <td> 0.75 <td> #1 <td> 1.13 <td> 1.9{1}<td> 1.4 <td> 2.5 <td> 17 <td> 2 <td> 1 <td> 0.5 <td>1.25-1.5<td> 1.25 <td> 0.52
<tr> <td> copyi <td> 0.75-1 <td> #1 <td> #1.5 <td>0.73{0.5}<td>1.75{0.5}<td>1 <td> 1[0.85]<td> 1.36 <td> 1.5 <td>2.8[2.3]<td>0.52-0.8<td>0.52-0.64<td> 0.54 <td>1.16-1.66<td> 1.1 <td> 0.75 <td> #1 <td> 1 <td> 2{1} <td> 1.4 <td> 2.5 <td> 17 <td> 2 <td> 1 <td> 0.5 <td>1.25-1.5<td> 1.25 <td> 0.52
@@ -170,7 +170,7 @@
<tr bgcolor="#f0f0f0"><td> mod_1s_2p <td> <td> <td> <td> <td> <td> 4 <td> 4 <td> 7{6.3} <td> 8.61 <td> 19 <td> 8 <td>6.5{6}<td>4.5{4}<td><strike>28</strike><td>7.65<td> <td> <td> <td> <td> <td> <td> <td> <td> <td> (4.5) <td> 4.25 <td> 3 <td> 3
<tr bgcolor="#f0f0f0"><td> mod_1s_3p <td> <td> <td> <td> <td> <td> {3} <td> {3} <td> {5.5} <td> {8} <td>{16} <td> {5.41} <td> {4.5} <td> {3} <td> <td> {5} <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td> <td>
<tr bgcolor="#f0f0f0"><td> mod_1s_4p <td>4.75{4.25}<td> 4 <td> 4.5 <td> 3.4 <td> 8.75 <td>3{2.75} <td>3{2.75} <td> 5.7{5}<td> 7.67 <td> 15.75 <td> 5 <td> 4[3.75]<td>3.25{2.5}<td>23 <td>4.75{4.17}<td>[6.5] <td> 9 <td> 9 <td> 13 <td> 3.5 <td> <td> <td> 4 <td> 3 <td> (2.25) <td> <td> <td>
-<tr bgcolor="#f0f0f0"><td> mod_34lsub1 <td> #1 <td> 1.25 <td> 1.25 <td> #1.9 <td> 2.33 <td> 0.67 <td> 0.67 <td> 1{0.5}<td> 1.125 <td> 3.2 <td> 1.25 <td> 1.15 <td> 0.93 <td> 2.45 <td> 1.25 <td> 0.87 <td> 1.5 <td> 1.32 <td> 2.35 <td> 1 <td> <td> <td> 1.67? <td> #1.67 <td> 1 <td> 1.33{1}<td>1.33{0.59}<td> 1.33{0.59}
+<tr bgcolor="#f0f0f0"><td> mod_34lsub1 <td> #1 <td> 1.25 <td> 1.25 <td> #1.9 <td> 2.33 <td> 0.67 <td> 0.67 <td> 1{0.5}<td> 1.125 <td> 3.2 <td> 1.25 <td> 1.15 <td> 0.93 <td> 2.45 <td> 1.25 <td> 0.87 <td> 1.5 <td> 1.32 <td> 2.35 <td> 1 <td> <td> <td> 1.67? <td> #1.67 <td> 1 <td> 1.33{1}<td>1.33{0.92}<td> 1.33{0.59}
<tr> <td> gcd_1 <td> 5.31/b<td> [10/b] <td> [10/b] <td> 5.09/b<td> [8.9/b]<td> 5.21/b<td> 4.30/b<td> 5.00/b<td> 6.71/b<td> 13.5/b<td> 3.83/b<td> 5.17/b<td> 4.69/b<td> 8.77/b<td> 5.44/b<td> <td> <td> <td> <td> <td> 5.00/b<td> 11.4/b <td> 6.0/b <td> 3.4/b <td> 6.35/b<td> 5.3/b <td> 3.5/b <td> 3.5/b
<tr> <td> invert_limb <td> 41 <td> <td> <td> <td> <td> 48 <td> 48 <td> 63 <td> 64 <td>135 <td> 69 <td> 55 <td> 44 <td>130 <td> 78 <td> 32 <td> 86 <td> 86 <td>170 <td> 66 <td> <td> <td> [?] <td> 71 <td> 56 <td> 43 <td> 41 <td> 41
<tr> <td> popcount <td> 5(4) <td> 3.9 <td> 4.25 <td> #4.6 <td> 5.5 <td> 6 <td> 1.125 <td>4.4{2.5}<td> 6.1 <td> 8 <td> 3.67{3}<td> 1.25 <td> 1.5{1}<td> 10.75 <td> 6.5{5}<td> 1.125 <td> 2.25 <td> {2.16} <td> <td> 2 <td> <td> <td> 2.5 <td> #1.5 <td> 1 <td> 1.13 <td> 5.67 <td> 0.56
diff -r 5e84e93031bf -r c004de8197ac devel/sparc.html
--- a/devel/sparc.html Mon Apr 29 18:22:29 2013 +0200
+++ b/devel/sparc.html Tue Apr 30 00:16:59 2013 +0200
@@ -93,8 +93,6 @@
unrolled. Commit the one which gives best performance for the critical
operand sizes.
</li>
- <li> Rewrite mpn_submul_1 for speedup 5.8 c/l → 4.75 c/l.
- </li>
<li> Explore using the mpmul instruction for mpn_mul_basecase. Since mpmul
handles just same-size operands, a GMP {up,un} × {vp,vn} multiply (where
un ≥ vn) will require an initial {up,vn} × {vp,vn} multiply, then
@@ -121,6 +119,8 @@
DONE:
<p>
<ul>
+ <li> Rewrite mpn_submul_1 for speedup 5.8 c/l → 4.5 c/l.
+ </li>
<li> Finish the cnd_aors_n.asm code.
</li>
<li> Write a generic file aorsorrlshC_n.asm for addlsh1_n, sublsh1_n,
diff -r 5e84e93031bf -r c004de8197ac devel/testsystems.html
--- a/devel/testsystems.html Mon Apr 29 18:22:29 2013 +0200
+++ b/devel/testsystems.html Tue Apr 30 00:16:59 2013 +0200
@@ -73,15 +73,16 @@
<tr> <td> repentium <td> x86-64 <td> Core 2 6400 <td> Conroe <td align="center"> 2 <td align="right"> 2160 <td align="right"> 2048 <td align="center"> N <td> fbsd
<tr> <td> hehe <td> x86-64 <td> Atom 330 <td> Diamondville <td align="center"> 2 <td align="right"> 1600 <td align="right"> 2048 <td align="center"> N <td> fbsd <td>
<tr> <td> element <td> x86-64 <td> Pentium4-4 (Xeon) <td> Nocona <td align="center"> 2 <td align="right"> 3400 <td align="right"> 8192 <td align="center"> Y <td> fbsd
-<tr> <td> nana <td> x86-64 <td> Nano L2200 <td> Nano <td align="center"> 1 <td align="right"> 1600 <td align="right"> 4096 <td align="center"> N <td> fbsd
+<tr> <td> nana <td> x86-64 <td> Nano L2200 <td> Isaiah <td align="center"> 1 <td align="right"> 1600 <td align="right"> 4096 <td align="center"> N <td> fbsd
<tbody>
<tr> <td> tambo <td> x86-32 <td> Athlon <td> K7 Barton <td align="center"> 1 <td align="right"> 2083 <td align="right"> 1024 <td align="center"> N <td> fbsd <td>
<tr> <td> parks <td> x86-32 <td> Pentium4-2 <td> Northwood <td align="center"> 1 <td align="right"> 2600 <td align="right"> 512 <td align="center"> N <td> fbsd
<tr> <td> plattityd <td> x86-32 <td> Pentium M <td> Dothan <td align="center"> 1 <td align="right"> 1867 <td align="right"> 512 <td align="center"> N <td> fbsd
<tbody>
-<tr> <td> ev56 <td> alpha <td> 21164A <td> ev56 <td align="center"> 1 <td align="right"> 600 <td align="right"> 384 <td align="center"> Y <td> fbsd <td>
+<tr> <td> ev56 <td> alpha <td> 21164A <td> EV56 <td align="center"> 1 <td align="right"> 600 <td align="right"> 384 <td align="center"> Y <td> fbsd <td>
<tbody>
-<tr> <td> titanic <td> ia-64 <td> Itanium 2 <td> Mckinley <td align="center"> 2 <td align="right"> 900 <td align="right"> 2048 <td align="center"> Y <td> gnu/linux <td> offline due to disk crash (ilo at 10.0.0.220:23)
+<tr> <td> titanic <td> ia-64 <td> Itanium 2 <td> Mckinley <td align="center"> 2 <td align="right"> 900 <td align="right"> 2048 <td align="center"> Y <td> gnu/linux <td> disk crashed (ilo at 10.0.0.220:23)
+<tr> <td> olympic <td> ia-64 <td> Itanium 2 <td> Mckinley <td align="center"> 2 <td align="right"> 900 <td align="right"> 2048 <td align="center"> Y <td> gnu/linux <td> not always powered-on (ilo at 10.0.0.221:23)
<tbody>
<tr> <td> g5 <td> ppc64 <td> PPC-970 <td> <td align="center"> 2 <td align="right"> 1800 <td align="right"> 2048 <td align="center"> N <td> macos/darwin
<tr> <td> spigg <td> ppc32 <td> PPC-7447 <td> <td align="center"> 1 <td align="right"> 1416 <td align="right"> 512 <td align="center"> N <td> gnu/linux
@@ -152,7 +153,7 @@
</div>
<div id="footer">
-<font size="-4">Last modified: 2013-04-02 </font>
+<font size="-4">Last modified: 2013-04-29 </font>
<table cellpadding=0 width="100%" bgcolor="#e8e8e8">
<tr>
<td align="center">
More information about the gmp-commit
mailing list